Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

You need to think of fake data being a more broad term than you are. If we talk about play by play for american college football you will notice how ncaa.com, espn.com, foxsports.com and others have slight differences in what a play's down/togo/time/etc is. It is not as simple as ESPN inserting an entire fake team or fake game; if you were to compare to the last example it would be a real record with a slightly modified price. I analyze college football data sets and can determine where they came from, so I have no doubt that companies can as well.

If you have enough data sources you could theoretically recreate a play by play from all of them and have a data set that would be difficult to prove was stolen from someplace in particular. I say theoretically because (at least with college football) you are often not given enough information to recreate the game (simple example would be how long a play took to execute to determine drive possession time), so often you are left using a best guess method.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: