I’d personally like the Google full scale historical cache dataset.
Google caches everything, fully backed up with every change to every website covering the last 20 years. Imagine the insight and knowledge you could gain processing that. Every lost website, every forum comment, every tweet, old reddit deleted posts. We have archive but a searchable time backtrackable complete Google cache dataset would be magical.
And you know they have it.
Keeps me up some nights just thinking about it.
What are some datasets that you can only dream of getting access to?
submitted by /u/omgsoftcats
[link] [comments]