Inspired by Mimi Ọnụọha’s The Library of Missing Datasets, this site uses Markov chains to serve as an inspiration for new datasets that yet have to be created.
The results are sometimes a bit nonsensical, but I’ve found a few gems:
- EU-funded projects in the US.
- Three centuries of UK general elections.
- 4,500 years of urbanization.
- Many millions of street addresses.
- The Marvel Cinematic Universe is the largest movie franchise in history, I counted how many people were wearing masks.
- Refugees resettled in the kitchen.
- Datasets you can play on in an antique store.
- The popularity of the world.
Datasets used to create this website:
- List of datasets published in Jeremy Singer-Vine’s Data Is Plural newsletter
- Names of top submissions from r/DataIsBeautiful and r/datasets
These datasets may or may not exist, yet.
Feedback and suggestions are welcome!
UPDATE 02/06: You can now follow @thisdatasetdoesnotexist!