Hello everyone!
I’m a data analyst/software developer. Ive built a data cleaning, processing, and analyses software but I need datasets to clean and test it out thoroughly.
I’ve used AI generated datasets, which works great but hallucinates a lot with random data after a while.
I’ve used datasets from kaggle but most of them are pretty clean.
I’m looking for any datasets in any industry to test the cleaning process. Preferably datasets that take a long time to clean and process before doing the data analysis.
CSV and xlsx file types. Anything helps! 🙂 Thanks
submitted by /u/spicytree21
[link] [comments]