Hi everyone,
I’m looking for a large e-commerce dataset (at least ~5GB) for a personal data engineering project. Ideally I’m hoping to find something with raw CSV files rather than already processed datasets.
The dataset could include things like:
- orders
- customers
- products
- order_items
- payments / transactions
- reviews or clickstream data (optional but nice to have)
I’m mainly trying to simulate a realistic transactional dataset for building a small data warehouse and running analytics queries.
Requirements:
- Size: ~5GB or larger
- Format: CSV preferred
- Structure: multiple tables
- Domain: e-commerce / retail
If you know any Kaggle datasets, public data dumps, GitHub repos, or open data sources that match this, please share.
Thanks!
submitted by /u/Historical-Web3638
[link] [comments]