I couldn’t find a realistic, ML-ready dataset for sleep analysis, so I built one.
This dataset contains:
- 100,000 records
- 32 features covering sleep, lifestyle, psychology, and health
- 3 prediction targets (regression + classification)
It is synthetic, but designed to reflect real-world patterns using research-backed correlations (e.g., stress vs sleep quality, REM vs cognition).
Some highlights:
- Occupation-based sleep patterns (12 job types)
- Non-linear relationships (e.g., effects that peak around an optimal sleep duration)
- Zero missing values (fully ML-ready)
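To sanity-check the row count, column count, and the zero-missing-values claim before modeling, a minimal sketch using only the Python standard library could look like this. The file name `sleep_health.csv` and the sample column names are assumptions for illustration, not the actual names in the download.

```python
import csv
import io
from collections import Counter


def summarize_csv(file_obj):
    """Return (row_count, column_count, missing_per_column) for an open CSV file."""
    reader = csv.DictReader(file_obj)
    columns = reader.fieldnames or []
    missing = Counter()
    rows = 0
    for row in reader:
        rows += 1
        for col in columns:
            value = row.get(col)
            # Treat absent fields and blank strings as missing.
            if value is None or value.strip() == "":
                missing[col] += 1
    return rows, len(columns), dict(missing)


# Tiny in-memory sample with hypothetical column names, one blank cell.
sample = io.StringIO("sleep_hours,stress\n7.5,3\n6.0,\n")
print(summarize_csv(sample))

# Hypothetical usage on the downloaded file (file name is an assumption):
# with open("sleep_health.csv", newline="") as f:
#     rows, cols, missing = summarize_csv(f)
```

On the real file, an empty dictionary in the third position would be consistent with there being no missing values. Note that this only catches blank cells, not placeholder strings such as "NA".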
Use cases:
- Data analysis & visualization
- Machine learning (beginner → advanced)
- Research experiments
Dataset: https://www.kaggle.com/datasets/mohankrishnathalla/sleep-health-and-daily-performance-dataset
I would appreciate any feedback!
submitted by /u/Mohan137