Hey everyone,
I made a synthetic real hybrid employee dataset with over 800000+ records. the dataset is fully synthetic so there is no personal or sensitive data but it is generated to match real-world distributions of employee metrics. it includes performance scores burnout risk satisfaction scores tenure salaries skill arrays and 12 behavioral personas. the dataset is available in json and parquet formats for easy use
you can use it for things like:
- predicting who might leave a company
- analyzing burnout hotspots
- exploring skill gaps across roles and departments
- practicing machine learning models on realistic hr data
here is the dataset link for anyone who might be interested: https://huggingface.co/datasets/BrotherTony/employee-burnout-turnover-prediction-800k
would love to hear what you think or if you make something cool with it
submitted by /u/AnyCookie10
[link] [comments]