[synthetic] [self-promotion] Synthetic Employee Dataset: 800k+ Records for Burnout, Turnover, and HR Analytics

Hey everyone,

I made a synthetic employee dataset with over 800,000 records. The dataset is fully synthetic, so it contains no personal or sensitive data, but it is generated to match real-world distributions of employee metrics. It includes performance scores, burnout risk, satisfaction scores, tenure, salaries, skill arrays, and 12 behavioral personas. The dataset is available in JSON and Parquet formats for easy use.

You can use it for things like:

  • predicting who might leave a company
  • analyzing burnout hotspots
  • exploring skill gaps across roles and departments
  • practicing machine learning models on realistic HR data
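
As a rough idea of what the burnout hotspot analysis could look like, here is a minimal sketch using only the Python standard library on records in JSON Lines form. The field names `department` and `burnout_risk` are assumptions made for illustration, as is the helper `burnout_by_department`; check the dataset card for the real column names. The sample rows are made up and are not taken from the dataset.

```python
import json
from collections import defaultdict
from statistics import mean


def burnout_by_department(records, dept_key="department", risk_key="burnout_risk"):
    """Return (department, mean risk, record count) tuples, highest mean risk first.

    dept_key and risk_key are hypothetical field names; adjust them to the
    actual schema of the dataset.
    """
    groups = defaultdict(list)
    for rec in records:
        dept = rec.get(dept_key)
        risk = rec.get(risk_key)
        if dept is None or risk is None:
            continue  # skip rows missing either field
        groups[dept].append(float(risk))
    summary = [(dept, mean(values), len(values)) for dept, values in groups.items()]
    return sorted(summary, key=lambda row: row[1], reverse=True)


# Tiny made-up sample, one JSON object per line (not real dataset rows)
sample = """
{"department": "Engineering", "burnout_risk": 0.75}
{"department": "Engineering", "burnout_risk": 0.25}
{"department": "Sales", "burnout_risk": 0.75}
{"department": "Sales", "burnout_risk": 1.0}
"""

records = [json.loads(line) for line in sample.splitlines() if line.strip()]
hotspots = burnout_by_department(records)

for dept, avg, count in hotspots:
    print(f"{dept}: mean risk {avg:.2f} over {count} records")
```

For the full 800k+ records, the Parquet files with a dataframe library would likely be more practical than looping over JSON, but the grouping logic stays the same.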

Here is the dataset link for anyone who might be interested: https://huggingface.co/datasets/BrotherTony/employee-burnout-turnover-prediction-800k

Would love to hear what you think, or if you make something cool with it.

submitted by /u/AnyCookie10