{"id":37997,"date":"2026-01-16T04:27:10","date_gmt":"2026-01-16T03:27:10","guid":{"rendered":"https:\/\/www.graviton.at\/letterswaplibrary\/dataset-for-school-incident-classification\/"},"modified":"2026-01-16T04:27:10","modified_gmt":"2026-01-16T03:27:10","slug":"dataset-for-school-incident-classification","status":"publish","type":"post","link":"https:\/\/www.graviton.at\/letterswaplibrary\/dataset-for-school-incident-classification\/","title":{"rendered":"Dataset For School Incident Classification"},"content":{"rendered":"<p><!-- SC_OFF --><\/p>\n<div class=\"md\">\n<p>Hi everyone! I\u2019m currently working on a school-related machine learning project where I\u2019m trying to classify short incident reports written in free text. The goal is to help guidance counselors sort through reports more easily by grouping them based on the type of incident and how serious it might be.<\/p>\n<p>I\u2019m using a pretty simple approach (Naive Bayes) and focusing on things like bullying, harassment, misconduct, vandalism, and facility concerns, with labels like minor or major. The model is just meant to assist with organization and prioritization (all final decisions are still made by people).<\/p>\n<p>Right now, I\u2019m looking for a public, anonymized, or synthetic dataset with short complaint- or incident-style text that I can train the model on. It doesn\u2019t have to be school-specific; anything similar (complaints, reports, misconduct descriptions, etc.) would be super helpful as long as it\u2019s ethical to use.<\/p>\n<p>Since this is an academic project, I can\u2019t use real or identifiable student data, and everything will only be used for research.<\/p>\n<p>If you know of any datasets, past projects, or even tools for generating realistic synthetic text, I\u2019d really appreciate the help. 
Thanks in advance!<\/p>\n<\/div>\n<p><!-- SC_ON -->   submitted by   <a href=\"https:\/\/www.reddit.com\/user\/Soggy_Macaron_5276\"> \/u\/Soggy_Macaron_5276 <\/a> <br \/> <span><a href=\"https:\/\/www.reddit.com\/r\/datasets\/comments\/1qe4ed0\/dataset_for_school_incident_classification\/\">[link]<\/a><\/span>   <span><a href=\"https:\/\/www.reddit.com\/r\/datasets\/comments\/1qe4ed0\/dataset_for_school_incident_classification\/\">[comments]<\/a><\/span><\/p>","protected":false},"excerpt":{"rendered":"<p>Hi everyone! 
I\u2019m currently working on a school-related machine learning project where I\u2019m trying to classify short&#8230;<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[85],"tags":[],"class_list":["post-37997","post","type-post","status-publish","format-standard","hentry","category-datatards","wpcat-85-id"],"_links":{"self":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/posts\/37997","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/comments?post=37997"}],"version-history":[{"count":0,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/posts\/37997\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/media?parent=37997"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/categories?post=37997"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/tags?post=37997"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}