Nobody Asked But I Organized The FBI NIBRS Dataset (30M+ Records) Into A Searchable Site

Hello everyone reading. I finally got around to publishing a small project I’ve been working on for the past few months.

I was experimenting with the FBI NIBRS dataset and ended up organizing about 30M+ incident records into parquet files so they’re easier to query. I used DuckDB on the backend and built a simple site to explore incidents, offenders, and victims without needing to download the raw files.

The original dataset is pretty messy and spread across a lot of tables, so most of the work was figuring out how to structure it and join everything correctly.

It’s nothing crazy, just something I built out for fun while learning more about data engineering. If anyone has suggestions on improving the schema or query performance I’d definitely like to hear your thoughts.

Repo: https://github.com/that-dog-eater/nibrs-search

submitted by /u/Empty-Individual4835
[link] [comments]

Leave a Reply

Your email address will not be published. Required fields are marked *