[Disclaimer: I created this project]
I’ve created a comprehensive, searchable database of 1.3 million Epstein-related documents scraped from DOJ Transparency Act releases, House Oversight Committee archives, and estate proceedings.
The dataset includes:
– Full-text search across all documents
– AI-powered entity extraction (238,000+ people identified)
– Document categorization and summarization
– Interactive network graphs showing connections between entities
– Crowdsourced document upload feature
All documents were processed through OpenAI’s batch API for entity extraction and summarization. The site is free to use.
Tech stack: Next.js + Postgres + D3.js for visualizations
Check it out: https://epsteingraph.com
Feedback is appreciated, I would especially be interested in thoughts on how to better showcase this data and correlate various data points. Thank you!
submitted by /u/indienow
[link] [comments]