Epstein Graph: 1.3M+ Searchable Documents From DOJ, House Oversight, And Estate Proceedings With AI Entity Extraction

[Disclaimer: I created this project]

I’ve created a comprehensive, searchable database of 1.3 million Epstein-related documents scraped from DOJ Transparency Act releases, House Oversight Committee archives, and estate proceedings.

The site includes:
– Full-text search across all documents
– AI-powered entity extraction (238,000+ people identified)
– Document categorization and summarization
– Interactive network graphs showing connections between entities
– Crowdsourced document upload feature
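
For anyone curious how the network graph data could be derived from extracted entities, here is a minimal sketch. It assumes the simplest possible rule: two entities are connected when they appear in the same document. That is not necessarily how the site computes connections. The `DocEntities` shape, the function name, and the placeholder entity names are all hypothetical.

```typescript
// Hypothetical input: one record per document with the entity names extracted from it.
interface DocEntities {
  docId: string;
  entities: string[];
}

interface GraphNode { id: string; docCount: number; }
interface GraphLink { source: string; target: string; weight: number; }

// Build a co-occurrence graph: two entities are linked when they appear in the
// same document, and the link weight counts how many documents they share.
function buildCooccurrenceGraph(
  docs: DocEntities[],
): { nodes: GraphNode[]; links: GraphLink[] } {
  const docCounts = new Map<string, number>();
  const pairCounts = new Map<string, number>();

  for (const doc of docs) {
    // Deduplicate within the document and sort so each pair has one canonical key.
    const names = Array.from(new Set(doc.entities)).sort();
    for (const name of names) {
      docCounts.set(name, (docCounts.get(name) ?? 0) + 1);
    }
    for (let i = 0; i < names.length; i++) {
      for (let j = i + 1; j < names.length; j++) {
        const key = JSON.stringify([names[i], names[j]]);
        pairCounts.set(key, (pairCounts.get(key) ?? 0) + 1);
      }
    }
  }

  const nodes = Array.from(docCounts, ([id, docCount]) => ({ id, docCount }));
  const links = Array.from(pairCounts, ([key, weight]) => {
    const [source, target] = JSON.parse(key) as [string, string];
    return { source, target, weight };
  });
  return { nodes, links };
}

// Usage with placeholder names:
const graph = buildCooccurrenceGraph([
  { docId: "d1", entities: ["Person A", "Person B", "Person A"] },
  { docId: "d2", entities: ["Person B", "Person A", "Org C"] },
]);
console.log(graph.links);
```

The `source`/`target` string ids are the shape d3-force's `forceLink` accepts once its id accessor is set to `d => d.id`. The pair loop is quadratic in the number of entities per document, so very long documents would likely need a cap or a minimum-weight filter before rendering.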

All documents were processed through OpenAI’s batch API for entity extraction and summarization. The site is free to use.
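
As an illustration of what batch processing like this involves, the sketch below builds the JSONL input file the Batch API takes. The envelope fields (`custom_id`, `method`, `url`, `body`) follow OpenAI's documented batch format. The model name, the prompt, the character cap, and the `SourceDoc` shape are placeholder assumptions, not the project's actual settings.

```typescript
// Hypothetical document shape; the real pipeline's fields are not known.
interface SourceDoc { id: string; text: string; }

// One line of a Batch API input file.
interface BatchLine {
  custom_id: string;
  method: "POST";
  url: "/v1/chat/completions";
  body: {
    model: string;
    response_format: { type: "json_object" };
    messages: { role: "system" | "user"; content: string }[];
  };
}

// Placeholder prompt: asks for entities and a short summary as JSON.
const SYSTEM_PROMPT =
  "Extract the people and organizations named in the document and write a " +
  'two-sentence summary. Reply as JSON: {"people": [], "organizations": [], "summary": ""}.';

// Assumed cap so a single huge document does not blow the context window.
const MAX_CHARS = 20000;

function toBatchLine(doc: SourceDoc, model: string = "gpt-4o-mini"): BatchLine {
  return {
    custom_id: doc.id, // used to match each result back to its document
    method: "POST",
    url: "/v1/chat/completions",
    body: {
      model,
      response_format: { type: "json_object" },
      messages: [
        { role: "system", content: SYSTEM_PROMPT },
        { role: "user", content: doc.text.slice(0, MAX_CHARS) },
      ],
    },
  };
}

// JSON.stringify escapes newlines inside the text, so each request stays on one line.
function toBatchJsonl(docs: SourceDoc[]): string {
  return docs.map((d) => JSON.stringify(toBatchLine(d))).join("\n");
}
```

The resulting file is uploaded with purpose `batch`, and a batch is then created against the `/v1/chat/completions` endpoint. Results come back as another JSONL file keyed by `custom_id`.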

Tech stack: Next.js + Postgres + D3.js for visualizations
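
For readers wondering how full-text search can work on this kind of stack, here is a rough sketch that uses Postgres's built-in text search. The `documents` table and its `id`, `title`, and `body` columns are hypothetical, and the site may well do this differently.

```typescript
// Parameterized query in the { text, values } shape that node-postgres accepts.
interface SqlQuery { text: string; values: (string | number)[]; }

// Hypothetical schema: documents(id, title, body).
function buildSearchQuery(userInput: string, limit: number = 20): SqlQuery {
  // websearch_to_tsquery parses search-engine style input (quoted phrases, OR, -term)
  // and the user's text is passed as $1, never interpolated into the SQL string.
  const text = `
    SELECT id, title,
           ts_rank(to_tsvector('english', body), query) AS rank
    FROM documents, websearch_to_tsquery('english', $1) AS query
    WHERE to_tsvector('english', body) @@ query
    ORDER BY rank DESC
    LIMIT $2`;
  return { text, values: [userInput.trim(), limit] };
}

// Usage: pass the result to a Postgres client, e.g. pool.query(buildSearchQuery("flight logs")).
const example = buildSearchQuery('  "flight logs" -redacted ', 10);
console.log(example.values);
```

At 1.3 million rows, computing `to_tsvector` per row at query time would be slow, so in practice this would likely rely on a stored `tsvector` column with a GIN index.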

Check it out: https://epsteingraph.com

Feedback is appreciated. I'm especially interested in ideas for how to better showcase this data and correlate the various data points. Thank you!

submitted by /u/indienow