Built a civic transparency platform that aggregates data from 40+ government APIs into a single SQLite database. The dataset covers 2020-present and includes:
- 4,600+ congressional stock trades (STOCK Act disclosures + House Clerk PDFs)
- 26,000+ lobbying records across 8 sectors (Senate LDA API)
- 230,000+ government contracts (USASpending.gov)
- 14,600+ PAC donations (FEC)
- 29,000+ enforcement actions (Federal Register)
- 222,000+ individual congressional vote records
- 7,300+ state legislators (all 50 states via OpenStates)
- 4,200+ patents, 60,000+ clinical trials, SEC filings
All sourced from: Congress.gov, Senate LDA, USASpending, FEC, SEC EDGAR, Federal Register, OpenFDA, EPA GHGRP, NHTSA, ClinicalTrials.gov, House Clerk disclosures, and more.
Stack: FastAPI backend, React frontend, SQLite. Code is AGPL-3.0 on GitHub.
submitted by /u/Prestigious-Wrap2341
[link] [comments]