Sharing a dataset I built.
Disclaimer: this is my project. Free to download and use.
https://huggingface.co/datasets/fineset-io/mechanistic-interpretability-papers
Stats:
– 748 records, 2022–present
– Sources: arXiv + Semantic Scholar, cross-referenced by arxiv_id and DOI
– quality_score: 0–1, citation-normalized
Fields: id, title, abstract, authors, categories, published_date, citation_count, quality_score, has_code, code_url, venue
Built with FineSet (fineset.io).
The waitlist is open if you want daily-refreshed datasets on your own topic.
submitted by /u/fineset-io
[link] [comments]