748 mechanistic interpretability papers from arXiv + Semantic Scholar; quality-scored JSONL, free

Sharing a dataset I built.

Disclaimer: this is my project. Free to download and use.

Stats:

– 748 records, 2022–present

– Sources: arXiv + Semantic Scholar, cross-referenced by arxiv_id and DOI

– quality_score: 0–1, citation-normalized

Fields: id, title, abstract, authors, categories, published_date, citation_count, quality_score, has_code, code_url, venue
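
For anyone who wants to poke at the file quickly, here is a minimal Python sketch of reading the JSONL and keeping high-scoring papers that ship code. It uses only the field names listed above. The file name `papers.jsonl`, the 0.5 threshold, and the helper names `read_jsonl` and `top_with_code` are placeholders for illustration, and it assumes `has_code` is a boolean and `quality_score` is a number (possibly missing on some records).

```python
import json
from typing import Iterable, Iterator


def read_jsonl(lines: Iterable[str]) -> Iterator[dict]:
    """Yield one dict per non-empty line of JSONL text."""
    for line in lines:
        line = line.strip()
        if line:  # skip blank lines
            yield json.loads(line)


def top_with_code(records: Iterable[dict], min_score: float = 0.5) -> list:
    """Keep records that have code and meet the score threshold, best first.

    Assumes has_code is a boolean and quality_score is a float in [0, 1];
    a missing or null score is treated as 0.0.
    """
    kept = [
        r for r in records
        if r.get("has_code") and (r.get("quality_score") or 0.0) >= min_score
    ]
    return sorted(kept, key=lambda r: r["quality_score"], reverse=True)


# Usage, assuming the download is saved as papers.jsonl (hypothetical name):
#
#     with open("papers.jsonl", encoding="utf-8") as f:
#         for paper in top_with_code(read_jsonl(f), min_score=0.5)[:10]:
#             print(paper["quality_score"], paper["title"], paper["code_url"])
```

Reading line by line keeps memory use flat, although at 748 records loading everything at once would work just as well.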

Built with FineSet (fineset.io).

The waitlist is open if you want daily-refreshed datasets on your own topic.
