748 Mechanistic Interpretability Papers From ArXiv + Semantic Scholar; Quality-scored JSONL, Free

Sharing a dataset I built.

Disclaimer: this is my project. Free to download and use.

https://huggingface.co/datasets/fineset-io/mechanistic-interpretability-papers

Stats:

– 748 records, 2022–present

– Sources: arXiv + Semantic Scholar, cross-referenced by arxiv_id and DOI

– quality_score: 0–1, citation-normalized

Fields: id, title, abstract, authors, categories, published_date, citation_count, quality_score, has_code, code_url, venue

Built with FineSet (fineset.io).

The waitlist is open if you want daily-refreshed datasets on your own topic.

submitted by /u/fineset-io
[link] [comments]

Leave a Reply

Your email address will not be published. Required fields are marked *