[OC] 100 Million Domains Ranked By Authority – Free Dataset (1.7GB, Monthly Updates)

I’ve built a dataset of 100 million domains ranked by web authority and releasing it publicly under MIT license.

Dataset: https://github.com/WebsiteLaunches/top-100-million-domains

Stats: – 100M domains ranked by authority – Updated monthly (last: Nov 15, 2025) – MIT licensed (free for any use) – Multiple size tiers: 1K, 10K, 100K, 1M, 10M, 100M – CSV format, simple ranked lists

Methodology: Rankings based on Common Crawl web graph analysis, domain age, traffic patterns, and site quality metrics from Website Launches data. Domains ordered from highest to lowest authority.

Potential uses: – ML training data for domain/web classification – SEO and competitive research – Web graph analysis – Domain investment research – Large-scale web studies

Free and open. Feedback welcome.

submitted by /u/antiochIst
[link] [comments]

Leave a Reply

Your email address will not be published. Required fields are marked *