[Self-Promotion] What Technologies Are Running On 100,000 Websites (Sept 2025- Oct 2025)

Each dataset includes

  • What technologies were detected (e.g. WordPress 4.5.3)
  • The domain it was found on
  • The page it was found on
  • The IP address associated with the page
  • Who owns the IP address
  • The geolocation for that IP address
  • The URLs found on the page
  • The meta description tags for that page
  • The size of the HTTP response
  • What protocol was used to fulfill the HTTP request
  • The date the page was crawled

September 2025: https://www.dropbox.com/scl/fi/0zsph3y6xnfgcibizjos1/sept_2025_jumbo_sample.zip?rlkey=ozmekjx1klshfp8r1y66xdtvx&e=2&st=izkt62t6&dl=0

October 2025: https://www.dropbox.com/scl/fi/xu8m2kzeu5z3wurvilb9t/oct_2025_jumbo_sample.zip?rlkey=ygusc6p42ipo0kmma8oswqf16&e=1&st=gb0hctyl&dl=0

You can find the full version of the October 2025 dataset here: https://versiondb.io

I hope you guys like it.

submitted by /u/Upper-Character-6743
[link] [comments]

Leave a Reply

Your email address will not be published. Required fields are marked *