{"id":37661,"date":"2026-01-06T05:27:08","date_gmt":"2026-01-06T04:27:08","guid":{"rendered":"https:\/\/www.graviton.at\/letterswaplibrary\/built-something-for-turning-websites-into-datasets-with-ai\/"},"modified":"2026-01-06T05:27:08","modified_gmt":"2026-01-06T04:27:08","slug":"built-something-for-turning-websites-into-datasets-with-ai","status":"publish","type":"post","link":"https:\/\/www.graviton.at\/letterswaplibrary\/built-something-for-turning-websites-into-datasets-with-ai\/","title":{"rendered":"Built Something For Turning Websites Into Datasets With AI"},"content":{"rendered":"<p><!-- SC_OFF --><\/p>\n<div class=\"md\">\n<p>I made a tool to turn websites into structured datasets using AI, mainly for cases where data only exists on web pages and not as APIs or downloads. The idea is to make it easier to repeatedly extract the same fields and build datasets over time without hand-maintaining scrapers.<\/p>\n<p>I\u2019m curious what kinds of datasets people here wish existed but are hard to create today, and whether an approach like this feels useful or too fragile for serious dataset work.<\/p>\n<p>Disclaimer: I built this tool and am sharing it for feedback, not selling datasets.<br \/> Can be found by searching Lection on chrome webstore<\/p>\n<\/div>\n<p><!-- SC_ON -->   submitted by   <a href=\"https:\/\/www.reddit.com\/user\/MarketingJaded6157\"> \/u\/MarketingJaded6157 <\/a> <br \/> <span><a href=\"https:\/\/www.reddit.com\/r\/datasets\/comments\/1q57mmy\/built_something_for_turning_websites_into\/\">[link]<\/a><\/span>   <span><a href=\"https:\/\/www.reddit.com\/r\/datasets\/comments\/1q57mmy\/built_something_for_turning_websites_into\/\">[comments]<\/a><\/span><\/p><div class='watch-action'><div class='watch-position align-right'><div class='action-like'><a class='lbg-style1 like-37661 jlk' href='javascript:void(0)' data-task='like' data-post_id='37661' data-nonce='65e0e39b87' rel='nofollow'><img class='wti-pixel' src='https:\/\/www.graviton.at\/letterswaplibrary\/wp-content\/plugins\/wti-like-post\/images\/pixel.gif' title='Like' \/><span class='lc-37661 lc'>0<\/span><\/a><\/div><\/div> <div class='status-37661 status align-right'><\/div><\/div><div class='wti-clear'><\/div>","protected":false},"excerpt":{"rendered":"<p>I made a tool to turn websites into structured datasets using AI, mainly for cases where data&#8230;<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[85],"tags":[],"class_list":["post-37661","post","type-post","status-publish","format-standard","hentry","category-datatards","wpcat-85-id"],"_links":{"self":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/posts\/37661","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/comments?post=37661"}],"version-history":[{"count":0,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/posts\/37661\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/media?parent=37661"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/categories?post=37661"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/tags?post=37661"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}