{"id":35195,"date":"2025-08-28T15:27:11","date_gmt":"2025-08-28T13:27:11","guid":{"rendered":"https:\/\/www.graviton.at\/letterswaplibrary\/need-massive-collections-of-schemas-for-ai-training-any-bulk-sources\/"},"modified":"2025-08-28T15:27:11","modified_gmt":"2025-08-28T13:27:11","slug":"need-massive-collections-of-schemas-for-ai-training-any-bulk-sources","status":"publish","type":"post","link":"https:\/\/www.graviton.at\/letterswaplibrary\/need-massive-collections-of-schemas-for-ai-training-any-bulk-sources\/","title":{"rendered":"Need Massive Collections Of Schemas For AI Training &#8211; Any Bulk Sources?"},"content":{"rendered":"<p><!-- SC_OFF --><\/p>\n<div class=\"md\">\n<p>looking for massive collections of schemas\/datasets for AI training &#8211; mainly financial and ecommerce domains but really need vast quantities from all sectors. need structured data formats that I can use to train models on things like transaction patterns, product recommendations, market analysis etc. talking like thousands of different schema types here. anyone have good sources for bulk schema collections? even pointers to where people typically find this stuff at scale would be helpful<\/p>\n<\/div>\n<p><!-- SC_ON -->   submitted by   <a href=\"https:\/\/www.reddit.com\/user\/Fragrant-Dog-3706\"> \/u\/Fragrant-Dog-3706 <\/a> <br \/> <span><a href=\"https:\/\/www.reddit.com\/r\/datasets\/comments\/1n2c5xb\/need_massive_collections_of_schemas_for_ai\/\">[link]<\/a><\/span>   <span><a href=\"https:\/\/www.reddit.com\/r\/datasets\/comments\/1n2c5xb\/need_massive_collections_of_schemas_for_ai\/\">[comments]<\/a><\/span><\/p><div class='watch-action'><div class='watch-position align-right'><div class='action-like'><a class='lbg-style1 like-35195 jlk' href='javascript:void(0)' data-task='like' data-post_id='35195' data-nonce='65e0e39b87' rel='nofollow'><img class='wti-pixel' src='https:\/\/www.graviton.at\/letterswaplibrary\/wp-content\/plugins\/wti-like-post\/images\/pixel.gif' title='Like' \/><span class='lc-35195 lc'>0<\/span><\/a><\/div><\/div> <div class='status-35195 status align-right'><\/div><\/div><div class='wti-clear'><\/div>","protected":false},"excerpt":{"rendered":"<p>looking for massive collections of schemas\/datasets for AI training &#8211; mainly financial and ecommerce domains but really&#8230;<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[85],"tags":[],"class_list":["post-35195","post","type-post","status-publish","format-standard","hentry","category-datatards","wpcat-85-id"],"_links":{"self":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/posts\/35195","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/comments?post=35195"}],"version-history":[{"count":0,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/posts\/35195\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/media?parent=35195"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/categories?post=35195"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/tags?post=35195"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}