[self-promotion] Train LLMs With Text-rich Data Covering SEC Corporate Filings, US Patent Grants, And US Gov’t Contracts

Cybersyn LLM Training Essentials just launched on Snowflake Marketplace. SEC corporate filings, US patent grants, and US government contracts are all in the public domain, but they are gated and not included in traditional Common Crawl-like datasets. Use the product for LLM training, fine-tuning, and inference, and let us know what you think.

submitted by /u/aiatco2
