Hi there,
I’m the co-founder of a startup that specialises in creating custom datasets for AI.
We are currently growing and want to invest in a few datasets that we will offer to the AI community. We will build up to 3 datasets and make them available on HuggingFace over the coming months.
So I thought I'd ask the community: what dataset do you think is difficult to find and would help your LLM fine-tuning use cases? Our clients often ask us for coding datasets (e.g. prompts and responses about how to develop in C++), but it could be anything.
Let me know your thoughts!
Cheers.
submitted by /u/Any-Adagio-6174