Title.
I’m currently training a small LLM (a ~192.8M RWKV v6 model) for edge RP (role-playing on phones, tablets, weak laptops, etc.). I’ve already written the full inference stack for Android: the UI in Java, with the backend in C and C++ via JNI, for both CPU and GPU. Now I wanna get some new, really good datasets (even if they’re small). I don’t really care if they’re synthetic, human-made, mixed, or human with AI, cuz I only care that they’re good enough. Even better if they’re available via the datasets Python lib (i.e. the dataset is hosted on huggingface.co).
Thanks!
EDIT: Please mark whether it’s in English, in Ukrainian (there are almost no RP datasets in Ukrainian), or multilingual.
submitted by /u/Lines25