Hi everyone,
I’m working on a research project comparing LLM-generated text with human-written text. Does anyone know of a validated dataset (with a DOI) that includes both? If not, could you share tips on creating one?
LLM text: Which models and prompts work best for generating diverse samples?
Human text: What are reliable sources of high-quality text?
Validation: How can I keep the two classes balanced and avoid bias?
Any help or pointers would be greatly appreciated! Thanks in advance.
submitted by /u/National_Evidence548