Looking For Dataset: LLM-Generated Vs. Human Text

Hi everyone,

I’m working on a research project comparing LLM-generated text with human-written text. Does anyone know of a validated dataset (with DOI) that includes both? If not, could you share tips on creating one?

LLM text: Which models and prompts work best for generating diverse samples?
Human text: What are reliable sources of high-quality human-written text?
Validation: How can I keep the two classes balanced and avoid bias?

Any help or pointers would be greatly appreciated! Thanks in advance.

submitted by /u/National_Evidence548