Seeking Open Public Medical Datasets For LLM Finetuning

Good evening, community. This is my first post; if I break a rule, please let me know.

I’m working on MedeX v25.8.3, a clinical assistant aimed at professional use with an educational mode. I’m looking for public, open medical datasets for finetuning.

Ideal traits: clear licenses, solid annotations, documented pipelines, population diversity, common formats (CSV/JSON/DICOM), and standard benchmarks/splits.

Disclosure: I’m the developer of MedeX. I’ll add the repo in the first comment if the sub allows.

submitted by /u/DeepRatAI
[link] [comments]

Leave a Reply

Your email address will not be published. Required fields are marked *