Need Large-Scale Indian Audio/Music Dataset (100k+ Hours) For AI/ML Training

Hi Everyone,

I’m looking for large-scale Indian audio/music datasets (100,000+ hours preferred) mainly containing:
– Indian songs/music
– Vocals
– Bollywood music
– Regional language audio
– Speech + music mixed data
– Instrumental/music tracks

The purpose is AI/ML training and audio research.

I’m okay with both:
– Commercial datasets
– Non-commercial/free datasets

I would appreciate suggestions for:
– Indian music datasets
– Open-source audio datasets
– Hugging Face/Kaggle datasets
– Large audio archives
– APIs/platforms with Indian audio
– Any legal bulk audio source

If anyone has worked on similar projects or knows good sources, please share links/suggestions.

Thanks!

submitted by /u/No_Wafer_2023