Need free, high-quality audio datasets for tasks like speech recognition, sound classification, or environmental noise analysis—ideally with labels, metadata, and permissive licenses (CC0 or similar). Does anyone have recommendations for sources beyond Hugging Face (Common Voice, AudioSet) or Kaggle? Bonus if they’re preprocessed or good for big data tools like Spark/Hadoop. Links, sizes, and usage tips appreciated.
submitted by /u/yobigp
[link] [comments]