MLCommons and Hugging Face Release Massive Speech Dataset for AI Research

Archyde The Promise and Peril of Massive voice Datasets for AI Table of Contents 1. The Promise and Peril of Massive voice Datasets for AI 2. how can we ensure informed consent from individuals whose voices are included in the unsupervised People’s Speech dataset, especially regarding potential commercial uses? 3. The Promise and Peril of … Read more

MLCommons and Hugging Face team up to release massive speech data set for AI research

MLCommons, a nonprofit AI safety working group, has teamed up with AI dev platform Hugging Face to release one of the world’s largest collections of public domain voice recordings for AI research. The data set, called Unsupervised People’s Speech, contains more than a million hours of audio spanning at least 89 different languages. MLCommons says … Read more