Project Euphonia: advancing inclusive speech recognition through expanded data collection and evaluation

Frontiers in Language Sciences, 2025, Vol. 4

Alicia R. Martin, Robert L. Macdonald, Pan-Pan Jiang, Marilyn Ladewig, Julie Cattiau, Rus Heywood, Richard Cave, Jimmy Tobin, Philip Nelson, Katrin Tomanek

Abstract

Speech recognition models, predominantly trained on standard speech, often exhibit lower accuracy for individuals with accents, dialects, or speech impairments. This disparity is particularly pronounced for economically or socially marginalized communities, including those with disabilities or diverse linguistic backgrounds. Project Euphonia, a Google initiative dedicated to improving Automatic Speech Recognition (ASR) of disordered speech, was originally launched in English and is now expanding its data collection and evaluation efforts to international languages such as Spanish, Japanese, French, and Hindi, in a continued effort to enhance inclusivity. This paper presents an overview of how the processes and methods used for English data collection were extended to more languages and locales, reports progress on the collected data, and details our model evaluation process, which focuses on meaning preservation and is based on Generative AI.
