Automatic language identification using deep neural networks
2014pp. 5337–5341
Citations Over TimeTop 1% of 2014 papers
Ignacio López Moreno, Javier Gónzalez-Domínguez, Oldřich Plchot, David Martínez, Joaquín González-Rodríguez, Pedro J. Moreno
Abstract
This work studies the use of deep neural networks (DNNs) to address automatic language identification (LID). Motivated by their recent success in acoustic modelling, we adapt DNNs to the problem of identifying the language of a given spoken utterance from short-term acoustic features. The proposed approach is compared to state-of-the-art i-vector based acoustic systems on two different datasets: Google 5M LID corpus and NIST LRE 2009. Results show how LID can largely benefit from using DNNs, especially when a large amount of training data is available. We found relative improvements up to 70%, in C avg , over the baseline system.
Related Papers
- → Development of an approach to automatic language identification based on phone recognition(1996)24 cited
- → Spoken Language Characterization(2007)11 cited
- → Automatic language identification based on Gaussian mixture model and universal background model(2003)1 cited
- Robust Techniques Of Language Modeling For Spoken Language Identification(2009)
- → Robust Open-Set Spoken Language Identification and the CU MultiLang Dataset(2023)