Yi Language Speech Recognition using Deep Learning Methods
Citations Over Time
Abstract
Yi language is one of the most representative languages in the Yi branch of the Tibetan-Burmese language family. To strengthen the protection of Yi language, an endangered minority language, this paper covers the continuous speech recognition of Yi language with different deep learning methods to achieve the optimal recognition performance. In the training stage, we first analyzed the Yi language text to realize the language model modeling, and then trained the audio files based on the four acoustic models: hidden Markov model (HMM), deep neural network (DNN), time-delay neural network (TDNN) and end-to-end. In the recognition stage, after matching the language model according to the acoustic model, we obtained the Yi recognized text by combining the dictionary and acoustic feature vector for joint recognition and decoding. Compared with the word error rates of acoustic models, the time-delay neural network is the best in existed Yi corpus, only 16.65%.
Related Papers
- → Incorporating language constraints in sub-word based speech recognition(2005)38 cited
- → Recurrent neural network-based language modeling for an automatic Russian speech recognition system(2015)10 cited
- → Language model adaptation in speech recognition using document maps(2003)3 cited
- → Sub-world language modelling for Turkish speech recognition(2005)2 cited
- → Style Adaptation of Statistical Language Models(2000)