Large population speaker identification using clean and telephone speech
Citations over time: top 16% of 1995 papers
Abstract
This paper presents text-independent speaker identification results for speaker population sizes of up to 630 speakers, for both clean, wideband speech and telephone speech. A system based on Gaussian mixture speaker models is used for speaker identification, and experiments are conducted on the TIMIT and NTIMIT databases. The TIMIT results show large-population performance under near-ideal conditions, and the NTIMIT results show the corresponding accuracy loss due to telephone transmission. These are believed to be the first speaker identification experiments on the complete 630-speaker TIMIT and NTIMIT databases and the largest text-independent speaker identification task reported to date. Identification accuracies of 99.5% and 60.7% were achieved on the TIMIT and NTIMIT databases, respectively.
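The abstract does not spell out the scoring rule, so the following is only a minimal sketch of how closed-set identification with Gaussian mixture speaker models is commonly done: each speaker is represented by one mixture model, every test frame is scored against every model, and the speaker whose model gives the highest total log-likelihood is chosen. Diagonal covariances, the two-dimensional toy features, the speaker names `spk_a` and `spk_b`, and all model parameters are assumptions made for illustration; they are not taken from the paper, and no model training is shown.

```python
import math


def log_gmm_density(x, weights, means, variances):
    """Log density of one feature vector under a diagonal-covariance GMM."""
    component_logs = []
    for w, mu, var in zip(weights, means, variances):
        # log(weight) + log N(x; mu, diag(var)) for this mixture component
        log_p = math.log(w)
        for xi, mi, vi in zip(x, mu, var):
            log_p += -0.5 * (math.log(2.0 * math.pi * vi) + (xi - mi) ** 2 / vi)
        component_logs.append(log_p)
    # log-sum-exp over components for numerical stability
    peak = max(component_logs)
    return peak + math.log(sum(math.exp(c - peak) for c in component_logs))


def identify_speaker(frames, speaker_models):
    """Return the best-scoring speaker and the per-speaker total log-likelihoods."""
    scores = {
        name: sum(log_gmm_density(x, *model) for x in frames)
        for name, model in speaker_models.items()
    }
    return max(scores, key=scores.get), scores


# Hypothetical models: (weights, means, variances) per speaker, made up for the demo.
speaker_models = {
    "spk_a": ([0.5, 0.5], [[0.0, 0.0], [1.0, 1.0]], [[1.0, 1.0], [1.0, 1.0]]),
    "spk_b": ([0.5, 0.5], [[4.0, 4.0], [5.0, 5.0]], [[1.0, 1.0], [1.0, 1.0]]),
}

# Hypothetical test frames lying near spk_a's component means.
test_frames = [[0.2, -0.1], [0.9, 1.2], [0.4, 0.6]]

best, scores = identify_speaker(test_frames, speaker_models)
print(best, scores)
```

Because the decision is an argmax over all enrolled speakers, the cost of one identification grows linearly with the population size, which is one reason a 630-speaker task is a demanding test of this kind of system.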