Text-dependent speaker identification using hidden Markov model with stress compensation technique
Citations Over TimeTop 10% of 2002 papers
Abstract
We present an algorithm for an isolated-word text-dependent speaker identification under normal and four stressful styles. The styles which are designed to simulate speech produced under real stressful conditions are: shout, slow, loud, and soft. The algorithm is based on the hidden Markov model (HMM) with a cepstral stress compensation technique. Comparing the HMM without cepstral stress compensation with the HMM combined with cepstral stress compensation, the recognition rate has improved with a little increase in the computations. The recognition rate has improved: from 90% to 93% in normal style, from 19% to 73% in shout style, from 62% to 84% in slow style, from 38% to 75% in loud style, and from 30% to 81% in soft style. The cepstral coefficients and transitional coefficients are combined to form an observation vector of the hidden Markov model. This algorithm is tested on a limited number of speakers due to our limited data base.
Related Papers
- → Infant cry recognition based on feature extraction(2010)3 cited
- → Pitch-based cepstral features for gender classification in noisy environments(2013)1 cited
- → The classification of emotion based on human voice by using Mel Frequency Cepstrum Coefficient (MFCC) and Naive Bayes method(2023)1 cited
- A Robust Mel-frequency Cepstrum Coefficients(2008)
- Application of Biomimetic Technology to Feature Extraction from Acoustic Objects(2014)