Generalized mel frequency cepstral coefficients for large-vocabulary speaker-independent continuous-speech recognition
Citations Over TimeTop 10% of 1999 papers
Abstract
The focus of a continuous speech recognition process is to match an input signal with a set of words or sentences according to some optimality criteria. The first step of this process is parameterization, whose major task is data reduction by converting the input signal into parameters while preserving virtually all of the speech signal information dealing with the text message. This contribution presents a detailed analysis of a widely used set of parameters, the mel frequency cepstral coefficients (MFCCs), and suggests a new parameterization approach taking into account the whole energy zone in the spectrum. Results obtained with the proposed new coefficients give a confidence interval about their use in a large-vocabulary speaker-independent continuous-speech recognition system.
Related Papers
- → Infant cry recognition based on feature extraction(2010)3 cited
- → Infant asphyxia detection using autoencoders trained on locally linear embedded-reduced Mel Frequency Cepstrum Coefficient (MFCC) features(2018)2 cited
- → Pitch-based cepstral features for gender classification in noisy environments(2013)1 cited
- A Robust Mel-frequency Cepstrum Coefficients(2008)
- Application of Biomimetic Technology to Feature Extraction from Acoustic Objects(2014)