Factors responsible and phases of speaker recognition system

2022pp. 185–211

Abstract

The method of identifying a speaker based on his or her speech is known as automatic speaker recognition. Speaker/voice recognition is a biometric sensory device that recognizes people by their voices. Most speaker recognition systems nowadays are focused on spectral information, which means they use spectral information derived from speech signal segments of 10-30 ms in length. However, if the received speech signal contains some noise, the cepstral-based system's output suffers. The primary goal of the study is to see the various factors responsible for improved performance of the speaker recognition systems by modeling prosodic features, and phases of speaker recognition system. Furthermore, in the presence of background noise, the analysis focused on a text-independent speaker recognition system.

Related Papers

→ An Information Theoretic Combination of MFCC and TDOA Features for Speaker Diarization(2010)30 cited
→ Infant cry recognition based on feature extraction(2010)3 cited
→ Pitch-based cepstral features for gender classification in noisy environments(2013)1 cited
A Robust Mel-frequency Cepstrum Coefficients(2008)
Application of Biomimetic Technology to Feature Extraction from Acoustic Objects(2014)