GMM-UBM Modeling for Speaker Recognition on a Romanian Large Speech Corpora
Citations Over Time
Abstract
In recent years, non-intrusive biometric identification systems have evolved very quickly. Their main advantage over classical systems, based on usernames and passwords, is the fact that the anatomical features are less exposed to stealing or losing. In this context, this study deals with speaker recognition - one of the most fashionable biometric technologies. It presents speaker verification and speaker identification experiments carried out on Romanian speech, using the GMM-UBM framework in different configurations. The experiments are performed on a corpus about 5 times larger than other previous attempts for Romanian language. It comprises connected digits uttered in Romanian by over 120 speakers. The results show that the false rejection rate is close to 0% and false acceptance rate is around 1.50%. For speaker identification, the results are relatively good for the closed-set scenario (identification error < 1%), but not still acceptable for the open-set scenario. The error rates were computed in different configurations, error trends being obtained depending on the variation of some main system parameters.
Related Papers
- → A new set of features for text-independent speaker identification(2006)30 cited
- → Hierarchical speaker identification using speaker clustering(2004)24 cited
- → Speaker Recognition and Diarization(2010)3 cited
- A Semi-Automatic Approach to Create Large Gender-and Age-Balanced Speaker Corpora: Usefulness of Speaker Diarization & Identification(2022)
- → VoxBlink2: A 100K+ Speaker Recognition Corpus and the Open-Set Speaker-Identification Benchmark(2024)