State-of-the-Art Performance in Text-Independent Speaker Verification Through Open-Source Software
Citations Over TimeTop 10% of 2007 papers
Abstract
This paper illustrates an evolution in state-of-the-art speaker verification by highlighting the contribution from newly developed techniques. Starting from a baseline system based on Gaussian mixture models that reached state-of-the-art performances during the NIST'04 SRE, final systems with new intersession compensation techniques show a relative gain of around 50%. This work highlights that a key element in recent improvements is still the classical maximum a posteriori (MAP) adaptation, while the latest compensation methods have a crucial impact on overall performances. Nuisance attribute projection (NAP) and factor analysis (FA) are examined and shown to provide significant improvements. For FA, a new symmetrical scoring (SFA) approach is proposed. We also show further improvement with an original combination between a support vector machine and SFA. This work is undertaken through the open-source ALIZE toolkit.
Related Papers
- → Uncertainty in NIST force measurements(2005)21 cited
- → Primary atomic frequency standards at NIST(2001)31 cited
- → Assessing the Impact of the National Institute of Standards and Technology’s Research Collaborations(2014)1 cited
- → Evaluation of National Institute of Standards and Technology (NIST) Thermal Conductivity Standard Reference Materials Using a 20-cm Heat Flow Meter Apparatus(1989)1 cited
- → Technique for assessing PNA measurement repeatability using a NIST standard(2005)1 cited