Speaker Verification Using Support Vector Machines and High-Level Features
Abstract
High-level characteristics such as word usage, pronunciation, phonotactics, and prosody have seen a resurgence in automatic speaker recognition over the last several years. With the availability of many conversation sides per speaker in current corpora, high-level systems now have the amount of data needed to sufficiently characterize a speaker. Although a significant amount of work has been done on finding novel high-level features, less work has been done on modeling these features. We describe a method of speaker modeling based upon support vector machines. Current high-level feature extraction produces sequences or lattices of tokens for a given conversation side. These sequences can be converted to counts and then to frequencies of n-grams for a given conversation side. We use support vector machine modeling of these n-gram frequencies for speaker verification. We derive a new kernel based upon linearizing a log-likelihood ratio scoring system. Generalizations of this method are shown to produce excellent results on a variety of high-level features. We demonstrate that our methods produce results significantly better than standard log-likelihood ratio modeling. We also demonstrate that our system can perform well in conjunction with standard cepstral speaker recognition systems.
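The conversion from token sequences to n-gram frequencies, and the idea of comparing two conversation sides through a kernel on those frequencies, can be illustrated with a minimal Python sketch. The abstract does not state the form of the kernel, so the weighting used here (each n-gram's contribution divided by its frequency in a background set) is an assumption made for illustration, as are all function names and the toy token sequences.

```python
from collections import Counter


def ngrams(tokens, n=2):
    """Return the list of n-grams (as tuples) in one token sequence."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def ngram_frequencies(tokens, n=2):
    """Convert one conversation side into n-gram relative frequencies."""
    counts = Counter(ngrams(tokens, n))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {gram: count / total for gram, count in counts.items()}


def background_frequencies(sides, n=2):
    """Pool n-gram counts over many conversation sides (hypothetical background set)."""
    counts = Counter()
    for tokens in sides:
        counts.update(ngrams(tokens, n))
    total = sum(counts.values())
    return {gram: count / total for gram, count in counts.items()}


def weighted_kernel(freq_a, freq_b, background):
    """Weighted inner product of two frequency vectors.

    Assumed form (not taken from the abstract): the sum over shared
    n-grams of p_a * p_b / p_background, so that rare n-grams count more.
    """
    shared = freq_a.keys() & freq_b.keys()
    return sum(
        freq_a[gram] * freq_b[gram] / background[gram]
        for gram in shared
        if background.get(gram, 0.0) > 0.0
    )


# Toy token sequences standing in for two conversation sides.
side_a = ["a", "b", "a", "b"]
side_b = ["a", "b", "c"]
background = background_frequencies([side_a, side_b])
score = weighted_kernel(
    ngram_frequencies(side_a), ngram_frequencies(side_b), background
)
```

Because this kernel is an inner product of rescaled frequency vectors, a matrix of such scores between conversation sides could be passed to any support vector machine trainer that accepts a precomputed kernel.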