Target Speaker Recognition in The Cocktail Party
Abstract
The rise of deep learning has accelerated progress in speaker recognition, and the spread of personalized devices has increased the importance of target speaker recognition (TSR). In particular, the target speaker must be recognized correctly even when several speakers talk at the same time. This paper proposes and evaluates two TSR methods for multi-speaker environments: (1) target speaker extraction (TSE) is applied to the input speech before speaker recognition; (2) the results of (1) are fused with those of the ordinary speaker recognition. Of the two, the fusion-based method performed better, with a relative performance improvement of at least 11% over an ordinary speaker recognition system.
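The abstract does not say how the two results are fused, so the following is only a minimal sketch of one common choice: a weighted sum of cosine verification scores. The function names, the `weight` parameter, and the toy embeddings are assumptions made for illustration, not the authors' method. The speaker-embedding extractor and the TSE front end are assumed to exist elsewhere and are represented here only by their output vectors.

```python
import math
from typing import Sequence


def cosine_score(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity between two speaker embeddings."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("embeddings must be non-zero")
    return dot / (norm_a * norm_b)


def fused_score(
    enroll_emb: Sequence[float],
    mixture_emb: Sequence[float],
    tse_emb: Sequence[float],
    weight: float = 0.5,
) -> float:
    """Hypothetical score-level fusion (an assumption, not the paper's rule).

    mixture_emb: embedding of the raw multi-speaker input.
    tse_emb:     embedding of the same input after target speaker extraction.
    weight:      share given to the TSE-based score, in [0, 1].
    """
    score_mixture = cosine_score(enroll_emb, mixture_emb)
    score_tse = cosine_score(enroll_emb, tse_emb)
    return weight * score_tse + (1.0 - weight) * score_mixture


def relative_improvement(baseline_error: float, new_error: float) -> float:
    """Relative reduction of an error metric with respect to the baseline."""
    return (baseline_error - new_error) / baseline_error


if __name__ == "__main__":
    # Toy vectors and error rates, chosen only to exercise the functions.
    enroll = [1.0, 0.0]
    print(fused_score(enroll, [0.0, 1.0], [1.0, 0.0], weight=0.5))
    print(relative_improvement(10.0, 8.9))
```

With `weight=1.0` the sketch reduces to method (1), recognition on the extracted speech alone, and with `weight=0.0` it reduces to the ordinary system. A relative improvement is measured against the baseline error, so under these illustrative numbers an error metric falling from 10.0 to 8.9 corresponds to a gain of about 11%.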