CHiME-6 Challenge: Tackling Multispeaker Speech Recognition for Unsegmented Recordings
Abstract
Following the success of the 1st, 2nd, 3rd, 4th and 5th CHiME challenges, we organize the 6th CHiME Speech Separation and Recognition Challenge (CHiME-6). The new challenge revisits the previous CHiME-5 challenge and further considers the problem of distant multi-microphone conversational speech diarization and recognition in everyday home environments. The speech material is the same as the previous CHiME-5 recordings, except that the arrays are now accurately synchronized. The material was elicited using a dinner party scenario, with efforts taken to capture data that is representative of natural conversational speech. This paper provides a baseline description of the CHiME-6 challenge for both segmented multispeaker speech recognition (Track 1) and unsegmented multispeaker speech recognition (Track 2). Of note, Track 2 is the first challenge activity in the community to tackle an unsegmented multispeaker speech recognition scenario with a complete set of reproducible open source baselines providing speech enhancement, speaker diarization, and speech recognition modules.