CHiME-6 Challenge:Tackling Multispeaker Speech Recognition for Unsegmented Recordings
Citations Over Time
Abstract
Following the success of the 1st, 2nd, 3rd, 4th and 5th CHiME challenges we organize the 6th CHiME Speech Separation and Recognition Challenge (CHiME-6). The new challenge revisits the previous CHiME-5 challenge and further considers the problem of distant multi-microphone conversational speech diarization and recognition in everyday home environments. Speech material is the same as the previous CHiME-5 recordings except for accurate array synchronization. The material was elicited using a dinner party scenario with efforts taken to capture data that is representative of natural conversational speech. This paper provides a baseline description of the CHiME-6 challenge for both segmented multispeaker speech recognition (Track 1) and unsegmented multispeaker speech recognition (Track 2). Of note, Track 2 is the first challenge activity in the community to tackle an unsegmented multispeaker speech recognition scenario with a complete set of reproducible open source baselines providing speech enhancement, speaker diarization, and speech recognition modules.
Related Papers
- → Leveraging speaker diarization for meeting recognition from distant microphones(2010)19 cited
- → Unsupervised text independent speaker classification(2002)15 cited
- → Speaker Recognition and Diarization(2010)3 cited
- → Unsupervised Speaker Adaptation Using Speaker-Class Models for Lecture Speech Recognition(2010)
- → Overview of Speaker Identification with Privacy(2012)