An algorithm for multi-pitch tracking in co-channel speech
Abstract
Most multi-pitch algorithms are evaluated only in voiced regions of speech and tend to produce pitch estimates even when the participating speakers are unvoiced. This paper presents a multi-pitch algorithm that detects the voiced and unvoiced regions in a mixture of two speakers, identifies the number of speakers in voiced regions, and yields the pitch estimates of each speaker in those regions. The algorithm relies on the two-dimensional average magnitude difference function (2-D AMDF) to estimate the periodicity of the signal, and uses the temporal evolution of the 2-D AMDF to estimate the number of speakers present in periodic regions. Frame-wise evaluation of the algorithm demonstrates accurate voiced/unvoiced decisions and pitch estimation results comparable to the state of the art. The pitch estimation errors are analyzed quantitatively and shown to result partly from speaker domination and pitch matching between speakers.
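To make the periodicity measure concrete, the following is a minimal sketch of a 2-D AMDF. The abstract does not give the formula, so the sketch assumes one common formulation: the mean absolute value of the double difference x[n] - x[n+k] - x[n+l] + x[n+k+l], which vanishes when the frame is the sum of two signals with periods k and l. The function names `amdf_2d` and `estimate_two_periods` and the `min_lag`/`max_lag` parameters are hypothetical, and the sketch does not cover the voiced/unvoiced decision or the speaker-count estimation described in the abstract.

```python
import numpy as np


def amdf_2d(x, max_lag):
    """Assumed 2-D AMDF: mean |x[n] - x[n+k] - x[n+l] + x[n+k+l]| over the frame.

    Returns a (max_lag + 1) x (max_lag + 1) surface indexed by the lag pair (k, l).
    """
    x = np.asarray(x, dtype=float)
    n = len(x) - 2 * max_lag  # samples usable at every lag pair
    if n <= 0:
        raise ValueError("frame must be longer than 2 * max_lag samples")
    surface = np.empty((max_lag + 1, max_lag + 1))
    base = x[:n]
    for k in range(max_lag + 1):
        for l in range(max_lag + 1):
            diff = base - x[k:k + n] - x[l:l + n] + x[k + l:k + l + n]
            surface[k, l] = np.mean(np.abs(diff))
    return surface


def estimate_two_periods(x, min_lag, max_lag):
    """Pick the lag pair (k, l), k <= l, that minimizes the 2-D AMDF.

    Lags below min_lag are excluded because the surface is trivially
    zero whenever k = 0 or l = 0.
    """
    surface = amdf_2d(x, max_lag)
    surface[:min_lag, :] = np.inf
    surface[:, :min_lag] = np.inf
    # The surface is symmetric in (k, l), so keep only k <= l.
    surface[np.tril_indices(max_lag + 1, k=-1)] = np.inf
    k, l = np.unravel_index(np.argmin(surface), surface.shape)
    return int(k), int(l)


# Synthetic frame: two sinusoids with periods of 50 and 80 samples.
n = np.arange(400)
frame = np.sin(2 * np.pi * n / 50) + 0.8 * np.sin(2 * np.pi * n / 80)
periods = estimate_two_periods(frame, min_lag=20, max_lag=90)
```

On this synthetic frame the minimum falls at the lag pair matching the two periods. Real speech would need framing, a plausible pitch-lag range for the sampling rate, and a threshold on the minimum to separate periodic from aperiodic frames; those choices are not specified here.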