Speech segmentation and spoken document processing
IEEE Signal Processing Magazine2008Vol. 25(3), pp. 59–69
Citations Over TimeTop 10% of 2008 papers
Mari Ostendorf, Benoît Favre, Ralph Grishman, Dilek Hakkani‐Tür, Mary P. Harper, Dustin Hillard, Julia Hirschberg, Heng Ji, Jeremy G. Kahn, Yang Liu, Sameer Maskey, Evgeny Matusov, Hermann Ney, Andrew Rosenberg, Elizabeth Shriberg, Wen Wang, Chuck Wooters
Abstract
Progress in both speech and language processing has spurred efforts to support applications that rely on spoken rather than written language input. A key challenge in moving from text-based documents to such spoken documents is that spoken language lacks explicit punctuation and formatting, which can be crucial for good performance. This article describes different levels of speech segmentation, approaches to automatically recovering segment boundary locations, and experimental results demonstrating impact on several language processing tasks. The results also show a need for optimizing segmentation for the end task rather than independently.
Related Papers
- → Word Segmentation: The Role of Distributional Cues(1996)1,398 cited
- → How Transitional Probabilities and the Edge Effect Contribute to Listeners’ Phonological Bootstrapping Success(2016)25 cited
- → Flexibility in Statistical Word Segmentation: Finding Words in Foreign Speech(2014)6 cited
- → Computer assisted document production at Carnegie-Mellon University(1980)
- Electronic document processing : document editing, formatting, typesetting, mark-up, storing, interchanging, managing : bibliography(1994)