Retrieval and browsing of spoken content
Abstract
Ever-increasing computing power and connectivity bandwidth, together with falling storage costs, are resulting in an overwhelming amount of data of various types being produced, exchanged, and stored. Consequently, information search and retrieval has emerged as a key application area. Text-based search is the most active area, with applications that range from Web and local network search to searching for personal information residing on one's own hard drive. Speech search has received less attention, perhaps because large collections of spoken material were not previously available. However, with cheaper storage and increased broadband access, there has been a corresponding increase in the availability of online spoken audio content such as news broadcasts, podcasts, and academic lectures. A variety of personal and commercial uses also exist. As data availability increases, the lack of adequate technology for processing spoken documents becomes the limiting factor for large-scale access to spoken content. In this article, we discuss the technical issues involved in developing information retrieval systems for spoken audio documents, concentrating on how to handle the errorful or incomplete output provided by automatic speech recognition (ASR) systems. We focus on the use case in which a user enters search terms into a search engine and is returned a collection of spoken document hits.