Spotting keywords and sensing topic changes in speech
Abstract
Security concerns involved in dealing with sensitive information conveyed in human languages must be able to handle speech, which is the most basic, natural form of human communication and a huge amount of data are being generated daily. Dealing with such data is naturally associated with typical big-data problems in terms of both computational complexity and storage space. Unfortunately, compared with written texts, speech is inherently more difficult to browse, if no technical support is provided. In this paper we are interested in spotting keywords, which could reflect a security agent's information needs, and study its usefulness in helping automatically disclose topic changes (boundaries) in speech data under concern. Our results show that keyword spotting can help identify topics with a competitive performance.
Related Papers
- → Keyword spotting method based on speech feature space trace matching(2004)7 cited
- → An approach of keyword spotting based on HMM(2002)4 cited
- → Mutitask Learning Based Muti-examples Keywords Spotting in Low Resource Condition(2018)4 cited
- → Attention-Based End-to-End Keywords Spotting(2020)1 cited
- → Word Spotting based on the Generalized Hough Transform and continuous DP matching(1998)