Speech recognitionwith segmental conditional random fields: A summary of the JHU CLSP 2010 Summer Workshop
2011Vol. 15, pp. 5044–5047
Citations Over TimeTop 10% of 2011 papers
Geoffrey Zweig, P. Nguyen, Dirk Van Compernolle, Kris Demuynck, Les Atlas, Pascal Clark, Gregory Sell, M. Wang, Fei Sha, Hynek Heřmanský, Damianos Karakos, Jack Jansen, Samuel Thomas, G. S. V. S. Sivaram, Samuel Bowman, Justine Kao
Abstract
This paper summarizes the 2010 CLSP Summer Workshop on speech recognition at Johns Hopkins University. The key theme of the workshop was to improve on state-of-the-art speech recognition systems by using Segmental Conditional Random Fields (SCRFs) to integrate multiple types of information. This approach uses a state of-the-art baseline as a springboard from which to add a suite of novel features including ones derived from acoustic templates, deep neural net phoneme detections, duration models, modulation features, and whole word point-process models. The SCRF framework is able to appropriately weight these different information sources to produce significant gains on both die Broadcast News and Wall Street Journal tasks.
Related Papers
- → Bidirectional integrated random fields for human behaviour understanding(2012)10 cited
- Named entity recognition in Chinese medical records based on cascaded conditional random field(2014)
- → Human behavior recognition based on fractal conditional random field(2013)2 cited
- → SUITE 2010(2010)1 cited
- → Biomedical Named Entity Recognition Using Second-Order Conditional Random Fields(2011)