End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF
Top 1% of 2016 papers
Abstract
State-of-the-art sequence labeling systems traditionally require large amounts of task-specific knowledge in the form of handcrafted features and data pre-processing. In this paper, we introduce a novel neural network architecture that benefits from both word- and character-level representations automatically, by using a combination of bidirectional LSTM, CNN and CRF. Our system is truly end-to-end, requiring no feature engineering or data pre-processing, thus making it applicable to a wide range of sequence labeling tasks. We evaluate our system on two datasets for two sequence labeling tasks: the Penn Treebank WSJ corpus for part-of-speech (POS) tagging and the CoNLL 2003 corpus for named entity recognition (NER). We obtain state-of-the-art performance on both datasets: 97.55% accuracy for POS tagging and 91.21% F1 for NER.
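The abstract names a CRF as the final component but does not describe how tags are decoded. As an illustration only, the sketch below shows Viterbi decoding for a linear-chain CRF, a common way to pick the best tag sequence from per-token scores. The names `viterbi_decode`, `TAGS`, `EMISSIONS` and `TRANSITIONS` and all score values are hypothetical; in a full model the emission scores would presumably come from the bidirectional LSTM rather than being written by hand.

```python
from typing import List, Sequence


def viterbi_decode(
    emissions: Sequence[Sequence[float]],
    transitions: Sequence[Sequence[float]],
) -> List[int]:
    """Return the highest-scoring tag sequence for a linear-chain CRF.

    emissions[t][j]   -- score of tag j at position t (assumed encoder output)
    transitions[i][j] -- score of moving from tag i to tag j
    """
    if not emissions:
        return []
    num_tags = len(emissions[0])
    # best[j] is the best score of any path that ends in tag j so far.
    best = list(emissions[0])
    backpointers: List[List[int]] = []
    for t in range(1, len(emissions)):
        new_best = []
        pointers = []
        for j in range(num_tags):
            # Pick the previous tag that maximizes the path score into tag j.
            prev = max(range(num_tags), key=lambda i: best[i] + transitions[i][j])
            new_best.append(best[prev] + transitions[prev][j] + emissions[t][j])
            pointers.append(prev)
        best = new_best
        backpointers.append(pointers)
    # Start from the best final tag and follow the backpointers to position 0.
    last = max(range(num_tags), key=lambda j: best[j])
    path = [last]
    for pointers in reversed(backpointers):
        last = pointers[last]
        path.append(last)
    path.reverse()
    return path


# Hypothetical toy example: three tokens, three NER-style tags.
TAGS = ["O", "B-PER", "I-PER"]
EMISSIONS = [
    [0.1, 2.0, 0.0],
    [1.0, 0.0, 0.9],
    [1.5, 0.0, 0.2],
]
TRANSITIONS = [
    [0.5, 0.0, -10.0],  # from O: moving to I-PER is heavily penalized
    [0.0, -1.0, 1.0],   # from B-PER: continuing with I-PER is rewarded
    [0.0, -1.0, 0.5],   # from I-PER
]

path = viterbi_decode(EMISSIONS, TRANSITIONS)
print([TAGS[i] for i in path])  # ['B-PER', 'I-PER', 'O']
```

In this toy input, choosing the best tag at each position independently would give B-PER, O, O. The transition scores let the decoder prefer B-PER, I-PER, O instead, which is the kind of joint decision a CRF layer adds on top of per-token scores.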