0 citations0 references

Part-of-Speech Tagging with Bidirectional Long Short-Term Memory Recurrent Neural Network

arXiv (Cornell University)2015

Citations Over Time

Peilu Wang, Yao Qian, Frank K. Soong, Lei He, Hai Zhao

Abstract

Bidirectional Long Short-Term Memory Recurrent Neural Network (BLSTM-RNN) has been shown to be very effective for tagging sequential data, e.g. speech utterances or handwritten documents. While word embedding has been demoed as a powerful representation for characterizing the statistical properties of natural language. In this study, we propose to use BLSTM-RNN with word embedding for part-of-speech (POS) tagging task. When tested on Penn Treebank WSJ test set, a state-of-the-art performance of 97.40 tagging accuracy is achieved. Without using morphological features, this approach can also achieve a good performance comparable with the Stanford POS tagger.

Related Papers

Because Size Does Matter: The Hamburg Dependency Treebank(2014)
→ The Universal Dependencies Treebank for Slovenian(2017)28 cited
→ Dependency structure annotation in the IULA Spanish LSP Treebank(2014)3 cited
Does Netgraph Fit Prague Dependency Treebank(2008)