TAG, dynamic programming, and the perceptron for efficient, feature-rich parsing
Citations Over TimeTop 1% of 2008 papers
Abstract
We describe a parsing approach that makes use of the perceptron algorithm, in conjunction with dynamic programming methods, to recover full constituent-based parse trees. The formalism allows a rich set of parse-tree features, including PCFG-based features, bigram and trigram dependency features, and surface features. A severe challenge in applying such an approach to full syntactic parsing is the efficiency of the parsing algorithms involved. We show that efficient training is feasible, using a Tree Adjoining Grammar (TAG) based parsing formalism. A lower-order dependency parsing model is used to restrict the search space of the full model, thereby making it efficient. Experiments on the Penn WSJ treebank show that the model achieves state-of-the-art performance, for both constituent and dependency accuracy.
Related Papers
- → A survey of syntactic-semantic parsing based on constituent and dependency structures(2020)35 cited
- Parsing Schemata - a framework for specification and analysis of parsing algorithms(1997)
- Research on Chinese parsing(2012)
- → A Survey of Syntactic-Semantic Parsing Based on Constituent and Dependency Structures(2020)2 cited
- → Parsing Schemata(1993)