Title :
Multiple Feature-Sets Method for Dependency Parsing
Author :
Xianchao Zhang ; Dong Du ; Xinyue Liu ; Wenxin Liang
Author_Institution :
Sch. of Software, Dalian Univ. of Technol., Dalian, China
Abstract :
This paper presents a simple and effective approach to improve dependency parsing by exploiting multiple feature-sets. Traditionally, features are extracted by applying the feature templates to all the word pairs(first-order features)and word tuples(second-order features). In this pa per, we show that exploiting different feature templates for different word pairs and word tuples achieves significant improvement overbaseline parsers. First, we train a text chunker using a freely available implementation of the first-order linear conditional random fields model. Then we build a clause-chunk tree for a given sentence based on chunking information and punctuation marks. Finally, we extract features for dependency parsing according to multiple feature-sets. We extend the projective parsing algorithms of McDonald[20] and Carreras[1] for our case, experimental results show that our approach significantly outperform the baseline systems without increasing complexity. Given correct chunking information, we improve from baseline accuracies of 91.36% and 92.20% to 93.19% and 93.89%, respectively.
Keywords :
feature extraction; grammars; natural language processing; text analysis; chunking information; clause-chunk tree; dependency parsing; feature extraction; feature templates; first-order features; first-order linear conditional random field model; multiple feature-sets method; projective parsing algorithms; punctuation marks; second-order features; text chunker; word pairs; word tuples; Accuracy; Approximation algorithms; Feature extraction; Heuristic algorithms; Inference algorithms; Tagging; Training; dependency parsing; semi-supervised methods;
Conference_Titel :
Parallel Architectures, Algorithms and Programming (PAAP), 2014 Sixth International Symposium on
Conference_Location :
Beijing
Print_ISBN :
978-1-4799-3844-5
DOI :
10.1109/PAAP.2014.30