DocumentCode :
2909761
Title :
Improving Chinese Dependency Parsing with Self-Disambiguating Patterns
Author :
Qiu, Likun ; Wu, Lei ; Zhao, Kai ; Hu, Changjian ; Kong, Lingpeng
Author_Institution :
Key Lab. of Comput. Linguistics, Peking Univ., Beijing, China
fYear :
2011
fDate :
15-17 Nov. 2011
Firstpage :
7
Lastpage :
10
Abstract :
To solve the data sparseness problem in dependency parsing, most previous studies used features constructed from large-scale auto-parsed data. Unlike previous work, we propose a new approach to improve dependency parsing with context-free dependency triples (CDT) extracted by using self-disambiguating patterns (SDP). The use of SDP makes it possible to avoid the dependency on a baseline parser and explore the influence of different types of substructures one by one. Additionally, taking the available CDTs as seeds, a label propagation process is used to tag a large number of unlabeled word pairs as CDTs. Experiments show that, when CDT features are integrated into a maximum spanning tree (MST) dependency parser, the new parser improves significantly over the baseline MST parser. Comparative results also show that CDTs with dependency relation labels perform much better than CDT without dependency relation label.
Keywords :
context-free grammars; natural language processing; trees (mathematics); CDT; MST; SDP; context-free dependency triples; data sparseness problem; dependency parsing; label propagation process; maximum spanning tree; self-disambiguating patterns; Context; Data mining; Earth Observing System; Feature extraction; Syntactics; Tagging; Training; Dependency parsing; raw corpus; self-disambiguating pattern;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Asian Language Processing (IALP), 2011 International Conference on
Conference_Location :
Penang
Print_ISBN :
978-1-4577-1733-8
Type :
conf
DOI :
10.1109/IALP.2011.36
Filename :
6121457
Link To Document :
بازگشت