DocumentCode
47551
Title
Improving Graph-Based Dependency Parsing Models With Dependency Language Models
Author
Min Zhang ; Wenliang Chen ; Xiangyu Duan ; Rong Zhang
Author_Institution
Sch. of Comput. Sci. & Technol., Soochow Univ., Suzhou, China
Volume
21
Issue
11
fYear
2013
fDate
Nov. 2013
Firstpage
2313
Lastpage
2323
Abstract
For graph-based dependency parsing, how to enrich high-order features without increasing decoding complexity is a very challenging problem. To solve this problem, this paper presents an approach to representing high-order features for graph-based dependency parsing models using a dependency language model and beam search. Firstly, we use a baseline parser to parse a large-amount of unannotated data. Then we build the dependency language model (DLM) on the auto-parsed data. A set of new features is represented based on the DLM. Finally, we integrate the DLM-based features into the parsing model during decoding by beam search. We also utilize the features in bilingual text (bitext) parsing models. The main advantages of our approach are: 1) we utilize rich high-order features defined over a view of large scope and additional large raw corpus; 2) our approach does not increase the decoding complexity. We evaluate the proposed approach on the monotext and bitext parsing tasks. In the monotext parsing task, we conduct the experiments on Chinese and English data. The experimental results show that our new parser achieves the best accuracy on the Chinese data and comparable accuracy with the best known systems on the English data. In the bitext parsing task, we conduct the experiments on a Chinese-English bilingual data and our score is the best reported so far.
Keywords
decoding; grammars; graph theory; natural language processing; text analysis; Chinese data; Chinese-English bilingual data; DLM-based features; English data; auto-parsed data; beam search; bilingual text parsing models; bitext parsing models; bitext parsing tasks; decoding complexity; dependency language model; dependency language models; graph-based dependency parsing models; high-order features; monotext parsing tasks; Accuracy; Complexity theory; Computational modeling; Decoding; Mathematical model; Training; Vectors; Bilingual text parsing; dependency Language Model; dependency Parsing; natural Language Processing;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
ieee
ISSN
1558-7916
Type
jour
DOI
10.1109/TASL.2013.2273715
Filename
6562798
Link To Document