Improving Graph-Based Dependency Parsing Models With Dependency Language Models

Author

Min Zhang ; Wenliang Chen ; Xiangyu Duan ; Rong Zhang

Author_Institution

Sch. of Comput. Sci. & Technol., Soochow Univ., Suzhou, China

Volume

21

Issue

11

fYear

2013

fDate

Nov. 2013

Firstpage

2313

Lastpage

2323

Abstract

For graph-based dependency parsing, how to enrich high-order features without increasing decoding complexity is a very challenging problem. To solve this problem, this paper presents an approach to representing high-order features for graph-based dependency parsing models using a dependency language model and beam search. Firstly, we use a baseline parser to parse a large-amount of unannotated data. Then we build the dependency language model (DLM) on the auto-parsed data. A set of new features is represented based on the DLM. Finally, we integrate the DLM-based features into the parsing model during decoding by beam search. We also utilize the features in bilingual text (bitext) parsing models. The main advantages of our approach are: 1) we utilize rich high-order features defined over a view of large scope and additional large raw corpus; 2) our approach does not increase the decoding complexity. We evaluate the proposed approach on the monotext and bitext parsing tasks. In the monotext parsing task, we conduct the experiments on Chinese and English data. The experimental results show that our new parser achieves the best accuracy on the Chinese data and comparable accuracy with the best known systems on the English data. In the bitext parsing task, we conduct the experiments on a Chinese-English bilingual data and our score is the best reported so far.

Keywords

decoding; grammars; graph theory; natural language processing; text analysis; Chinese data; Chinese-English bilingual data; DLM-based features; English data; auto-parsed data; beam search; bilingual text parsing models; bitext parsing models; bitext parsing tasks; decoding complexity; dependency language model; dependency language models; graph-based dependency parsing models; high-order features; monotext parsing tasks; Accuracy; Complexity theory; Computational modeling; Decoding; Mathematical model; Training; Vectors; Bilingual text parsing; dependency Language Model; dependency Parsing; natural Language Processing;

fLanguage

English

Journal_Title

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher

ieee

ISSN

1558-7916

Type

jour

DOI

10.1109/TASL.2013.2273715

Filename

6562798