• DocumentCode
    47551
  • Title

    Improving Graph-Based Dependency Parsing Models With Dependency Language Models

  • Author

    Min Zhang ; Wenliang Chen ; Xiangyu Duan ; Rong Zhang

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Soochow Univ., Suzhou, China
  • Volume
    21
  • Issue
    11
  • fYear
    2013
  • fDate
    Nov. 2013
  • Firstpage
    2313
  • Lastpage
    2323
  • Abstract
    For graph-based dependency parsing, how to enrich high-order features without increasing decoding complexity is a very challenging problem. To solve this problem, this paper presents an approach to representing high-order features for graph-based dependency parsing models using a dependency language model and beam search. Firstly, we use a baseline parser to parse a large-amount of unannotated data. Then we build the dependency language model (DLM) on the auto-parsed data. A set of new features is represented based on the DLM. Finally, we integrate the DLM-based features into the parsing model during decoding by beam search. We also utilize the features in bilingual text (bitext) parsing models. The main advantages of our approach are: 1) we utilize rich high-order features defined over a view of large scope and additional large raw corpus; 2) our approach does not increase the decoding complexity. We evaluate the proposed approach on the monotext and bitext parsing tasks. In the monotext parsing task, we conduct the experiments on Chinese and English data. The experimental results show that our new parser achieves the best accuracy on the Chinese data and comparable accuracy with the best known systems on the English data. In the bitext parsing task, we conduct the experiments on a Chinese-English bilingual data and our score is the best reported so far.
  • Keywords
    decoding; grammars; graph theory; natural language processing; text analysis; Chinese data; Chinese-English bilingual data; DLM-based features; English data; auto-parsed data; beam search; bilingual text parsing models; bitext parsing models; bitext parsing tasks; decoding complexity; dependency language model; dependency language models; graph-based dependency parsing models; high-order features; monotext parsing tasks; Accuracy; Complexity theory; Computational modeling; Decoding; Mathematical model; Training; Vectors; Bilingual text parsing; dependency Language Model; dependency Parsing; natural Language Processing;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2013.2273715
  • Filename
    6562798