• DocumentCode
    23015
  • Title

    Dependency Parse Reranking with Rich Subtree Features

  • Author

    Mo Shen ; Kawahara, Daisuke ; Kurohashi, Sadao

  • Author_Institution
    Grad. Sch. of Inf., Kyoto Univ., Kyoto, Japan
  • Volume
    22
  • Issue
    7
  • fYear
    2014
  • fDate
    Jul-14
  • Firstpage
    1208
  • Lastpage
    1218
  • Abstract
    In pursuing machine understanding of human language, highly accurate syntactic analysis is a crucial step. In this work, we focus on dependency grammar, which models syntax by encoding transparent predicate-argument structures. Recent advances in dependency parsing have shown that employing higher-order subtree structures in graph-based parsers can substantially improve the parsing accuracy. However, the inefficiency of this approach increases with the order of the subtrees. This work explores a new reranking approach for dependency parsing that can utilize complex subtree representations by applying efficient subtree selection methods. We demonstrate the effectiveness of the approach in experiments conducted on the Penn Treebank and the Chinese Treebank. Our system achieves the best performance among known supervised systems evaluated on these datasets, improving the baseline accuracy from 91.88% to 93.42% for English, and from 87.39% to 89.25% for Chinese.
  • Keywords
    computational linguistics; grammars; natural languages; tree data structures; trees (mathematics); Chinese treebank; Penn treebank; complex subtree representations; dependency grammar; dependency parse reranking approach; graph-based parsers; higher-order subtree structures; human language; rich subtree features; subtree selection methods; supervised systems; syntactic analysis; syntax models; transparent predicate-argument structure encoding; Accuracy; Context; Data mining; Educational institutions; Encoding; Feature extraction; Vectors; Dependency parsing; multilingual parsing; parse reranking;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    2329-9290
  • Type

    jour

  • DOI
    10.1109/TASLP.2014.2327295
  • Filename
    6822566