• DocumentCode
    424252
  • Title
    Parsing Chinese with head-driven model
  • Author
    Cao, Hai-Long ; Zhao, Tie-jun ; Yang, Mu-yun ; Li, Sheng
  • Author_Institution
    Sch. of Comput. Sci. & Technol., Harbin Inst. of Technol., China
  • Volume
    4
  • fYear
    2004
  • fDate
    26-29 Aug. 2004
  • Firstpage
    2618
  • Abstract
    Great progress has been made in parsing the Wall Street Journal portion of the Penn Treebank, and parsing languages other than English is now an intensive research area. The head-driven model is one of the best English parsing models. It has been successfully applied to Czech but failed to outperform a baseline model in parsing German. This paper attempts to parse Chinese with the head-driven model. Promising experimental results demonstrate that the head-driven model works well for Chinese. We propose a hybrid parsing strategy that combines the head-driven model with a Chinese base-phrase parsing model. The combined model not only improves performance but also makes the parser space- and time-efficient. We evaluate our method using PARSEVAL measures; the combined model achieves 79.88% precision and 81.97% recall.
  • Keywords
    grammars; natural language interfaces; natural languages; statistical analysis; Chinese base phrase parsing; head-driven model; hybrid parsing strategy; language parsing; Context modeling; Decoding; Dictionaries; Length measurement; Machine learning; Speech; Testing;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Machine Learning and Cybernetics, 2004. Proceedings of 2004 International Conference on
  • Print_ISBN
    0-7803-8403-2
  • Type
    conf
  • DOI
    10.1109/ICMLC.2004.1382246
  • Filename
    1382246