• DocumentCode
    1849570
  • Title

    Automatic detection of problematic rules in Vietnamese Treebank

  • Author

    Hong-Quan Nguyen ; Phuong-Thai Nguyen ; Thanh-Quyen Dang ; Van-Hiep Nguyen

  • Author_Institution
    Quangninh Univ. of Ind., Quang Ninh, Vietnam
  • fYear
    2015
  • fDate
    25-28 Jan. 2015
  • Firstpage
    13
  • Lastpage
    18
  • Abstract
    The Vietnamese Treebank is a syntactically annotated corpus first published in 2009. In this paper, we apply automated methods to detect errors in the Vietnamese Treebank, based on the concept of equivalence classes proposed by Dickinson. On this basis, we propose an improved error detection method that transforms syntax trees using vertical markovization. Our experimental results on the Vietnamese Treebank show that the scope of error detection was extended more than 2 times and that precision improved by more than 18.07% in comparison with the baseline methods.
  • Keywords
    Markov processes; equivalence classes; natural language processing; trees (mathematics); Vietnamese Treebank; automatic detection; equivalence classes; problematic rules; syntax trees; vertical Markovization; Abstracts; Accuracy; Buildings; Educational institutions; Natural language processing; Reliability; Syntactics; VTB; ad hoc; auto; detect; equivalence; error; markovization; treebank;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computing & Communication Technologies - Research, Innovation, and Vision for the Future (RIVF), 2015 IEEE RIVF International Conference on
  • Conference_Location
    Can Tho
  • Print_ISBN
    978-1-4799-8043-7
  • Type

    conf

  • DOI
    10.1109/RIVF.2015.7049867
  • Filename
    7049867