DocumentCode
1849570
Title
Automatic detection of problematic rules in Vietnamese Treebank
Author
Hong-Quan Nguyen ; Phuong-Thai Nguyen ; Thanh-Quyen Dang ; Van-Hiep Nguyen
Author_Institution
Quangninh Univ. of Ind., Quang Ninh, Vietnam
fYear
2015
fDate
25-28 Jan. 2015
Firstpage
13
Lastpage
18
Abstract
The Vietnamese Treebank is a syntactically annotated corpus published in 2009. In this paper, we apply automated methods to detect errors in the Vietnamese Treebank, based on the concept of equivalence classes proposed by Dickinson. Building on this, we propose an improved error detection method that transforms syntax trees using vertical markovization. Our experimental results on the Vietnamese Treebank show that the scope of error detection was extended more than twofold and that precision improved by more than 18.07% compared with the baseline methods.
Keywords
Markov processes; equivalence classes; natural language processing; trees (mathematics); Vietnamese Treebank; automatic detection; problematic rules; syntax trees; vertical Markovization; Abstracts; Accuracy; Buildings; Educational institutions; Reliability; Syntactics; VTB; ad hoc; auto; detect; equivalence; error; markovization; treebank
fLanguage
English
Publisher
ieee
Conference_Titel
Computing & Communication Technologies - Research, Innovation, and Vision for the Future (RIVF), 2015 IEEE RIVF International Conference on
Conference_Location
Can Tho
Print_ISBN
978-1-4799-8043-7
Type
conf
DOI
10.1109/RIVF.2015.7049867
Filename
7049867