DocumentCode
2683501
Title
A New Approach To Accent Restoration Of Vietnamese Texts Using Dynamic Programming Combined With Co-Occurrence Graph
Author
Nghia, Hoang Trong ; Phuc, Do
Author_Institution
Dept. of Inf. Technol., Univ. of Natural Sci., Ho Chi Minh City, Vietnam
fYear
2009
fDate
13-17 July 2009
Firstpage
1
Lastpage
4
Abstract
In this paper, we introduce a new approach to restoring accents in Vietnamese text. Given a Vietnamese text in which accents are lost, our goal is to find a recovered text that yields the best lexical probability. Using a dynamic programming approach, we first build a language model for Vietnamese as a lexical database that assigns lexical probabilities to Vietnamese sentences. Second, we construct a map of literal translations of Vietnamese words to restrict our search space. Finally, we apply dynamic programming as a search engine to find the most probable sentence. We also use a co-occurrence graph to increase the accuracy of selection. Experimental results show that the average accuracy of our approach is about 93%-94%.
Keywords
dynamic programming; graph theory; natural languages; probability; text analysis; Vietnamese texts; accent restoration; co-occurrence graph; dynamic programming; lexical database; lexical probability; recovered text; Context modeling; Databases; Dynamic programming; Information systems; Information technology; Natural languages; Search engines; Writing;
fLanguage
English
Publisher
IEEE
Conference_Titel
Computing and Communication Technologies, 2009. RIVF '09. International Conference on
Conference_Location
Da Nang
Print_ISBN
978-1-4244-4566-0
Electronic_ISBN
978-1-4244-4568-4
Type
conf
DOI
10.1109/RIVF.2009.5174609
Filename
5174609