• DocumentCode
    2683501
  • Title

    A New Approach To Accent Restoration Of Vietnamese Texts Using Dynamic Programming Combined With Co-Occurrence Graph

  • Author

    Nghia, Hoang Trong ; Phuc, Do

  • Author_Institution
    Dept. of Inf. Technol., Univ. of Natural Sci., Ho Chi Minh City, Vietnam
  • fYear
    2009
  • fDate
    13-17 July 2009
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    In this paper, we would like to introduce a new approach to recover Vietnamese text´s accents. Given a Vietnamese text in which accents are lost, our goal is to seek for a recovered text that yields a best lexical probability. Using a dynamic programming approach, we first build a model of language for Vietnamese as a lexical database which gives lexical probabilities to Vietnamese sentences. Second, we construct a map of literal translations of Vietnamese words to restrict our searching space. Finally, we apply dynamic programming as a searching engine to seek out the most probable sentence. We also use the co-occurrence graph to increase the accuracy of selection, the experimental results show that the average accuracy of our approach is about 93%-94%.
  • Keywords
    dynamic programming; graph theory; natural languages; probability; text analysis; Vietnamese texts; accent restoration; co-occurrence graph; dynamic programming; lexical database; lexical probability; recovered text; Context modeling; Databases; Dynamic programming; Information systems; Information technology; Natural languages; Search engines; Writing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computing and Communication Technologies, 2009. RIVF '09. International Conference on
  • Conference_Location
    Da Nang
  • Print_ISBN
    978-1-4244-4566-0
  • Electronic_ISBN
    978-1-4244-4568-4
  • Type

    conf

  • DOI
    10.1109/RIVF.2009.5174609
  • Filename
    5174609