• DocumentCode
    3101695
  • Title

    Advances in Acoustic Modeling for Vietnamese LVCSR

  • Author

    Nguyen, Tuan ; Vu, Quan

  • Author_Institution
    Univ. of Sci., Ho Chi Minh City, Vietnam
  • fYear
    2009
  • fDate
    7-9 Dec. 2009
  • Firstpage
    280
  • Lastpage
    284
  • Abstract
    In this paper, we present our experiments on the selection of basic phonetic units for the Vietnamese large vocabulary continuous speech recognition (LVCSR). Two acoustic models were compared. The first model has just used vowels or monophthongs as phonemes while the second one, which was proposed in this paper, has explored the use of diphthongs and triphthongs as phonemes as well. The two models were trained and evaluated on a broadcast news corpus containing 27 hours of acoustic training data and 1 hour of acoustic testing data. Moreover, an 146 M-word corpus collection of newspaper was employed for building the language models. Experimental results indicate significant improvements in both word accuracy rate and time-execution. With the second acoustic model, the word accuracy rates reach 86.06% on the best case and the execution time is faster than the real-time.
  • Keywords
    natural language processing; speech recognition; vocabulary; Vietnamese LVCSR; Vietnamese large vocabulary continuous speech recognition; acoustic modeling; acoustic testing data; broadcast news corpus; language models; Acoustic testing; Asia; Broadcasting; Character recognition; Handwriting recognition; Image recognition; Natural languages; Speech recognition; Vocabulary; Writing; Vietnamese; acoustic models; speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Asian Language Processing, 2009. IALP '09. International Conference on
  • Conference_Location
    Singapore
  • Print_ISBN
    978-0-7695-3904-1
  • Type

    conf

  • DOI
    10.1109/IALP.2009.66
  • Filename
    5380749