• DocumentCode
    2798144
  • Title

    Generating transcriptions for romanized Thai persons´ names

  • Author

    Suchato, Atiwong ; Kittikool, Chuleekorn ; Punyabukkana, Proadpran

  • Author_Institution
    Dept. of Comput. Eng., Chulalongkorn Univ., Bangkok, Thailand
  • fYear
    2012
  • fDate
    16-18 May 2012
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    A transcription of each word can either be produced by rules, statistical models, or retrieved from dictionary. However, the lack of standards and the variation of how a Thai person romanizes his or her name pose transcription a challenging task. Although the dictionary-based approach seems to produce the most accurate result, a letter-to-sound conversion module is necessary for unknown names. We propose an approach to transcribe romanized Thai person names into Thai sounds which considers the popularity of usage. The romanized Thai names are parsed into sequences of grams, utilizing the Gram lexicon, built from a corpus of more than 130,000 names. The results show 90 and 93% mean opinion score of acceptability when the transcriptions are generated from all possible sequences with unweighted and weighted Thai grams respectively. When longest-match model is used, the acceptability levels are 70 and 75% for unweighted and weighted Thai grams.
  • Keywords
    dictionaries; information retrieval; natural language processing; speech synthesis; statistical analysis; Thai sounds; dictionary-based approach; gram lexicon; gram sequences; letter-to-sound conversion module; mean opinion score; name pose transcription; romanized Thai persons names; statistical models; text-to-speech system; transcription generation; unweighted Thai grams; weighted Thai grams; Accuracy; Audio systems; Computers; Educational institutions; Equations; Mathematical model; Training; Romanization; Thai names; Transcription;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON), 2012 9th International Conference on
  • Conference_Location
    Phetchaburi
  • Print_ISBN
    978-1-4673-2026-9
  • Type

    conf

  • DOI
    10.1109/ECTICon.2012.6254338
  • Filename
    6254338