• DocumentCode
    3153362
  • Title

    Memory-based data-driven approach for grapheme-to-phoneme conversion in Bengali text-to-speech synthesis system

  • Author

    Ghosh, Krishnendu ; Rao, K. Sreenivasa

  • Author_Institution
    Sch. of Inf. Technol., Indian Inst. of Technol. Kharagpur, Kharagpur, India
  • fYear
    2011
  • fDate
    16-18 Dec. 2011
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    In this paper, we propose a memory-based data-driven model for grapheme-to-phoneme (G2P) conversion for Bengali text-to-speech synthesis (TTS) system. Previous studies have stated the significance of the linguistic and phonetic features for rule-based Bengali G2P conversion techniques. But due to the lack of proper morphological analyzer, the scope of rule-based approaches is bounded. The proposed method overcomes the limitation of rule-based methods by exploiting the variety of contexts present in the text corpus built in the current study. The model has been trained with a memory-base showing the relation between graphs and phones based on contexts. The model has been tested with 300 random words and it achieved accuracy of 79.33% at word-level and 96.28% at graph-level. This performance has been compared with a related rule-based approach to prove the effectiveness of a data-driven method. Furthermore, the model doesn´t require any morphological knowledge of the words.
  • Keywords
    graphs; speech processing; speech synthesis; word processing; bengali text-to-speech synthesis system; grapheme-to-phoneme conversion; linguistic feature; memory-based data-driven approach; morphological analyzer; morphological knowledge; phonetic feature; rule-based Bengali G2P conversion technique; text corpus; Accuracy; Context; Dictionaries; Hidden Markov models; Manuals; Testing; Training; Alignment problem; Bengali; Data-driven method; Grapheme-to-phoneme conversion; Text-to-speech synthesis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    India Conference (INDICON), 2011 Annual IEEE
  • Conference_Location
    Hyderabad
  • Print_ISBN
    978-1-4577-1110-7
  • Type

    conf

  • DOI
    10.1109/INDCON.2011.6139334
  • Filename
    6139334