Title :
Mongolian speech corpus for text-to-speech development
Author :
Hansakunbuntheung, Chachawarn ; Thangthai, Ausdang ; Thatphithakkul, N. ; Chagnaa, Altangerel
Author_Institution :
Human Language Technol. Lab., Nat. Electron. & Comput. Technol. Center, Pathumthani, Thailand
Abstract :
This paper presents a first attempt to develop Mongolian speech corpus that designed for data-driven speech synthesis in Mongolia. The aim of the speech corpus is to develop a high-quality Mongolian TTS for blinds to use with screen reader. The speech corpus contains nearly 6 hours of Mongolian phones. It well provides Cyrillic text transcription and its phonetic transcription with stress marking. It also provides context information including phone context, stressing levels, syntactic position in word, phrase and utterance for modeling speech acoustics and characteristics for speech synthesis.
Keywords :
handicapped aids; speech synthesis; Cyrillic text transcription; Mongolian TTS; Mongolian speech corpus; blinds; data-driven speech synthesis; phone context; phonetic transcription; screen reader; speech acoustics modelling; stress marking; stressing levels; text-to-speech development; Mongolian; Speech corpus; Text-to-Speech Synthesis;
Conference_Titel :
Speech Database and Assessments (Oriental COCOSDA), 2011 International Conference on
Conference_Location :
Hsinchu
Print_ISBN :
978-1-4577-0930-2
DOI :
10.1109/ICSDA.2011.6085994