DocumentCode :
2407052
Title :
Mongolian speech corpus for text-to-speech development
Author :
Hansakunbuntheung, Chachawarn ; Thangthai, Ausdang ; Thatphithakkul, N. ; Chagnaa, Altangerel
Author_Institution :
Human Language Technol. Lab., Nat. Electron. & Comput. Technol. Center, Pathumthani, Thailand
fYear :
2011
fDate :
26-28 Oct. 2011
Firstpage :
130
Lastpage :
135
Abstract :
This paper presents a first attempt to develop Mongolian speech corpus that designed for data-driven speech synthesis in Mongolia. The aim of the speech corpus is to develop a high-quality Mongolian TTS for blinds to use with screen reader. The speech corpus contains nearly 6 hours of Mongolian phones. It well provides Cyrillic text transcription and its phonetic transcription with stress marking. It also provides context information including phone context, stressing levels, syntactic position in word, phrase and utterance for modeling speech acoustics and characteristics for speech synthesis.
Keywords :
handicapped aids; speech synthesis; Cyrillic text transcription; Mongolian TTS; Mongolian speech corpus; blinds; data-driven speech synthesis; phone context; phonetic transcription; screen reader; speech acoustics modelling; stress marking; stressing levels; text-to-speech development; Mongolian; Speech corpus; Text-to-Speech Synthesis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Speech Database and Assessments (Oriental COCOSDA), 2011 International Conference on
Conference_Location :
Hsinchu
Print_ISBN :
978-1-4577-0930-2
Type :
conf
DOI :
10.1109/ICSDA.2011.6085994
Filename :
6085994
Link To Document :
بازگشت