DocumentCode
638448
Title
Mongolian speech corpus extension for Text-to-Speech development
Author
Altangerel, Chagnaa ; Esenbek, Kerey ; Purev, Jaimai
Author_Institution
Sch. of Inf. Technol., Nat. Univ. of Mongolia, Ulaanbaatar, Mongolia
Volume
2
fYear
2013
fDate
June 28 2013-July 1 2013
Firstpage
336
Lastpage
340
Abstract
This paper presents an extension of Mongolian speech corpus that designed for data-driven speech synthesis. The aim of the speech corpus is to develop a high-quality Mongolian TTS for blinds to use with screen reader. The new speech corpus contains nearly 10 hours of Mongolian male speech that is designed to cover all Mongolian phones. It well provides Cyrillic text transcription and its phonetic transcription with stress marking. It also provides context information including phone context, stressing levels, syntactic position in word, phrase and utterance for modeling speech acoustics and characteristics for speech synthesis.
Keywords
handicapped aids; natural language processing; speech synthesis; Cyrillic text transcription; Mongolian male speech; Mongolian phones; Mongolian speech corpus extension; blinds; data-driven speech synthesis; high-quality Mongolian TTS; phone context; phonetic transcription; screen reader; speech acoustics; stress marking; stressing levels; syntactic word position; text-to-speech development; Colon; Labeling; Lead; Pragmatics; Speech; Stress; Mongolian; Speech corpus; Text-to-Speech Synthesis;
fLanguage
English
Publisher
ieee
Conference_Titel
Strategic Technology (IFOST), 2013 8th International Forum on
Conference_Location
Ulaanbaatar
Print_ISBN
978-1-4799-0931-5
Type
conf
DOI
10.1109/IFOST.2013.6616908
Filename
6616908
Link To Document