• DocumentCode
    638448
  • Title
    Mongolian speech corpus extension for Text-to-Speech development
  • Author
    Altangerel, Chagnaa; Esenbek, Kerey; Purev, Jaimai
  • Author_Institution
    Sch. of Inf. Technol., Nat. Univ. of Mongolia, Ulaanbaatar, Mongolia
  • Volume
    2
  • fYear
    2013
  • fDate
    June 28 2013-July 1 2013
  • Firstpage
    336
  • Lastpage
    340
  • Abstract
    This paper presents an extension of a Mongolian speech corpus designed for data-driven speech synthesis. The aim of the speech corpus is to develop a high-quality Mongolian TTS system that blind users can use with a screen reader. The new speech corpus contains nearly 10 hours of Mongolian male speech and is designed to cover all Mongolian phones. It provides Cyrillic text transcription and the corresponding phonetic transcription with stress marking. It also provides context information, including phone context, stress levels, and syntactic position in the word, phrase, and utterance, for modeling speech acoustics and characteristics for speech synthesis.
  • Keywords
    handicapped aids; natural language processing; speech synthesis; Cyrillic text transcription; Mongolian male speech; Mongolian phones; Mongolian speech corpus extension; blinds; data-driven speech synthesis; high-quality Mongolian TTS; phone context; phonetic transcription; screen reader; speech acoustics; stress marking; stressing levels; syntactic word position; text-to-speech development; Colon; Labeling; Lead; Pragmatics; Speech; Stress; Mongolian; Speech corpus; Text-to-Speech Synthesis;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Strategic Technology (IFOST), 2013 8th International Forum on
  • Conference_Location
    Ulaanbaatar
  • Print_ISBN
    978-1-4799-0931-5
  • Type
    conf
  • DOI
    10.1109/IFOST.2013.6616908
  • Filename
    6616908