• DocumentCode
    3619606
  • Title

    AlpSynth - concatenation-based speech synthesis for the Slovenian language

  • Author

    J.Z. Gros;A. Mihelic;N. Pavesic;M. Zganec;S. Gruden

  • Author_Institution
    Alpineon RTD, Ljubljana
  • fYear
    2005
  • fDate
    2005
  • Firstpage
    213
  • Lastpage
    216
  • Abstract
    The paper focuses on the design and collection of a speech corpus of elemental speech units for AlpSynth, a corpus-driven Slovenian TTS system. We describe the design procedures for a new speech corpus: purpose definition, content selection, definition of recording conditions and requirements, and corpus segmentation and annotation. First, we describe and comment on the results of a frequency analysis of Slovenian allophone strings performed on a large Slovenian input text that has been converted to allophones. Next, we present a method we designed for selecting a compact and efficient set of Slovenian sentences from a large text corpus so as to minimize the size of the final representative speech corpus. The selected sentences cover all the desired most frequent Slovenian quadphones, triphones and, subsequently, diphones. We then describe the recording sessions and recording conditions, followed by the corpus annotation process. Finally, we describe the archive structure of the spoken corpus and present information on its structure, content and size.
  • Keywords
    "Speech synthesis","Natural languages","Speech recognition","Telephony","Frequency conversion","Design methodology","Speech processing","Costs","User interfaces","Resumes"
  • Publisher
    IEEE
  • Conference_Titel
    ELMAR, 2005. 47th International Symposium
  • Print_ISBN
    953-7044-01-4
  • Type

    conf

  • DOI
    10.1109/ELMAR.2005.193680
  • Filename
    1505681