DocumentCode
3619606
Title
AlpSynth - concatenation-based speech synthesis for the Slovenian language
Author
J.Z. Gros;A. Mihelic;N. Pavesic;M. Zganec;S. Gruden
Author_Institution
Alpineon RTD, Ljubljana
fYear
2005
fDate
6/27/1905 12:00:00 AM
Firstpage
213
Lastpage
216
Abstract
The paper focuses on the design and collection of a speech corpus of elemental speech units for AlpSynth, a corpus-driven Slovenian TTS system. We describe the design procedures for a new speech corpus: purpose definition, content selection, definition of recording conditions and requirements, corpus segmentation and annotation. First we describe and comment the results of a frequency analysis of Slovenian allophone strings performed on a large Slovenian input text that has been converted to allophones. Further we present a method we designed for selection of a compact and efficient set of Slovenian sentences out of a large text corpus so as to minimize the final representative speech corpus. The selected sentences cover all the desired most frequent Slovenian quadphones, triphones and subsequently diphones. We describe the recording sessions and recording conditions. We continue describing the corpus annotation process. Finally, we describe the archive structure of the spoken corpus and present the information on its structure, content and size
Keywords
"Speech synthesis","Natural languages","Speech recognition","Telephony","Frequency conversion","Design methodology","Speech processing","Costs","User interfaces","Resumes"
Publisher
ieee
Conference_Titel
ELMAR, 2005. 47th International Symposium
Print_ISBN
953-7044-01-4
Type
conf
DOI
10.1109/ELMAR.2005.193680
Filename
1505681
Link To Document