Title :
Creation of acoustic signal dictionary for ESNOLA based concatenated Bangla and Nepali TTS system
Author :
Khan, Soma ; Roy, Rajib
Author_Institution :
Centre for Dev. of Adv. Comput., Kolkata, India
Abstract :
Present paper describes the detail design and development of two different acoustic signal dictionaries for incorporating separately into Epoch Synchronous Non OverLap Add (ESNOLA) method based Bangla (SCB) and Nepali concatenative ´ITS system. Present work uses a new set of signal units in sub-phonemic level, namely, Partnemes and allows a flexible approach to the length of transitions. Partnemes include identifiable portions unique for phonemes, their transitions and co-articulations. The creation process includes a series of normalization (Pitch, Amplitude and DC) with judicial selection and augmentation of speech segments such that smaller fundamental yet appropriate parts of the phonemes, interphoneme and inter-word transitions can be used as acoustic units. Encouraging results of the listening test confirm good perceptual quality and acceptability of the developed signal dictionaries. ESNOLA framework with optimal size partneme inventories altogether give a simple approach for generation of high quality synthesized speech with easy portability to hand-held devices.
Keywords :
natural language processing; speech synthesis; Bangla-Nepali TTS system; ESNOLA; acoustic signal dictionary creation; epoch synchronous nonoverlap add method; hand-held devices; interphoneme; interword transitions; partnemes; speech segments; subphonemic level; Buildings; Dictionaries; Materials; Speech; Speech processing; Synthesizers; ESNOLA; Partnemes; TTS; Transition;
Conference_Titel :
Speech Database and Assessments (Oriental COCOSDA), 2011 International Conference on
Conference_Location :
Hsinchu
Print_ISBN :
978-1-4577-0930-2
DOI :
10.1109/ICSDA.2011.6086000