DocumentCode :
404824
Title :
Subspace and hypothesis based effective segmentation of co-articulated basic-units for concatenative speech synthesis
Author :
Muralishankar, R. ; Srikanth, R. ; Ramakrishnan, A.G.
Author_Institution :
Dept. of Electr. Eng., Indian Inst. of Sci., Bangalore, India
Volume :
1
fYear :
2003
fDate :
15-17 Oct. 2003
Firstpage :
388
Abstract :
In this paper, we present two new methods for vowel-consonant segmentation of a co-articulated basic-units employed in our Thirukkural Tamil text-to-speech synthesis system (G. L. Jayavardhana Rama et al, IEEE workshop on Speech Synthesis, 2002). The basic-units considered in this are CV, VC, VCV, VCCV and VCCC, where C stands for a consonant and V for any vowel. In the first method, we use a subspace-based approach for vowel-consonant segmentation. It uses oriented principal component analysis (OPCA) where the test feature vectors are projected on to the V and C subspaces. The crossover of the norm-contours obtained by projecting the test basic-unit onto the V and C subspaces give the segmentation points which in turn helps in identifying the V and C durations of a test basic-unit. In the second method, we use probabilistic principal component analysis (PPCA) to get probability models for V and C. We then use the Neymen-Pearson (NP) test to segment the basic-unit into V and C. Finally, we show that the hypothesis testing turns out to be an energy detector for V-C segmentation which is similar to the first method.
Keywords :
principal component analysis; speech synthesis; Neymen-Pearson test; OPCA; PPCA; Thirukkural Tamil text-to-speech synthesis system; concatenative speech synthesis; hypothesis based co-articulated basic-unit segmentation; hypothesis testing; norm-contours crossover; oriented principal component analysis; probabilistic principal component analysis; subspace based co-articulated basic-unit segmentation; test feature vectors; vowel-consonant segmentation energy detector; Acoustic signal detection; Covariance matrix; Detectors; Matrix decomposition; Signal to noise ratio; Speech recognition; Speech synthesis; Testing; Vectors; Virtual colonoscopy;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
TENCON 2003. Conference on Convergent Technologies for the Asia-Pacific Region
Print_ISBN :
0-7803-8162-9
Type :
conf
DOI :
10.1109/TENCON.2003.1273351
Filename :
1273351
Link To Document :
بازگشت