مرکز منطقه ای اطلاع رساني علوم و فناوري - Subspace and hypothesis based effective segmentation of co-articulated basic-units for concatenative speech synthesis

DocumentCode :

404824

Title :

Subspace and hypothesis based effective segmentation of co-articulated basic-units for concatenative speech synthesis

Author :

Muralishankar, R. ; Srikanth, R. ; Ramakrishnan, A.G.

Author_Institution :

Dept. of Electr. Eng., Indian Inst. of Sci., Bangalore, India

Volume :

fYear :

2003

fDate :

15-17 Oct. 2003

Firstpage :

388

Abstract :

In this paper, we present two new methods for vowel-consonant segmentation of a co-articulated basic-units employed in our Thirukkural Tamil text-to-speech synthesis system (G. L. Jayavardhana Rama et al, IEEE workshop on Speech Synthesis, 2002). The basic-units considered in this are CV, VC, VCV, VCCV and VCCC, where C stands for a consonant and V for any vowel. In the first method, we use a subspace-based approach for vowel-consonant segmentation. It uses oriented principal component analysis (OPCA) where the test feature vectors are projected on to the V and C subspaces. The crossover of the norm-contours obtained by projecting the test basic-unit onto the V and C subspaces give the segmentation points which in turn helps in identifying the V and C durations of a test basic-unit. In the second method, we use probabilistic principal component analysis (PPCA) to get probability models for V and C. We then use the Neymen-Pearson (NP) test to segment the basic-unit into V and C. Finally, we show that the hypothesis testing turns out to be an energy detector for V-C segmentation which is similar to the first method.

Keywords :

principal component analysis; speech synthesis; Neymen-Pearson test; OPCA; PPCA; Thirukkural Tamil text-to-speech synthesis system; concatenative speech synthesis; hypothesis based co-articulated basic-unit segmentation; hypothesis testing; norm-contours crossover; oriented principal component analysis; probabilistic principal component analysis; subspace based co-articulated basic-unit segmentation; test feature vectors; vowel-consonant segmentation energy detector; Acoustic signal detection; Covariance matrix; Detectors; Matrix decomposition; Signal to noise ratio; Speech recognition; Speech synthesis; Testing; Vectors; Virtual colonoscopy;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

TENCON 2003. Conference on Convergent Technologies for the Asia-Pacific Region

Print_ISBN :

0-7803-8162-9

Type :

conf

DOI :

10.1109/TENCON.2003.1273351

Filename :

1273351

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=404824