مرکز منطقه ای اطلاع رساني علوم و فناوري - VTLN adaptation for statistical speech synthesis

DocumentCode :

2791593

Title :

VTLN adaptation for statistical speech synthesis

Author :

Saheer, Lakshmi ; Garner, Philip N. ; Dines, John ; Liang, Hui

Author_Institution :

Idiap Res. Inst., Martigny, Switzerland

fYear :

2010

fDate :

14-19 March 2010

Firstpage :

4838

Lastpage :

4841

Abstract :

The advent of statistical speech synthesis has enabled the unification of the basic techniques used in speech synthesis and recognition. Adaptation techniques that have been successfully used in recognition systems can now be applied to synthesis systems to improve the quality of the synthesized speech. The application of vocal tract length normalization (VTLN) for synthesis is explored in this paper. VTLN based adaptation requires estimation of a single warping factor, which can be accurately estimated from very little adaptation data and gives additive improvements over CMLLR adaptation. The challenge of estimating accurate warping factors using higher order features is solved by initializing warping factor estimation with the values calculated from lower order features.

Keywords :

speech recognition; speech synthesis; statistical analysis; CMLLR adaptation; VTLN adaptation; speech recognition; statistical speech synthesis; vocal tract length normalization; Adaptation model; Automatic speech recognition; Cepstral analysis; Feature extraction; Frequency; Hidden Markov models; Maximum likelihood linear regression; Speech recognition; Speech synthesis; Vectors; Adaptation; Statistical Speech Synthesis; Vocal Tract Length Normalization;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on

Conference_Location :

Dallas, TX

ISSN :

1520-6149

Print_ISBN :

978-1-4244-4295-9

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2010.5495126

Filename :

5495126

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2791593