مرکز منطقه ای اطلاع رساني علوم و فناوري - Simple methods for improving speaker-similarity of HMM-based speech synthesis

DocumentCode :

2798969

Title :

Simple methods for improving speaker-similarity of HMM-based speech synthesis

Author :

Yamagishi, Junichi ; King, Simon

Author_Institution :

Centre for Speech Technol. Res., Univ. of Edinburgh, Edinburgh, UK

fYear :

2010

fDate :

14-19 March 2010

Firstpage :

4610

Lastpage :

4613

Abstract :

In this paper we revisit some basic configuration choices of HMM-based speech synthesis, such as waveform sampling rate, auditory frequency warping scale and the logarithmic scaling of F₀, with the aim of improving speaker similarity which is an acknowledged weakness of current HMM-based speech synthesisers. All of the techniques investigated are simple but, as we demonstrate using perceptual tests, can make substantial differences to the quality of the synthetic speech. Contrary to common practice in automatic speech recognition, higher waveform sampling rates can offer enhanced feature extraction and improved speaker similarity for speech synthesis. In addition, a generalized logarithmic transform of F₀ results in larger intra-utterance variance of F₀ trajectories and hence more dynamic and natural-sounding prosody.

Keywords :

feature extraction; hidden Markov models; speech recognition; transforms; HMM; auditory frequency warping scale; feature extraction; generalized logarithmic transform; logarithmic scaling; speaker-similarity; speech recognition; speech synthesis; synthetic speech; waveform sampling rate; Automatic speech recognition; Feature extraction; Filters; Frequency synthesizers; Hidden Markov models; Loudspeakers; Natural languages; Sampling methods; Speech synthesis; Testing; HMM; HTS; TTS; speech synthesis;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on

Conference_Location :

Dallas, TX

ISSN :

1520-6149

Print_ISBN :

978-1-4244-4295-9

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2010.5495562

Filename :

5495562

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2798969