مرکز منطقه ای اطلاع رساني علوم و فناوري - Improving voice quality of HMM-based speech synthesis using voice conversion method

DocumentCode :

180507

Title :

Improving voice quality of HMM-based speech synthesis using voice conversion method

Author :

Yishan Jiao ; Xiang Xie ; Xingyu Na ; Ming Tu

Author_Institution :

Sch. of Inf. & Electron., Beijing Inst. of Technol., Beijing, China

fYear :

2014

fDate :

4-9 May 2014

Firstpage :

7914

Lastpage :

7918

Abstract :

HMM-based speech synthesis system (HTS) often generates buzzy and muffled speech. Such degradation of voice quality makes synthetic speech sound robotically rather than naturally. From this point, we suppose that synthetic speech is in a different speaker space apart from the original. We propose to use voice conversion method to transform synthetic speech toward the original so as to improve its quality. Local linear transformation (LLT) combined with temporal decomposition (TD) is proposed as the conversion method. It can not only ensure smooth spectral conversion but also avoid over-smoothing problem. Moreover, we design a robust spectral selection and modification strategy to make the modified spectra stable. Preference test shows that the proposed method can improve the quality of HMM-based speech synthesis.

Keywords :

hidden Markov models; speech synthesis; HMM based speech synthesis; local linear transformation; temporal decomposition; voice conversion method; voice quality; Hidden Markov models; High-temperature superconductors; Speech; Speech synthesis; Training; Vectors; HMM-based speech synthesis; local linear transformation; temporal decomposition; voice conversion;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on

Conference_Location :

Florence

Type :

conf

DOI :

10.1109/ICASSP.2014.6855141

Filename :

6855141

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=180507