مرکز منطقه ای اطلاع رساني علوم و فناوري - Speech spectrum transformation by speaker interpolation

DocumentCode :

290034

Title :

Speech spectrum transformation by speaker interpolation

Author :

Iwahashi, Naoto ; Sagisaka, Yoshinori

Author_Institution :

Sony Corp. Res. Labs, Tokyo, Japan

Volume :

fYear :

1994

fDate :

19-22 Apr 1994

Abstract :

Proposes a speech spectrum transformation method by interpolating spectral patterns between pre-stored multiple speakers for speech synthesis. The interpolation is carried out using spectral parameters such as cepstrum and log area ratio to generate new spectrum patterns. The spectral patterns can be transformed smoothly as the interpolation ratio is gradually changed, and speech individuality can easily be controlled between interpolated speakers. Adaptation to a target speaker can be performed by this interpolation, which uses only a small amount of training data to generate a new speech spectrum sequence close to the target speaker´s. An adaptation experiment was carried out in the case of using only one word spoken by the target speaker for learning. It was shown that the distance between the target speaker´s spectrum and the spectrum generated by the proposed interpolation method is reduced by about 40% compared with distance between the target speaker´s spectrum and spectrum of the speaker closest to the target among pre-stored ones

Keywords :

cepstral analysis; interpolation; speech processing; speech synthesis; adaptation experiment; cepstrum; interpolated speakers; log area ratio; speaker interpolation; spectral patterns; speech spectrum transformation; speech synthesis; target speaker; training data; Cepstrum; Degradation; Frequency conversion; Interpolation; Roentgenium; Shape; Speech synthesis; Telecommunications; Training data;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on

Conference_Location :

Adelaide, SA

ISSN :

1520-6149

Print_ISBN :

0-7803-1775-0

Type :

conf

DOI :

10.1109/ICASSP.1994.389256

Filename :

389256

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=290034