DocumentCode :
387935
Title :
Contributions of pitch, formant frequency and bandwidth to the perception of voice-personality
Author :
Takagi, Tohru ; Kuwabara, Hisao
Author_Institution :
NHK Science and Technical Research Laboratories, Tokyo, Japan
Volume :
11
fYear :
1986
fDate :
31503
Firstpage :
889
Lastpage :
892
Abstract :
To investigate the contributions of the resonance characteristics of vocal tract and the pitch frequency to the voice-personality, a pitch synchronous analysis/synthesis system has been developed which is capable of independent manipulation of formant frequencies, bandwidths, and pitch frequencies. This paper gives the results of perceptual experiments on voice personality for spectrum and pitch modified speech using this system. Experimental results show that the perception of voice-personality is significantly sensitive to the formant frequency shift and it is almost completely lost for the uniform shift of all formant frequencies larger than 5 percent. The manipulation of pitch frequency and formant bandwidths, on the other hand, is less sensitive to the perception of voice-personality. This study is important as a basic research for the speaker normalization on one hand, and for adding personal information to the non-personal synthetic speech on the other.
Keywords :
Bandwidth; Equations; Frequency estimation; Frequency synthesizers; Laboratories; Linear predictive coding; Resonance; Signal synthesis; Speech analysis; Speech synthesis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '86.
Type :
conf
DOI :
10.1109/ICASSP.1986.1168977
Filename :
1168977
Link To Document :
بازگشت