DocumentCode :
3498410
Title :
Probabilistic Approach for Speaker Transformation
Author :
Gao Yin-qiu ; Yang Zhen
Author_Institution :
Inst. of Signal & Inf. Process., Nanjing Univ. of Posts & Telecommun., Nanjing
fYear :
2007
fDate :
21-25 Sept. 2007
Firstpage :
2845
Lastpage :
2848
Abstract :
A probabilistic approach of speaker transformation is proposed in this paper to make the speech of a source speaker sound like uttered by a target speaker. Speaker individuality transformation is achieved by altering characteristics of the speech spectrum and the supersegmental information such as fundamental pitch frequency. The main advantage of this scheme lies in the aspect of not only having considered the statistical property of both the source and target speech spectrum but also the relationship between them under a cross correlational model. And to make sure that the transformed speech signals are perceptually closer to the target speaker, prosody modification is also involved. The proposed scheme is evaluated using both subjective and objective measures. The experimental results show that the transformation system put forward is capable of effectively transforming speaker identity whilst the converted speech maintains high quality. And the whole performance is evaluated to be superior to the conventional vector quantization (VQ) based method.
Keywords :
probability; speech processing; conventional vector quantization; probabilistic approach; speaker transformation; speech spectrum; supersegmental information; Frequency; Information processing; Interpolation; Loudspeakers; Oral communication; Robustness; Signal processing; Speech enhancement; Speech synthesis; Vector quantization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Wireless Communications, Networking and Mobile Computing, 2007. WiCom 2007. International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-1311-9
Type :
conf
DOI :
10.1109/WICOM.2007.706
Filename :
4340481
Link To Document :
بازگشت