DocumentCode :
3272364
Title :
Statistical eigenvoice: speaker features within S+N framework and a way towards language-independent voice conversion
Author :
Huang, Feng ; Yin, Junxun
Author_Institution :
Sch. of Electron. & Inf. Eng., South China Univ. of Technol., Guangzhou, China
fYear :
2005
fDate :
13-16 Dec. 2005
Firstpage :
33
Lastpage :
36
Abstract :
This paper presents a statistical method for speaker feature extraction and voice conversion within sinusoidal + noise (S+N) modeling framework. With fundamental researches on speaker characteristics embedded in the parameter sets of S+N model, we found the vector sets of statistical eigenvoice (SEV) and weighted statistical eigenvoice (wSEV), which are basis vectors of GMM representation, have significant properties: approximately speaker-dependent and language-independent. Piered by the feature vectors of SEV and wSEV, we address a new algorithm for context-free voice conversion. Subjective tests suggest that the SEV-based method achieves convincing results while maintaining high synthesis quality in comparison to the traditional LPC approaches.
Keywords :
eigenvalues and eigenfunctions; speech processing; statistical analysis; voice communication; S+N framework; language-independent voice conversion; sinusoidal + noise modeling framework; speaker feature extraction; statistical eigenvoice; weighted statistical eigenvoice; Linear predictive coding; Loudspeakers; Natural languages; Signal processing algorithms; Signal synthesis; Speech analysis; Speech processing; Speech synthesis; Statistical analysis; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Signal Processing and Communication Systems, 2005. ISPACS 2005. Proceedings of 2005 International Symposium on
Print_ISBN :
0-7803-9266-3
Type :
conf
DOI :
10.1109/ISPACS.2005.1595339
Filename :
1595339
Link To Document :
بازگشت