DocumentCode
466899
Title
Voice Conversion Adopting SOLAFS
Author
Zhao, Lei ; Gao, Yinqiu
Author_Institution
Qingdao Technol. Univ., Qingdao
Volume
1
fYear
2007
fDate
July 30 2007-Aug. 1 2007
Firstpage
543
Lastpage
548
Abstract
An improved method of voice conversion is proposed to make the speech of a source speaker sound like uttered by a target speaker. Speaker individuality transformation is achieved by altering the spectral envelope and prosodic information. The main advantage of this method is to firstly apply the synchronized overlap-add fixed synthesis (SOLAFS) to modify the source speaker´s speaking rate to match that of the target speaker, which enhances the performance of the whole conversion system compared with conventional systems without such a procedure. Besides, a precise estimation for the target excitation is advanced only with the information of the matched source´s excitation and the average pitch period of the target speaker. The proposed scheme is evaluated using both subjective and objective measures. The experimental results show that the system is capable of effectively transforming speaker identity whilst the converted speech maintains high quality.
Keywords
speech processing; prosodic information; speaker identity transformation; speaker individuality transformation; speaking rate matching; spectral envelope; synchronized overlap-add fixed synthesis; voice conversion; whole conversion system; Artificial intelligence; Distributed computing; Electronic mail; Frequency synchronization; Information processing; Loudspeakers; Signal processing; Software engineering; Speech enhancement; Speech synthesis;
fLanguage
English
Publisher
ieee
Conference_Titel
Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing, 2007. SNPD 2007. Eighth ACIS International Conference on
Conference_Location
Qingdao
Print_ISBN
978-0-7695-2909-7
Type
conf
DOI
10.1109/SNPD.2007.64
Filename
4287567
Link To Document