Title :
Voice Conversion Adopting SOLAFS
Author :
Zhao, Lei ; Gao, Yinqiu
Author_Institution :
Qingdao Technol. Univ., Qingdao
fDate :
July 30 2007-Aug. 1 2007
Abstract :
An improved voice conversion method is proposed to make the speech of a source speaker sound as if it were uttered by a target speaker. Speaker individuality is transformed by altering the spectral envelope and prosodic information. The main advantage of the method is that it first applies synchronized overlap-add fixed synthesis (SOLAFS) to modify the source speaker's speaking rate to match that of the target speaker, which improves the performance of the whole conversion system compared with conventional systems that lack this step. In addition, a precise estimate of the target excitation is obtained using only the matched source excitation and the average pitch period of the target speaker. The proposed scheme is evaluated using both subjective and objective measures. The experimental results show that the system effectively transforms speaker identity while the converted speech maintains high quality.
Keywords :
speech processing; prosodic information; speaker identity transformation; speaker individuality transformation; speaking rate matching; spectral envelope; synchronized overlap-add fixed synthesis; voice conversion; whole conversion system; Artificial intelligence; Distributed computing; Electronic mail; Frequency synchronization; Information processing; Loudspeakers; Signal processing; Software engineering; Speech enhancement; Speech synthesis
Conference_Titel :
Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing, 2007. SNPD 2007. Eighth ACIS International Conference on
Conference_Location :
Qingdao
Print_ISBN :
978-0-7695-2909-7
DOI :
10.1109/SNPD.2007.64