Voice Conversion Adopting SOLAFS

Author

Zhao, Lei ; Gao, Yinqiu

Author_Institution

Qingdao Technol. Univ., Qingdao

Volume

1

fYear

2007

fDate

July 30 2007-Aug. 1 2007

Firstpage

543

Lastpage

548

Abstract

An improved method of voice conversion is proposed to make the speech of a source speaker sound like uttered by a target speaker. Speaker individuality transformation is achieved by altering the spectral envelope and prosodic information. The main advantage of this method is to firstly apply the synchronized overlap-add fixed synthesis (SOLAFS) to modify the source speaker´s speaking rate to match that of the target speaker, which enhances the performance of the whole conversion system compared with conventional systems without such a procedure. Besides, a precise estimation for the target excitation is advanced only with the information of the matched source´s excitation and the average pitch period of the target speaker. The proposed scheme is evaluated using both subjective and objective measures. The experimental results show that the system is capable of effectively transforming speaker identity whilst the converted speech maintains high quality.

Keywords

speech processing; prosodic information; speaker identity transformation; speaker individuality transformation; speaking rate matching; spectral envelope; synchronized overlap-add fixed synthesis; voice conversion; whole conversion system; Artificial intelligence; Distributed computing; Electronic mail; Frequency synchronization; Information processing; Loudspeakers; Signal processing; Software engineering; Speech enhancement; Speech synthesis;

fLanguage

English

Publisher

ieee

Conference_Titel

Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing, 2007. SNPD 2007. Eighth ACIS International Conference on

Conference_Location

Qingdao

Print_ISBN

978-0-7695-2909-7

Type

conf

DOI

10.1109/SNPD.2007.64

Filename

4287567