Title :
Voice conversion system using SVM for vocal tract modification and codebook based model for pitch contour modification
Author :
Laskar, R.H. ; Talukdar, F.A. ; Bhattacharjee, Rajib ; Das, Saugat
Author_Institution :
Dept. of Electron. & Telecommun. Eng., Nat. Inst. of Technol., Silchar
Abstract :
The basic idea of this paper is to design an alternative voice conversion technique using support vector machine (SVM) as a regression tool that, converts the voice of a source speaker to specific standard target speaker. A nonlinear mapping function between the parameters for the acoustic features of the two speakers has been captured in our work. The vocal tract characteristics have been represented by the line spectral frequencies (LSFs). The kernel induced feature space using radial basis function network type SVM with Gaussian basis function have been used in our work. The codebook based technique has been used to modify the intonation characteristic (pitch contour). Mapping of the pitch contour has been achieved at the word level by associating the codebooks derived from the pitch contours of the source and the target speakers. The speech signals for the desired target speaker have been synthesized using the transformed LSFs along with the modified pitch contour and evaluated using both the subjective and the listening tests. The results signify that the proposed model improves the voice conversion performance in terms of capturing the speakerpsilas identity. However, the performance can further be improved by suitably modifying various user defined parameters used in regression analysis and using more training LSF vectors in the training stage.
Keywords :
nonlinear functions; radial basis function networks; regression analysis; speech processing; support vector machines; Gaussian basis function; SVM; codebook based model; line spectral frequencies; nonlinear mapping function; pitch contour modification; radial basis function network; regression analysis; regression tool; support vector machine; vocal tract modification; voice conversion system; Frequency; Kernel; Loudspeakers; Nonlinear acoustics; Radial basis function networks; Signal synthesis; Speech analysis; Speech synthesis; Support vector machines; Testing; Codebook; Intonation pattern; Pitch Contour; Radial Basis Function Network; Regression Analysis; Support Vector Machine; Vector Quantization;
Conference_Titel :
TENCON 2008 - 2008 IEEE Region 10 Conference
Conference_Location :
Hyderabad
Print_ISBN :
978-1-4244-2408-5
Electronic_ISBN :
978-1-4244-2409-2
DOI :
10.1109/TENCON.2008.4766412