DocumentCode
3659671
Title
Voice conversion: Wavelet based residual selection
Author
Pramod Kachare;Alice Cheeran;Jagganath Nirmal;Mukesh Zaveri
Author_Institution
Department of Electronics and Telecommunication Engineering, Ramrao Adik Institute of Technology, Nerul, Navi Mumbai 400706, India
fYear
2015
Firstpage
1513
Lastpage
1518
Abstract
Voice conversion has been studied over past few decades and yet no flawless system has been developed. Primary restriction in developing conversion systems is decayed output speech quality. Work presented here alleviates this problem by mapping higher order excitation features along with state of the art spectral parameters. Well known linear predictive analysis is used to extract shape of the vocal tract and corresponding residual signal. Higher feature dimensionality of the excitation signal is confronted using synchronous segmentation and windowing of the signal. Each of the resulting frames are wavelet analyzed to calculate normalized sub-band energy coefficients forming a codebook. Conversion is obtained by selecting target residual corresponding to minimized energy cost function. Primary advantage of this technique is reduced dimensionality with satisfactory conversion statistics. Proposed method is compared with baseline residual selection approach using various subjective and objective tests. Wavelet features provide better selection criteria with slight improvement in output speech individuality.
Keywords
"Speech","Feature extraction","Training","Wavelet analysis","Hidden Markov models","Computer architecture","Wavelet transforms"
Publisher
ieee
Conference_Titel
Advances in Computing, Communications and Informatics (ICACCI), 2015 International Conference on
Print_ISBN
978-1-4799-8790-0
Type
conf
DOI
10.1109/ICACCI.2015.7275827
Filename
7275827
Link To Document