Title :
Voice conversion: Wavelet based residual selection
Author :
Pramod Kachare;Alice Cheeran;Jagganath Nirmal;Mukesh Zaveri
Author_Institution :
Department of Electronics and Telecommunication Engineering, Ramrao Adik Institute of Technology, Nerul, Navi Mumbai 400706, India
Abstract :
Voice conversion has been studied for the past few decades, yet no flawless system has been developed. The primary restriction in developing conversion systems is degraded output speech quality. The work presented here alleviates this problem by mapping higher-order excitation features along with state-of-the-art spectral parameters. Well-known linear predictive analysis is used to extract the shape of the vocal tract and the corresponding residual signal. The high feature dimensionality of the excitation signal is handled using synchronous segmentation and windowing of the signal. Each resulting frame is wavelet analyzed to calculate normalized sub-band energy coefficients, which form a codebook. Conversion is obtained by selecting the target residual that minimizes an energy cost function. The primary advantage of this technique is reduced dimensionality with satisfactory conversion statistics. The proposed method is compared with a baseline residual selection approach using various subjective and objective tests. Wavelet features provide better selection criteria, with a slight improvement in output speech individuality.
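The selection step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The abstract does not name the wavelet, the decomposition depth, the cost function, or the codebook layout, so the sketch assumes a Haar wavelet, a small number of levels, a squared Euclidean distance between normalized sub-band energy vectors, and a codebook stored as (feature vector, target residual) pairs. The function names `haar_subband_energies` and `select_target_residual` are hypothetical.

```python
import math


def haar_subband_energies(frame, levels=3):
    """Return normalized sub-band energies of one residual frame.

    Assumption: a Haar wavelet stands in for whichever wavelet the paper uses.
    Output order: detail energies from finest to coarsest, then the final
    approximation energy. The values sum to 1 for a non-zero frame.
    """
    approx = list(frame)
    energies = []
    for _ in range(levels):
        if len(approx) % 2:
            # Pad odd-length signals by repeating the last sample.
            approx.append(approx[-1])
        pairs = list(zip(approx[0::2], approx[1::2]))
        detail = [(a - b) / math.sqrt(2) for a, b in pairs]
        approx = [(a + b) / math.sqrt(2) for a, b in pairs]
        energies.append(sum(d * d for d in detail))
    energies.append(sum(a * a for a in approx))
    total = sum(energies)
    if total == 0:
        return [0.0] * len(energies)
    return [e / total for e in energies]


def select_target_residual(source_frame, codebook, levels=3):
    """Return the index of the codebook entry with the lowest cost.

    Assumption: each codebook entry is a (features, target_residual) pair and
    the cost is the squared Euclidean distance between feature vectors.
    """
    query = haar_subband_energies(source_frame, levels)
    costs = [
        sum((q - f) ** 2 for q, f in zip(query, features))
        for features, _ in codebook
    ]
    return min(range(len(costs)), key=costs.__getitem__)
```

Because each frame is reduced to `levels + 1` normalized energies, the comparison runs over a handful of coefficients rather than over every residual sample, which is the dimensionality reduction the abstract refers to.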
Keywords :
"Speech","Feature extraction","Training","Wavelet analysis","Hidden Markov models","Computer architecture","Wavelet transforms"
Conference_Title :
Advances in Computing, Communications and Informatics (ICACCI), 2015 International Conference on
Print_ISBN :
978-1-4799-8790-0
DOI :
10.1109/ICACCI.2015.7275827