DocumentCode :
3659671
Title :
Voice conversion: Wavelet based residual selection
Author :
Pramod Kachare;Alice Cheeran;Jagganath Nirmal;Mukesh Zaveri
Author_Institution :
Department of Electronics and Telecommunication Engineering, Ramrao Adik Institute of Technology, Nerul, Navi Mumbai 400706, India
fYear :
2015
Firstpage :
1513
Lastpage :
1518
Abstract :
Voice conversion has been studied over past few decades and yet no flawless system has been developed. Primary restriction in developing conversion systems is decayed output speech quality. Work presented here alleviates this problem by mapping higher order excitation features along with state of the art spectral parameters. Well known linear predictive analysis is used to extract shape of the vocal tract and corresponding residual signal. Higher feature dimensionality of the excitation signal is confronted using synchronous segmentation and windowing of the signal. Each of the resulting frames are wavelet analyzed to calculate normalized sub-band energy coefficients forming a codebook. Conversion is obtained by selecting target residual corresponding to minimized energy cost function. Primary advantage of this technique is reduced dimensionality with satisfactory conversion statistics. Proposed method is compared with baseline residual selection approach using various subjective and objective tests. Wavelet features provide better selection criteria with slight improvement in output speech individuality.
Keywords :
"Speech","Feature extraction","Training","Wavelet analysis","Hidden Markov models","Computer architecture","Wavelet transforms"
Publisher :
ieee
Conference_Titel :
Advances in Computing, Communications and Informatics (ICACCI), 2015 International Conference on
Print_ISBN :
978-1-4799-8790-0
Type :
conf
DOI :
10.1109/ICACCI.2015.7275827
Filename :
7275827
Link To Document :
بازگشت