Voice conversion: Wavelet based residual selection

Author

Pramod Kachare;Alice Cheeran;Jagganath Nirmal;Mukesh Zaveri

Author_Institution

Department of Electronics and Telecommunication Engineering, Ramrao Adik Institute of Technology, Nerul, Navi Mumbai 400706, India

fYear

2015

Firstpage

1513

Lastpage

1518

Abstract

Voice conversion has been studied for the past few decades, yet no flawless system has been developed. The primary limitation of conversion systems is degraded output speech quality. The work presented here alleviates this problem by mapping higher-order excitation features along with state-of-the-art spectral parameters. Well-known linear predictive analysis is used to extract the shape of the vocal tract and the corresponding residual signal. The high feature dimensionality of the excitation signal is addressed using synchronous segmentation and windowing of the signal. Each resulting frame is wavelet-analyzed to calculate normalized sub-band energy coefficients, which form a codebook. Conversion is obtained by selecting the target residual that minimizes an energy cost function. The primary advantage of this technique is reduced dimensionality with satisfactory conversion statistics. The proposed method is compared with a baseline residual selection approach using various subjective and objective tests. Wavelet features provide better selection criteria, with a slight improvement in output speech individuality.
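
The selection step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the Haar wavelet, the three decomposition levels, the squared Euclidean cost, and the names haar_subband_energies and select_residual are all assumptions made for illustration, since the abstract does not specify the wavelet family, the number of sub-bands, or the exact form of the energy cost function.

```python
import numpy as np

def haar_subband_energies(frame, levels=3):
    """Normalized sub-band energies of a frame (Haar wavelet, assumed here)."""
    approx = np.asarray(frame, dtype=float)
    energies = []
    for _ in range(levels):
        if approx.size % 2:                      # zero-pad odd-length signals
            approx = np.append(approx, 0.0)
        even, odd = approx[0::2], approx[1::2]
        detail = (even - odd) / np.sqrt(2.0)     # high-pass (detail) band
        approx = (even + odd) / np.sqrt(2.0)     # low-pass (approximation) band
        energies.append(np.sum(detail ** 2))
    energies.append(np.sum(approx ** 2))         # final approximation band
    energies = np.array(energies)
    total = energies.sum()
    # Normalize so the coefficients sum to 1; an all-zero frame stays all zero.
    return energies / total if total > 0 else energies

def select_residual(query_frame, codebook_features, codebook_residuals, levels=3):
    """Pick the codebook residual whose stored features minimize the cost.

    The cost is a squared Euclidean distance between normalized sub-band
    energy vectors (an assumed stand-in for the paper's energy cost function).
    """
    query = haar_subband_energies(query_frame, levels)
    costs = np.sum((np.asarray(codebook_features) - query) ** 2, axis=1)
    best = int(np.argmin(costs))
    return best, codebook_residuals[best]
```

In this sketch each residual frame is reduced to levels + 1 coefficients regardless of its length, which is the dimensionality reduction the abstract refers to. A hypothetical codebook would stack one feature vector per training frame in codebook_features and keep the paired target residual frames in codebook_residuals.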

Keywords

"Speech","Feature extraction","Training","Wavelet analysis","Hidden Markov models","Computer architecture","Wavelet transforms"

Publisher

IEEE

Conference_Titel

Advances in Computing, Communications and Informatics (ICACCI), 2015 International Conference on

Print_ISBN

978-1-4799-8790-0

Type

conf

DOI

10.1109/ICACCI.2015.7275827

Filename

7275827