Title :
Arabic speech transformation using MFCC in GMM
Author :
Elmanfaloty, Rania ; Korany, N. ; Youssef, El-Sayed A.
Author_Institution :
Fac. of Eng., Electr. Eng. Dept., Alexandria Univ., Alexandria, Egypt
Abstract :
Voice conversion (VC) is a process which modifies the speech signal produced by one source speaker so that it sounds like another target speaker. In this paper the transformation is determined by using equal Arabic utterances from source and target speakers. A conversion function based on Gaussian mixture model (GMM) is used for transforming the spectral envelope described by Mel Frequency Cepstral Coefficients (MFCC). The quality of the transformed utterances is measured using subjective and objective evaluations.
Keywords :
Gaussian processes; speaker recognition; speech processing; Arabic speech transformation; GMM; Gaussian mixture model; MFCC; Mel frequency cepstral coefficients; VC; conversion function; source speaker; speech signal; target speakers; voice conversion; Discrete cosine transforms; Feature extraction; Filter banks; Mel frequency cepstral coefficient; Speech; Training; Vectors;
Conference_Titel :
Computer and Communication Engineering (ICCCE), 2012 International Conference on
Conference_Location :
Kuala Lumpur
Print_ISBN :
978-1-4673-0478-8
DOI :
10.1109/ICCCE.2012.6271314