DocumentCode :
2663616
Title :
Robust Segmentation of Speech Signal Using MFCC and Acoustic Parameters
Author :
Yessenbayev, Zhandos
Author_Institution :
Dept. of Inf. Technol., L.N. Gumilev Eurasian Nat. Univ., Astana, Kazakhstan
fYear :
2012
fDate :
29-31 May 2012
Firstpage :
103
Lastpage :
108
Abstract :
In this work, we investigate the effect of combining mel-frequency cepstral coefficients (MFCC) with acoustic parameters (AP) in the task of segmenting continuous speech into sonorant and obstruent regions using Hidden Markov Models (HMM) with Gaussian Mixture Models (GMM). Along with the influence of APs on the performance of the resulting model, we analyze the set of acoustic features extracted for each phoneme to see how robust they are to noise. All experiments were conducted on the TIMIT database. The results show that some APs have a good separating property and therefore improve the performance of a system when used with MFCCs, but they are not robust to noise. Other APs lack this separating property but are intrinsically stable in noisy conditions and, as a result, add some robustness to a system.
Keywords :
Gaussian processes; acoustic signal processing; hidden Markov models; speech synthesis; GMM; Gaussian mixture models; HMM; MFCC; TIMIT database; acoustic features extracted; acoustic parameters; continuous speech segmentation; intrinsic stability; mel-frequency cepstral coefficients; robust segmentation; sonorant; speech signal; Erbium; Mel frequency cepstral coefficient; Noise; Noise measurement; Speech
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Modelling Symposium (AMS), 2012 Sixth Asia
Conference_Location :
Bali
Print_ISBN :
978-1-4673-1957-7
Type :
conf
DOI :
10.1109/AMS.2012.26
Filename :
6243930