Title :
Performance Analysis of Thewarped Discrete Cosine Transform Cepstrum with MFCC Using Different Classifiers
Author :
Sangwan, Abhijeet ; Muralishankar, R. ; O´Shaughnessy, Douglas
Author_Institution :
Dept. of Electr. & Comput. Eng., Concordia Univ., Montreal, Que.
Abstract :
In this paper, we continue our investigation of the warped discrete cosine transform cepstrum (WDCTC), which was earlier introduced as a new speech processing feature (Muralishankar et al., 2005). Here, we study the statistical properties of the WDCTC and compare them with the mel-frequency cepstral coefficients (MFCC). We report some interesting properties of the WDCTC when compared to the MFCC: its statistical distribution is more Gaussian-like with lower variance, it obtains better vowel cluster separability, it forms tighter vowel clusters and generates better codebooks. Further, we employ the WDCTC and MFCC features in a 5-vowel recognition task using vector quantization (VQ), 1-nearest neighbour (1-NN), probabilistic neural network (PNN) and Gaussian discriminant analysis (GDA) as classifiers. Finally, we discuss the vowel recognition results in the context of the statistical properties of the WDCTC and MFCC. In our experiments, the WDCTC consistently outperforms the MFCC
Keywords :
Gaussian distribution; cepstral analysis; discrete cosine transforms; neural nets; signal classification; speech processing; speech recognition; vector quantisation; Gaussian discriminant analysis; Gaussian-like distribution; mel-frequency cepstral coefficients; nearest neighbour classifier; probabilistic neural network; speech classification; speech processing; statistical distribution; vector quantization; vowel cluster separability; vowel recognition; warped discrete cosine transform cepstrum; Cepstral analysis; Cepstrum; Discrete cosine transforms; Gaussian distribution; Mel frequency cepstral coefficient; Performance analysis; Speech processing; Speech recognition; Statistical distributions; Vector quantization;
Conference_Titel :
Machine Learning for Signal Processing, 2005 IEEE Workshop on
Conference_Location :
Mystic, CT
Print_ISBN :
0-7803-9517-4
DOI :
10.1109/MLSP.2005.1532882