DocumentCode :
3457027
Title :
Investigation of MFCC feature representation for classification of spoken letters using Multi-Layer Perceptrons (MLP)
Author :
Daud, M.S. ; Yassin, I.M. ; Zabidi, A. ; Johari, M.A. ; Salleh, M.K.M.
Author_Institution :
Fac. of Electr. Eng., Univ. Teknol. Mara, Shah Alam, Malaysia
fYear :
2011
fDate :
4-7 Dec. 2011
Firstpage :
16
Lastpage :
20
Abstract :
In this paper, the Mel-Frequency Cepstral Coefficient (MFCC) is demonstrated as an effective feature representation method for spoken letters recognition. The Multi-Layer Perceptron (MLP) was used as a classifier to discriminate between two spoken letters - `A´ and `S´. The dataset consists of 72 samples (35 and 37 samples of spoken letters `A´ and `S´, respectively). The samples were represented using the Mel Frequency Cepstral Coefficients (MFCC). Several experiments were conducted to determine the optimal network parameters to yield the best classification results. The results indicate that the optimal network structure was with 2 hidden units, which yielded classification accuracy of 100% (training) and 93% (testing).
Keywords :
cepstral analysis; multilayer perceptrons; pattern classification; speech recognition; MFCC feature representation method; MLP; mel-frequency cepstral coefficient; multilayer perceptrons; optimal network parameters; optimal network structure; spoken letter classification; spoken letter recognition; Cepstrum; Classification algorithms; Computers; Feature extraction; MATLAB; Mel frequency cepstral coefficient; Mel-Frequency Cepstrum Coefficients (MFCC); Multi-Layer Perceptron (MLP); Pattern Classification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Applications and Industrial Electronics (ICCAIE), 2011 IEEE International Conference on
Conference_Location :
Penang
Print_ISBN :
978-1-4577-2058-1
Type :
conf
DOI :
10.1109/ICCAIE.2011.6162096
Filename :
6162096
Link To Document :
بازگشت