DocumentCode :
395185
Title :
Mining speech: automatic selection of heterogeneous features using boosting
Author :
Klautau, Aldebaro
Author_Institution :
Dept. of Electr. & Comput. Eng., California Univ., San Diego, La Jolla, CA, USA
Volume :
2
fYear :
2003
fDate :
6-10 April 2003
Abstract :
We investigate feature selection applied to automatic speech recognition (ASR) systems. We focus on systems based on support vector machines (SVM), which can naturally use features optimized for each classifier. We present a new method for feature selection based on the AdaBoost algorithm. This method was an order of magnitude faster than a similar one, while leading to equivalent accuracy. Experiments with phone classification using TIMIT and a total of 760 features (PLP, MFCC, Seneff´s, formants, etc.) indicated that the proposed method automatically discovered important information in the data. When using only 25 selected features per SVM, the accuracy was higher than when using a homogeneous set of 118 features based on PLP (perceptual linear prediction) coefficients.
Keywords :
feature extraction; learning automata; optimisation; pattern classification; speech recognition; AdaBoost algorithm; automatic speech recognition; feature selection; perceptual linear prediction; phone classification; speech mining; support vector machines; Automatic speech recognition; Boosting; Cepstrum; Hidden Markov models; Mel frequency cepstral coefficient; Oral communication; Psychoacoustic models; Psychology; Support vector machine classification; Support vector machines;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-7663-3
Type :
conf
DOI :
10.1109/ICASSP.2003.1202282
Filename :
1202282
Link To Document :
بازگشت