DocumentCode
394280
Title
Optimizing speech/non-speech classifier design using AdaBoost
Author
Kwon, Oh-Wook ; Lee, Te-Won
Author_Institution
Inst. for Neural Comput., Univ. of California, La Jolla, CA, USA
Volume
1
fYear
2003
fDate
6-10 April 2003
Abstract
We propose a new method to design speech/non-speech classifiers for voice activity detection and robust endpoint detection using the adaptive boosting (AdaBoost) algorithm. The method uses a combination of simple base classifiers through the AdaBoost algorithm and a set of optimized speech features combined with spectral subtraction. The key benefits of this method are the simple implementation and low computational complexity. The AdaBoost classifier combined with spectral subtraction significantly improved the receiver operating characteristic curves of the G.729 voice activity detector. For speech recognition purpose, the method reduced 20-50% of miss errors for the same false alarm rate by using additional band pass energy and spectral distortion based on mel frequency cepstral coefficients.
Keywords
optimisation; pattern classification; speech recognition; AdaBoost; G.729 voice activity detector; adaptive boosting algorithm; band pass energy; computational complexity; false alarm rate; mel frequency cepstral coefficients; miss errors; optimized speech features; receiver operating characteristic curves; robust endpoint detection; spectral distortion; spectral subtraction; speech recognition purpose; speech/nonspeech classifiers; voice activity detection; Algorithm design and analysis; Boosting; Computational complexity; Design methodology; Design optimization; Detectors; Mel frequency cepstral coefficient; Optimization methods; Robustness; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-7663-3
Type
conf
DOI
10.1109/ICASSP.2003.1198811
Filename
1198811
Link To Document