Title :
Optimizing speech/non-speech classifier design using AdaBoost
Author :
Kwon, Oh-Wook ; Lee, Te-Won
Author_Institution :
Inst. for Neural Comput., Univ. of California, La Jolla, CA, USA
Abstract :
We propose a new method to design speech/non-speech classifiers for voice activity detection and robust endpoint detection using the adaptive boosting (AdaBoost) algorithm. The method uses a combination of simple base classifiers through the AdaBoost algorithm and a set of optimized speech features combined with spectral subtraction. The key benefits of this method are the simple implementation and low computational complexity. The AdaBoost classifier combined with spectral subtraction significantly improved the receiver operating characteristic curves of the G.729 voice activity detector. For speech recognition purpose, the method reduced 20-50% of miss errors for the same false alarm rate by using additional band pass energy and spectral distortion based on mel frequency cepstral coefficients.
Keywords :
optimisation; pattern classification; speech recognition; AdaBoost; G.729 voice activity detector; adaptive boosting algorithm; band pass energy; computational complexity; false alarm rate; mel frequency cepstral coefficients; miss errors; optimized speech features; receiver operating characteristic curves; robust endpoint detection; spectral distortion; spectral subtraction; speech recognition purpose; speech/nonspeech classifiers; voice activity detection; Algorithm design and analysis; Boosting; Computational complexity; Design methodology; Design optimization; Detectors; Mel frequency cepstral coefficient; Optimization methods; Robustness; Speech recognition;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
Print_ISBN :
0-7803-7663-3
DOI :
10.1109/ICASSP.2003.1198811