Title :
Voice activity detection using AdaBoost with multi-frame information
Author :
Usukura, Tohru ; Mitsuhashi, Wataru
Author_Institution :
Dept. of Inf.&Commun. Eng., Univ. of Electro-Commun., Chofu
Abstract :
A noise robust scheme for voice activity detection (VAD) that employs a combination of both intra- and inter-frame acoustic features is presented in this paper. As intra-frame features full-band energy and mel-frequency cepstrum coefficient (MFCC) are calculated whereas integrated bispectrum is estimated as inter-frame features. The parameters combined by intra- and inter-frame features are sorted out by using adaptive boosting (AdaBoost) algorithm, thereby resulting in a better performance in contrast to a scheme with only a single feature extracted from every frame. On the basis of VAD evaluation framework, CENSREC-1-C (Corpora and Environments for Noisy Speech RECognition-1 Concatenated), the accuracy of the proposed VAD scheme is examined. The results of numerical experiments suggest that the performance of the proposed VAD scheme significantly outperforms conventional methods in real noisy environments.
Keywords :
cepstral analysis; feature extraction; speech recognition; AdaBoost; CENSREC-1-C; adaptive boosting; inter-frame acoustic features; intra-frame acoustic features; mel-frequency cepstrum coefficient; multi-frame information; noisy speech recognition; voice activity detection; Acoustic noise; Acoustic signal detection; Boosting; Cepstrum; Feature extraction; Mel frequency cepstral coefficient; Noise robustness; Speech analysis; Speech recognition; Working environment noise;
Conference_Titel :
Signal Processing and Communication Systems, 2008. ICSPCS 2008. 2nd International Conference on
Conference_Location :
Gold Coast
Print_ISBN :
978-1-4244-4243-0
Electronic_ISBN :
978-1-4244-4243-0
DOI :
10.1109/ICSPCS.2008.4813692