DocumentCode
417268
Title
A new voice activity detector using subband order-statistics filters for robust speech recognition
Author
Ramírez, J. ; Segura, J.C. ; Benirez, C. ; La Torre, A. De ; Rubio, A.
Author_Institution
Dept. de Electron. y Tecnologia de Computadores, Granada Univ., Spain
Volume
1
fYear
2004
fDate
17-21 May 2004
Abstract
Currently, there are technology barriers inhibiting speech processing systems working under extreme noisy conditions. The emerging applications of speech technology, especially in the fields of wireless communications, digital hearing aids or speech recognition, are some examples of such systems often requiring a noise reduction technique in combination with a precise voice activity detector (VAD). This paper presents a new VAD for improving speech detection robustness in noisy environments and the performance of speech recognition systems. The algorithm uses long-term information about the speech signal to formulate the decision rule and estimates the subband SNR using specialized order statistics filters (OSF). The proposed algorithm is compared to the most commonly used VAD in the field, in terms of speech/nonspeech discrimination and also in terms of recognition performance when the VAD is used in an automatic speech recognition (ASR) system. Experimental results demonstrate a sustained advantage over different VAD methods including standard VAD such as G.729 and AMR which are used as a reference, the VAD of the Advanced Front-End (AFE) for distributed speech recognition (DSR), and recently reported algorithms.
Keywords
decision trees; parameter estimation; signal detection; speech recognition; ASR system; VAD; automatic speech recognition; decision rule; noisy environments; performance; robust speech recognition; speech detection robustness; speech processing; subband SNR estimation; subband order-statistics filters; voice activity detector; Acoustical engineering; Automatic speech recognition; Detectors; Filters; Hearing aids; Noise robustness; Speech enhancement; Speech processing; Speech recognition; Wireless communication;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1326119
Filename
1326119
Link To Document