DocumentCode
417218
Title
A differential spectral voice activity detector
Author
Garner, Philip N. ; Fukada, Toshiaki ; Komori, Yasuhiro
Author_Institution
Canon Inc, Tokyo, Japan
Volume
1
fYear
2004
fDate
17-21 May 2004
Abstract
The voice activity detection (VAD) problem is placed into a decision theoretic framework, and the Gaussian VAD model of Sohn et al. (1998, 1999) is then shown to fit well with the framework. It is argued that the Gaussian model can be made more robust to correlation and expected spectral shapes of speech and noise by using a differential spectral representation. Such a model is formulated theoretically. The differential spectral VAD is then shown by experiment to compare favourably with the basic Gaussian VAD in a speech recognition setting, especially for noisy environments.
Keywords
Gaussian distribution; decision theory; signal representation; spectral analysis; speech recognition; Gaussian VAD model; correlation robustness; differential spectral representation; spectral shapes; speech recognition; voice activity detector; Automatic speech recognition; Cost function; Detectors; Gaussian noise; Noise robustness; Noise shaping; Spectral shape; Speech enhancement; Speech recognition; Working environment noise;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1326056
Filename
1326056
Link To Document