DocumentCode :
3527365
Title :
The “UTDrive” in-vehicle voice activity detection system
Author :
Yu, Tao ; Hansen, John H L
Author_Institution :
CRSS: Center for Robust Speech Syst., Univ. of Texas at Dallas, Richardson, TX, USA
fYear :
2010
fDate :
21-24 June 2010
Firstpage :
162
Lastpage :
167
Abstract :
In this study, we specifically address the problem of in-vehicle voice activity detection (VAD), which has a significant importance for the speech controlled intelligent vehicle. A novel VAD system is proposed based on microphone array beam- forming and discriminative Gaussian mixture model. As a binary classification problem, the features and classifiers are explored under the in-vehicle acoustic environment. Using microphone array, we show that the spatial power ratio can serve as an effective feature for speech activity detection. Further, a discriminative training based Gaussian mixture model (GMM) classifier is employed to enhance the VAD performance in terms of receiver operating characteristics (ROC). Compared to the conventional VAD systems, the proposed VAD system presents a novel and robust performance against various in-vehicle noisy scenarios from the UTDrive project.
Keywords :
Gaussian processes; array signal processing; microphone arrays; speech recognition; speech-based user interfaces; traffic engineering computing; UTDrive project; binary classification problem; discriminative Gaussian mixture model; in-vehicle voice activity detection system; microphone array beam-forming; receiver operating characteristics; speech activity detection; speech controlled intelligent vehicle; Acoustic signal detection; Automatic speech recognition; Intelligent vehicles; Microphone arrays; Power system modeling; Speech enhancement; Speech processing; USA Councils; Vehicle detection; Vehicle safety; discriminative training; microphone array; voice activity detection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Vehicles Symposium (IV), 2010 IEEE
Conference_Location :
San Diego, CA
ISSN :
1931-0587
Print_ISBN :
978-1-4244-7866-8
Type :
conf
DOI :
10.1109/IVS.2010.5547963
Filename :
5547963
Link To Document :
بازگشت