Title :
The “UTDrive” in-vehicle voice activity detection system
Author :
Yu, Tao ; Hansen, John H L
Author_Institution :
CRSS: Center for Robust Speech Syst., Univ. of Texas at Dallas, Richardson, TX, USA
Abstract :
In this study, we specifically address the problem of in-vehicle voice activity detection (VAD), which has a significant importance for the speech controlled intelligent vehicle. A novel VAD system is proposed based on microphone array beam- forming and discriminative Gaussian mixture model. As a binary classification problem, the features and classifiers are explored under the in-vehicle acoustic environment. Using microphone array, we show that the spatial power ratio can serve as an effective feature for speech activity detection. Further, a discriminative training based Gaussian mixture model (GMM) classifier is employed to enhance the VAD performance in terms of receiver operating characteristics (ROC). Compared to the conventional VAD systems, the proposed VAD system presents a novel and robust performance against various in-vehicle noisy scenarios from the UTDrive project.
Keywords :
Gaussian processes; array signal processing; microphone arrays; speech recognition; speech-based user interfaces; traffic engineering computing; UTDrive project; binary classification problem; discriminative Gaussian mixture model; in-vehicle voice activity detection system; microphone array beam-forming; receiver operating characteristics; speech activity detection; speech controlled intelligent vehicle; Acoustic signal detection; Automatic speech recognition; Intelligent vehicles; Microphone arrays; Power system modeling; Speech enhancement; Speech processing; USA Councils; Vehicle detection; Vehicle safety; discriminative training; microphone array; voice activity detection;
Conference_Titel :
Intelligent Vehicles Symposium (IV), 2010 IEEE
Conference_Location :
San Diego, CA
Print_ISBN :
978-1-4244-7866-8
DOI :
10.1109/IVS.2010.5547963