DocumentCode :
3325670
Title :
Personalized voice command systems in multi modal user interface
Author :
Kurniawati, Evelyn ; Celetto, Luca ; Capovilla, Nicola ; George, Sapna
fYear :
2012
fDate :
12-14 Jan. 2012
Firstpage :
45
Lastpage :
47
Abstract :
The goal of this paper is to describe the voice command system as part of the multi modal user interface for residential application project demoed at CES 2012. The application is a 3D TV panel which can be controlled through face recognition, gesture, and speech. The speech interface is invoked using activation keyword, and terminated in similar fashion with de-activation keyword. Speaker recognition is performed on the activation keyword to allow personalization of the voice commands available to the particular user, who in this scenario is a member of the household. A separate setting is also devised to enable guest user to have basic interaction with the system. Template matching scheme using dynamic time warping is employed for its simplicity and robustness to noise. The template chosen is a cluster of Gaussian Mixture Model (GMM), each representing a sub-word unit. A state model for voice interaction is presented to allow efficient operation of this interface.
Keywords :
Gaussian processes; pattern matching; speaker recognition; speech-based user interfaces; three-dimensional television; 3D TV panel; Gaussian mixture model cluster; activation keyword; deactivation keyword; dynamic time warping; face recognition; gesture recognition; multimodal user interface; personalized voice command systems; residential application project; speaker recognition; speech interface; template matching scheme; voice interaction; Accuracy; Hidden Markov models; Speaker recognition; Speech; Speech recognition; Training; User interfaces; Multi modal user interface; dynamic time warping; edits distance; speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Emerging Signal Processing Applications (ESPA), 2012 IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4673-0899-1
Type :
conf
DOI :
10.1109/ESPA.2012.6152442
Filename :
6152442
Link To Document :
بازگشت