Title :
Haptic Voice Recognition: Augmenting speech modality with touch events for efficient speech recognition
Author_Institution :
Nat. Univ. of Singapore, Singapore, Singapore
Abstract :
This paper proposes the Haptic Voice Recognition (HVR), a multi-modal interface that combines speech and touch sensory inputs to perform voice recognition. These touch inputs form a series of haptic events that provide cues or `landmarks´ for word boundaries. These word boundary cues greatly reduce the search space for speech recognition, thereby making the decoding process more efficient and suitable for portable devices with limited compute and memory resources. Furthermore, having the knowledge of word boundaries also suppresses insertion and deletion errors. This is particularly helpful when recognition is performed in noisy environment. In this paper, a series of experiments were conducted to study the feasibility of augmenting touch events to automatic speech recognition and explore its potential benefits. Experiments were conducted with syntactically simulated haptic events on the Wall Street Journal database as well as realistic haptic events acquired using a prototype HVR interface implemented on a touchscreen device.
Keywords :
decoding; haptic interfaces; speech recognition; speech-based user interfaces; tactile sensors; touch sensitive screens; HVR interface; Wall Street Journal database; augmenting speech modality; automatic speech recognition; decoding; haptic voice recognition; multimodal interface; noisy environment; portable devices; search space; touch events; touch sensory inputs; touchscreen device; haptic events; multimodal interface; voice recognition;
Conference_Titel :
Spoken Language Technology Workshop (SLT), 2010 IEEE
Conference_Location :
Berkeley, CA
Print_ISBN :
978-1-4244-7904-7
Electronic_ISBN :
978-1-4244-7902-3
DOI :
10.1109/SLT.2010.5700825