DocumentCode :
3249164
Title :
Gesture cues for conversational interaction in monocular video
Author :
Quek, Francis ; McNeill, David ; Ansari, Rashid ; Ma, Xin-Feng ; Bryll, Robert ; Duncan, Susan ; McCullough, Karl E.
Author_Institution :
Vision Interfaces & Syst. Lab., Wright State Univ., Dayton, OH, USA
fYear :
1999
fDate :
1999
Firstpage :
119
Lastpage :
126
Abstract :
We present our work on the determination of cues for discourse segmentation in free-form gesticulation accompanying speech in natural conversation. The basis for this integrating between gesticulation and speech discourse is the psycholinguistic concept of the co-equal generation of gesture and speech from the same semantic intent. We use the psycholinguistic device known as the `catchment´ as the locus around which this integration proceeds. We videotape gesture and speech elicitation experiments in which a subject describes her living space to an interlocutor. We extract the gestural motion of both hands using the Vector Coherence Mapping algorithm that combines spatial, momentum and skin color constraints in parallel using a fuzzy image processing approach. We extract the voiced units in the discourse as F0 units are correlate these with transcribed speech. Psycholinguistics researchers perceptually micro-analyze the same video tape to produce a transcript that is annotated with the video timestamp and perceived gesture-speech entities. These serve to direct our high level analysis of the gesture trace and F0 data. We report the results of our analysis that show that the feature of `handedness´ and the kind of symmetry in two-handed gestures provide effective cues for discourse segmentation. We also present observations on how the gesture traces provide cues to segment hand use, high level discourse repair and super-segmental cues for discourse grouping
Keywords :
computer vision; gesture recognition; speech recognition; conversational interaction; discourse grouping; discourse segmentation; fuzzy image processing; gesture cues; gesture-speech entities; monocular video; natural conversation; psycholinguistic concept; speech discourse; speech elicitation experiments; Color; Image converters; Image processing; Image segmentation; Laboratories; Machine vision; Psychology; Read only memory; Skin; Speech analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Recognition, Analysis, and Tracking of Faces and Gestures in Real-Time Systems, 1999. Proceedings. International Workshop on
Conference_Location :
Corfu
Print_ISBN :
0-7695-0378-0
Type :
conf
DOI :
10.1109/RATFG.1999.799234
Filename :
799234
Link To Document :
بازگشت