Title :
Gesture, speech, and gaze cues for discourse segmentation
Author :
Quek, Francis ; McNeill, David ; Bryll, Robert ; Kirbas, Cemil ; Arslan, Hasan ; McCullough, Karl E. ; Furuyama, Nobuhiro ; Ansari, Rashid
Author_Institution :
Vision Interfaces & Syst. Lab., Wright State Univ., Dayton, OH, USA
Abstract :
Psycholinguistic evidence has established the complementary nature of the verbal and non-verbal aspects of human expression. We present our findings in the detection of these cues in interaction. We use the psycholinguistic device known as the 'catchment' as the locus of integration of gesture, speech and gaze components. Using a calibrated three-camera setup, we videotape conversation elicitation experiments in which subjects convey complex spatial plans to an interlocutor. We extract the gestural motion of both hands, gaze direction, and voiced units in the discourse and compare these with transcripts generated by expert microanalysis of the video. Our results show the complementary nature of these communicative modalities. Where there is ambiguity in the structure of one modality (such as in haplologies or owing to noise in the audio signal), other modalities provide evidence for correct segmentation.
Keywords :
gesture recognition; image segmentation; catchment; discourse segmentation; expert microanalysis; gaze direction; gestural motion; segmentation; Computer vision; Humans; Laboratories; Machine vision; Motion analysis; Psychology; Speech analysis; Speech enhancement; Torso; Video signal processing
Conference_Title :
Computer Vision and Pattern Recognition, 2000. Proceedings. IEEE Conference on
Conference_Location :
Hilton Head Island, SC
Print_ISBN :
0-7695-0662-3
DOI :
10.1109/CVPR.2000.854800