DocumentCode
2237803
Title
An analysis of speakers´ gaze behavior for automatic addressee identification in multiparty conversation and its application to video editing
Author
Takemae, Yoshinao ; Otsuka, Kazuhiro ; Mukawa, Naoki
Author_Institution
NTT Commun. Sci. Lab., NTT Corp., Kanagawa, Japan
fYear
2004
fDate
20-22 Sept. 2004
Firstpage
581
Lastpage
586
Abstract
This work tackles the issue of the speaker-addressee links in face-to-face multiparty conversation. Systems that archive meetings and those that support teleconferences are attracting considerable interest. Conventional systems use a fixed-viewpoint camera and simple camera selection based on the participants´ utterances etc. Unfortunately, they fail to adequately convey who is talking to whom. To solve this problem, we must automatically detect the addressee or addressees and develop video editing rules that can clearly convey who is talking to whom. In this paper, to detect the addressee, we statistically analyze the speakers´ gaze behavior for (a) one-addressee utterances and (b) multi-addressee utterances. Experiments verify that speakers´ gaze behavior is 89% accurate in classifying addressee type, using the discrimination function obtained by discriminant analysis. Finally, we present three new video editing rules based on utterance type, and indicate the possibility of more successfully conveying who is talking to whom.
Keywords
image sequences; speaker recognition; statistical analysis; teleconferencing; video signal processing; addressee classification; automatic addressee identification; discriminant analysis; discrimination function; face to face multiparty conversation; fixed viewpoint camera; multiaddressee utterances; multiparty conversation; one addressee utterances; speaker addressee detection; speaker gaze behavior; statistical analysis; teleconferences; video editing; Cameras; Collaborative work; Face detection; Information analysis; Processor scheduling; Teleconferencing;
fLanguage
English
Publisher
ieee
Conference_Titel
Robot and Human Interactive Communication, 2004. ROMAN 2004. 13th IEEE International Workshop on
Print_ISBN
0-7803-8570-5
Type
conf
DOI
10.1109/ROMAN.2004.1374825
Filename
1374825
Link To Document