• DocumentCode
    2237803
  • Title

    An analysis of speakers' gaze behavior for automatic addressee identification in multiparty conversation and its application to video editing

  • Author

    Takemae, Yoshinao ; Otsuka, Kazuhiro ; Mukawa, Naoki

  • Author_Institution
    NTT Commun. Sci. Lab., NTT Corp., Kanagawa, Japan
  • fYear
    2004
  • fDate
    20-22 Sept. 2004
  • Firstpage
    581
  • Lastpage
    586
  • Abstract
    This work tackles the issue of the speaker-addressee links in face-to-face multiparty conversation. Systems that archive meetings and those that support teleconferences are attracting considerable interest. Conventional systems use a fixed-viewpoint camera and simple camera selection based on the participants' utterances, etc. Unfortunately, they fail to adequately convey who is talking to whom. To solve this problem, we must automatically detect the addressee or addressees and develop video editing rules that can clearly convey who is talking to whom. In this paper, to detect the addressee, we statistically analyze the speakers' gaze behavior for (a) one-addressee utterances and (b) multi-addressee utterances. Experiments verify that speakers' gaze behavior is 89% accurate in classifying addressee type, using the discrimination function obtained by discriminant analysis. Finally, we present three new video editing rules based on utterance type, and indicate the possibility of more successfully conveying who is talking to whom.
  • Keywords
    image sequences; speaker recognition; statistical analysis; teleconferencing; video signal processing; addressee classification; automatic addressee identification; discriminant analysis; discrimination function; face to face multiparty conversation; fixed viewpoint camera; multiaddressee utterances; multiparty conversation; one addressee utterances; speaker addressee detection; speaker gaze behavior; teleconferences; video editing; Cameras; Collaborative work; Face detection; Information analysis; Processor scheduling; Teleconferencing
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Robot and Human Interactive Communication, 2004. ROMAN 2004. 13th IEEE International Workshop on
  • Print_ISBN
    0-7803-8570-5
  • Type

    conf

  • DOI
    10.1109/ROMAN.2004.1374825
  • Filename
    1374825