• DocumentCode
    177654
  • Title

    Analysis and modeling of next speaking start timing based on gaze behavior in multi-party meetings

  • Author

    Ishii, Ryo ; Otsuka, Kanji ; Kumano, Shiro ; Yamato, Junji

  • Author_Institution
    NTT Commun. Sci. Labs., NTT Corp., Kyoto, Japan
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    694
  • Lastpage
    698
  • Abstract
    To realize a conversational interface where an agent system can smoothly communicate with multiple persons, it is imperative to know how the start timing of speaking is decided. In this research, we demonstrate a relationship between gaze transition patterns and the start timing of next speaking against the end of the last speaking in multi-party meetings. Then, we construct a prediction model for the start timing using gaze transition patterns near the end of an utterance. An analysis of data collected from natural multi-party meetings reveals a strong relationship between gaze transition patterns of the speaker, next speaker, and listener and the start timing of the next speaker. On the basis of the results, we used gaze transition patterns of the speaker, next speaker, and listener and mutual gaze as variables, and devised several prediction models. A model using all features performed the best and was able to predict the start timing well.
  • Keywords
    data analysis; speaker recognition; data analysis; gaze behavior; multiparty meetings; next speaking start timing; speaker gaze transition patterns; Acoustics; Analytical models; Predictive models; Speech; Speech processing; Timing; Speaking timing; gaze transition pattern; multi-party meetings; mutual gaze; prediction model;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6853685
  • Filename
    6853685