Title :
Analysis and modeling of next speaking start timing based on gaze behavior in multi-party meetings
Author :
Ishii, Ryo ; Otsuka, Kanji ; Kumano, Shiro ; Yamato, Junji
Author_Institution :
NTT Commun. Sci. Labs., NTT Corp., Kyoto, Japan
Abstract :
To realize a conversational interface in which an agent system can communicate smoothly with multiple people, it is imperative to know how the timing of the start of speaking is decided. In this research, we demonstrate a relationship between gaze transition patterns and the start timing of the next speaker's utterance relative to the end of the preceding utterance in multi-party meetings. We then construct a prediction model for the start timing using gaze transition patterns near the end of an utterance. An analysis of data collected from natural multi-party meetings reveals a strong relationship between the gaze transition patterns of the speaker, next speaker, and listener and the start timing of the next speaker. On the basis of these results, we used the gaze transition patterns of the speaker, next speaker, and listener, together with mutual gaze, as variables and devised several prediction models. The model using all features performed best and was able to predict the start timing well.
Keywords :
data analysis; speaker recognition; gaze behavior; multiparty meetings; next speaking start timing; speaker gaze transition patterns; Acoustics; Analytical models; Predictive models; Speech; Speech processing; Timing; Speaking timing; gaze transition pattern; multi-party meetings; mutual gaze; prediction model
Conference_Title :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
DOI :
10.1109/ICASSP.2014.6853685