• DocumentCode
    730370
  • Title
    Predicting next speaker based on head movement in multi-party meetings
  • Author
    Ishii, Ryo ; Kumano, Shiro ; Otsuka, Kazuhiro
  • Author_Institution
    NTT Commun. Sci. Labs., NTT Corp., Kyoto, Japan
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    2319
  • Lastpage
    2323
  • Abstract
    We proposed a model for predicting the next speaker in multi-party meetings by focusing on the participants' head movements, measured using a six-degrees-of-freedom head tracker. An analysis of head movements collected from multi-party meetings revealed differences in the amount, amplitude, and frequency of the speaker's head position movement and rotation near the end of an utterance between turn-keeping and turn-taking. The analysis also revealed differences in the amount, amplitude, and frequency of head position movement and rotation among listeners in turn-keeping, listeners in turn-taking, and the next speaker in turn-taking. We then built a next-speaker prediction model with two processing steps: predicting whether turn-taking or turn-keeping will occur, and predicting who the next speaker will be in turn-taking. The evaluation results suggest that both the speaker's and the listeners' head movements contribute to predicting the next speaker.
  • Keywords
    speaker recognition; head position movement; multiparty meetings; next speaker prediction model; turn-keeping; turn-taking; Azimuth; Magnetic heads; Predictive models; Speech; Timing; Tracking; Head movement; meeting analysis; multi-party meetings; next-speaker prediction
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type
    conf
  • DOI
    10.1109/ICASSP.2015.7178385
  • Filename
    7178385