• DocumentCode
    137682
  • Title

    “Look at this!” learning to guide visual saliency in human-robot interaction

  • Author

    Schauerte, Boris ; Stiefelhagen, Rainer

  • Author_Institution
    Inst. for Anthropomatics, Karlsruhe Inst. of Technol., Karlsruhe, Germany
  • fYear
    2014
  • fDate
    14-18 Sept. 2014
  • Firstpage
    995
  • Lastpage
    1002
  • Abstract
    We learn to direct visual saliency in multimodal (i.e., pointing gestures and spoken references) human-robot interaction to highlight and segment arbitrary referent objects. For this purpose, we train a conditional random field to integrate features that reflect low-level visual saliency, the likelihood of salient objects, the probability that a given pixel is pointed at, and - if available - spoken information about the target object´s visual appearance. As such, this work integrates several of our ideas and approaches, ranging from multi-scale spectral saliency detection, spatially debiased salient object detection, computational attention in human-robot interaction to learning robust color term models. We demonstrate that this machine learning driven integration outperforms the previously reported results on two datasets, one dataset without and one with spoken object references. In summary, for automatically detected pointing gestures and automatically extracted object references, our approach improves the rate at which the correct object is included in the initial focus of attention by 10.37% in the absence and 25.21% in the presence of spoken target object information.
  • Keywords
    human-robot interaction; learning (artificial intelligence); object detection; random processes; robot vision; conditional random field; human-robot interaction; low-level visual saliency; machine learning driven integration; multiscale spectral saliency detection; object visual appearance; pointing gesture; robust color term model; salient object detection; Image color analysis; Image segmentation; Joints; Object detection; Object recognition; Vectors; Visualization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Robots and Systems (IROS 2014), 2014 IEEE/RSJ International Conference on
  • Conference_Location
    Chicago, IL
  • Type

    conf

  • DOI
    10.1109/IROS.2014.6942680
  • Filename
    6942680