• DocumentCode
    985299
  • Title

    Mental imagery for a conversational robot

  • Author

    Roy, Deb ; Hsiao, Kai-yuh ; Mavridis, Nikolaos

  • Author_Institution
    Media Lab., MIT, Cambridge, MA, USA
  • Volume
    34
  • Issue
    3
  • fYear
    2004
  • fDate
    6/1/2004 12:00:00 AM
  • Firstpage
    1374
  • Lastpage
    1383
  • Abstract
    To build robots that engage in fluid face-to-face spoken conversations with people, robots must have ways to connect what they say to what they see. A critical aspect of how language connects to vision is that language encodes points of view. The meaning of my left and your left differs due to an implied shift of visual perspective. The connection of language to vision also relies on object permanence. We can talk about things that are not in view. For a robot to participate in situated spoken dialog, it must have the capacity to imagine shifts of perspective, and it must maintain object permanence. We present a set of representations and procedures that enable a robotic manipulator to maintain a "mental model" of its physical environment by coupling active vision to physical simulation. Within this model, "imagined" views can be generated from arbitrary perspectives, providing the basis for situated language comprehension and production. An initial application of mental imagery for spatial language understanding for an interactive robot is described.
  • Keywords
    manipulators; natural languages; robot vision; simulation; speech processing; active vision; conversational robot; face-to-face spoken conversation; interactive robot; mental imagery; mental model; object permanence; physical simulation; robotic manipulator; situated language comprehension; situated spoken dialog; spatial language understanding; Computer vision; Context awareness; Face detection; Grounding; Image converters; Manipulators; Natural languages; Navigation; Orbital robotics; Robot vision systems; Algorithms; Artificial Intelligence; Communication; Image Interpretation, Computer-Assisted; Imagination; Imaging, Three-Dimensional; Mental Processes; Natural Language Processing; Robotics; User-Computer Interface; Vocabulary, Controlled;
  • fLanguage
    English
  • Journal_Title
    Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1083-4419
  • Type

    jour

  • DOI
    10.1109/TSMCB.2004.823327
  • Filename
    1298887