• DocumentCode
    807790
  • Title

    Visual learning by imitation with motor representations

  • Author

    Lopes, Manuel ; Santos-Victor, José

  • Author_Institution
    Inst. de Sistemas e Robotica, Inst. Superior Tecnico, Lisboa, Portugal
  • Volume
    35
  • Issue
    3
  • fYear
    2005
  • fDate
    6/1/2005 12:00:00 AM
  • Firstpage
    438
  • Lastpage
    449
  • Abstract
    We propose a general architecture for action (mimicking) and program (gesture) level visual imitation. Action-level imitation involves two modules. The viewpoint transformation (VPT) performs a "rotation" to align the demonstrator\´s body to that of the learner. The visuo-motor map (VMM) maps this visual information to motor data. For program-level (gesture) imitation, there is an additional module that allows the system to recognize and generate its own interpretation of observed gestures to produce similar gestures/goals at a later stage. Besides the holistic approach to the problem, our approach differs from traditional work in i) the use of motor information for gesture recognition; ii) usage of context (e.g., object affordances) to focus the attention of the recognition system and reduce ambiguities, and iii) use iconic image representations for the hand, as opposed to fitting kinematic models to the video sequence. This approach is motivated by the finding of visuomotor neurons in the F5 area of the macaque brain that suggest that gesture recognition/imitation is performed in motor terms (mirror) and rely on the use of object affordances (canonical) to handle ambiguous actions. Our results show that this approach can outperform more conventional (e.g., pure visual) methods.
  • Keywords
    gesture recognition; humanoid robots; image representation; image sequences; learning (artificial intelligence); psychology; robot kinematics; robot vision; F5 area; VMM map; VPT; action-level imitation; anthropomorphic robot; gesture recognition system; iconic image representation; kinematic model; macaque brain; motor data; motor information; motor representation; program level visual imitation; video sequence; viewpoint transformation; visual learning; visuo-motor map; visuomotor coordination; visuomotor neuron; Cognitive robotics; Focusing; Humans; Image recognition; Image representation; Mirrors; Neurons; Robot kinematics; Robot sensing systems; Video sequences; Anthropomorphic robots; imitation; learning; visuomotor coordination; Algorithms; Artificial Intelligence; Biomimetics; Bionics; Computer Simulation; Hand; Hand Strength; Humans; Image Interpretation, Computer-Assisted; Models, Biological; Motor Skills; Movement; Pattern Recognition, Automated; Robotics; Vision;
  • fLanguage
    English
  • Journal_Title
    Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1083-4419
  • Type

    jour

  • DOI
    10.1109/TSMCB.2005.846654
  • Filename
    1430829