DocumentCode
807790
Title
Visual learning by imitation with motor representations
Author
Lopes, Manuel ; Santos-Victor, José
Author_Institution
Inst. de Sistemas e Robotica, Inst. Superior Tecnico, Lisboa, Portugal
Volume
35
Issue
3
fYear
2005
fDate
6/1/2005 12:00:00 AM
Firstpage
438
Lastpage
449
Abstract
We propose a general architecture for action (mimicking) and program (gesture) level visual imitation. Action-level imitation involves two modules. The viewpoint transformation (VPT) performs a "rotation" to align the demonstrator\´s body to that of the learner. The visuo-motor map (VMM) maps this visual information to motor data. For program-level (gesture) imitation, there is an additional module that allows the system to recognize and generate its own interpretation of observed gestures to produce similar gestures/goals at a later stage. Besides the holistic approach to the problem, our approach differs from traditional work in i) the use of motor information for gesture recognition; ii) usage of context (e.g., object affordances) to focus the attention of the recognition system and reduce ambiguities, and iii) use iconic image representations for the hand, as opposed to fitting kinematic models to the video sequence. This approach is motivated by the finding of visuomotor neurons in the F5 area of the macaque brain that suggest that gesture recognition/imitation is performed in motor terms (mirror) and rely on the use of object affordances (canonical) to handle ambiguous actions. Our results show that this approach can outperform more conventional (e.g., pure visual) methods.
Keywords
gesture recognition; humanoid robots; image representation; image sequences; learning (artificial intelligence); psychology; robot kinematics; robot vision; F5 area; VMM map; VPT; action-level imitation; anthropomorphic robot; gesture recognition system; iconic image representation; kinematic model; macaque brain; motor data; motor information; motor representation; program level visual imitation; video sequence; viewpoint transformation; visual learning; visuo-motor map; visuomotor coordination; visuomotor neuron; Cognitive robotics; Focusing; Humans; Image recognition; Image representation; Mirrors; Neurons; Robot kinematics; Robot sensing systems; Video sequences; Anthropomorphic robots; imitation; learning; visuomotor coordination; Algorithms; Artificial Intelligence; Biomimetics; Bionics; Computer Simulation; Hand; Hand Strength; Humans; Image Interpretation, Computer-Assisted; Models, Biological; Motor Skills; Movement; Pattern Recognition, Automated; Robotics; Vision;
fLanguage
English
Journal_Title
Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
Publisher
ieee
ISSN
1083-4419
Type
jour
DOI
10.1109/TSMCB.2005.846654
Filename
1430829
Link To Document