Title :
Language Bootstrapping: Learning Word Meanings From Perception–Action Association
Author :
Salvi, Giampiero ; Montesano, Luis ; Bernardino, Alexandre ; Santos-Victor, José
Author_Institution :
Dept. of Speech, Music & Hearing, Kungliga Tekniska Högskolan (KTH), Stockholm, Sweden
fDate :
6/1/2012 12:00:00 AM
Abstract :
We address the problem of bootstrapping language acquisition for an artificial system in a way similar to what is observed in experiments with human infants. Our method works by associating meanings with words in manipulation tasks, as a robot interacts with objects and listens to verbal descriptions of the interactions. The model is based on an affordance network, i.e., a mapping between robot actions, robot perceptions, and the perceived effects of these actions upon objects. We extend the affordance model to incorporate spoken words, which allows us to ground the verbal symbols in the execution of actions and the perception of the environment. The model takes verbal descriptions of a task as input and uses temporal co-occurrence to create links between speech utterances and the involved objects, actions, and effects. We show that the robot is able to form useful word-to-meaning associations, even without considering grammatical structure in the learning process and in the presence of recognition errors. These word-to-meaning associations are embedded in the robot's own understanding of its actions. Thus, they can be used directly to instruct the robot to perform tasks, and they also allow context to be incorporated in the speech recognition task. We believe that the encouraging results obtained with our approach may afford robots the capacity to acquire language descriptors in their operating environment and may shed some light on how this challenging process develops in human infants.
Keywords :
human-robot interaction; humanoid robots; intelligent robots; manipulators; speech recognition; unsupervised learning; affordance network; artificial system; bootstrapping language acquisition; humanoid robot; language bootstrapping; manipulation task; perception-action association; recognition error; robot interact; speech recognition task; speech utterance; temporal cooccurrence; unsupervised learning; verbal description; verbal symbol; word meaning learning; word-to-meaning association; Computational modeling; Context; Humans; Robot sensing systems; Speech; Speech recognition; Affordances; Bayesian networks; automatic speech recognition; cognitive robotics; grasping; humanoid robots; language; unsupervised learning; Algorithms; Artificial Intelligence; Biomimetics; Computer Simulation; Decision Support Techniques; Humans; Infant; Models, Theoretical; Natural Language Processing; Pattern Recognition, Automated; Robotics;
Journal_Title :
Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
DOI :
10.1109/TSMCB.2011.2172420