Title :
Grounded spoken language acquisition: experiments in word learning
Author_Institution :
Media Lab., Massachusetts Inst. of Technol., Cambridge, MA, USA
fDate :
6/1/2003 12:00:00 AM
Abstract :
Language is grounded in sensory-motor experience. Grounding connects concepts to the physical world enabling humans to acquire and use words and sentences in context. Currently most machines which process language are not grounded. Instead, semantic representations are abstract, pre-specified, and have meaning only when interpreted by humans. We are interested in developing computational systems which represent words, utterances, and underlying concepts in terms of sensory-motor experiences leading to richer levels of machine understanding. A key element of this work is the development of effective architectures for processing multisensory data. Inspired by theories of infant cognition, we present a computational model which learns words from untranscribed acoustic and video input. Channels of input derived from different sensors are integrated in an information-theoretic framework. Acquired words are represented in terms of associations between acoustic and visual sensory experience. The model has been implemented in a real-time robotic system which performs interactive language learning and understanding. Successful learning has also been demonstrated using infant-directed speech and images.
Keywords :
computational linguistics; hidden Markov models; speech coding; computational model; computational systems; grounded spoken language acquisition; infant cognition; interactive language learning; machine understanding; real-time robotic system; semantic representations; sensory-motor experience; word learning; Acoustic sensors; Cognition; Cognitive robotics; Computational modeling; Computer architecture; Grounding; Humans; Natural languages; Real time systems; Robot sensing systems;
Journal_Title :
Multimedia, IEEE Transactions on
DOI :
10.1109/TMM.2003.811618