DocumentCode :
1576635
Title :
Bootstrapping word learning: A perception driven semantics-first approach
Author :
Mukerjee, Amitabha ; Joshi, Nikhil ; Mudgal, Prabhat ; Srinath, S. V P Gopi
Author_Institution :
Dept. of Comput. Sci. & Eng., Indian Inst. of Technol., Kanpur, India
Volume :
2
fYear :
2011
Firstpage :
1
Lastpage :
6
Abstract :
In recent decades, evidence for preverbal perceptual categorization in infants has been accumulating, and a role has been suggested for such processes in bootstrapping word learning, i.e. acquiring the very first word-meaning associations. We propose a computational study to consider the possibility that initial notions of semantic classes may help word learning. We consider visual scenes with many possible referents, and consider unparsed linguistic descriptions in text form. We use no prior knowledge of vision domain, or of morphology, syntax or word frequency. Using a synthetic model of object attention, we show that the system is able to first identify perceptual classes from the visual stream, and then associate these with words from the linguistic stream. Working with Hindi text, we demonstrate the ability to learn words for prominent proto-concepts like BICYCLE, TRUCK, and CAR from a complex traffic video. We compare the associations when learning unsegmented poly-syllabic strings in the language (without knowledge of word boundaries) versus segmented words, and find that the poly-syllables do nearly as well. This suggests that early acquisition of some semantic classes may also help in parsing the input stream into “words”. The model is then used on a novel video from a similar domain, to identify objects with their labels. Since we provide no knowledge to the system either for the visual or language analyses, the results are likely to hold for other visual scenes and languages.
Keywords :
computational linguistics; grammars; video signal processing; BICYCLE; CAR; Hindi text; TRUCK; bootstrapping word learning; first word-meaning associations; infant preverbal perceptual categorization; linguistic stream; morphology; perception driven semantics-first approach; poly-syllabic string learning; syntax; traffic video; unparsed linguistic descriptions; vision domain; visual scenes; word frequency; Irrigation; Pragmatics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Development and Learning (ICDL), 2011 IEEE International Conference on
Conference_Location :
Frankfurt am Main
ISSN :
2161-9476
Print_ISBN :
978-1-61284-989-8
Type :
conf
DOI :
10.1109/DEVLRN.2011.6037345
Filename :
6037345
Link To Document :
بازگشت