Title :
Building sensori-motor prototypes from audiovisual exemplars
Author_Institution :
Inst. de la Commun. Parlee, Stendhal Univ., Grenoble, France
Abstract :
The paper shows how an articulatory model, able to produce acoustic signals from articulatory motion, can learn to speak, i.e. coordinate its movements in such a way that it utters meaningful sequences of sounds belonging to a given language. This complex learning procedure is accomplished in four major steps: (a) a babbling phase, where the device builds up a model of the forward transforms, i.e. the articulatory-to-audiovisual mapping; (b) an imitation stage, where it tries to reproduce a limited set of sound sequences produced by a distal “teacher”; (c) a “shaping” stage, where phonemes are associated with the most efficient sensorimotor representation; and finally, (d) a “rhythmic” phase, where it learns the appropriate coordination of the activations of these sensory-motor targets
Keywords :
audio-visual systems; robots; sequences; speech synthesis; acoustic signal production; articulatory model; articulatory motion; articulatory-to-audiovisual mapping; audiovisual exemplars; babbling phase; distal teacher; forward transforms; imitation stage; learning procedure; meaningful sound sequences; movement coordination; phonemes; rhythmic phase; sensory-motor prototype building; shaping stage; speech learning; Discrete transforms; Electronic mail; Force control; Force sensors; Haptic interfaces; Jacobian matrices; Natural languages; Prototypes; Speech processing; Speech synthesis;
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
DOI :
10.1109/ICSLP.1996.607761