Title :
A unified approach to speech production and recognition based on articulatory motor representations
Author :
Hörnstein, Jonas ; Santos-Victor, José
Author_Institution :
Inst. Super. Tecnico, Lisbon
fDate :
Oct. 29 2007-Nov. 2 2007
Abstract :
We present a unified approach to speech production and recognition based on articulatory motor representations. The approach is inspired by the motor theory and the discovery of mirror neurons, and uses motor representations for both reproduction and recognition of speech. A model of the vocal tract is used to create sound, and the created sound is then mapped back to the motor representation using a neural network. To learn the map, we mimic the behavior of a child who uses a combination of babbling and interaction with a caregiver to learn how to speak. Several different phases of babbling and interaction are identified and described. These phases help to overcome the inversion problem. The approach has been implemented on a humanoid robot, which has successfully learned to pronounce Swedish and Portuguese vowels. We have also studied how the different phases of babbling and interaction affect the error of the map and the recognition rate achieved when the system is presented with vowels from different subjects. Finally, we compare the recognition rates obtained using motor space with those obtained by directly using the acoustic parameters.
Keywords :
humanoid robots; inverse problems; speech recognition; Portuguese; Swedish; acoustic parameters; articulatory motor representations; babbling combination; caregiver; humanoid robot; inversion problem; mirror neurons; motor theory; speech production; speech recognition; Humans; Intelligent robots; Learning systems; Mirrors; Neurons; Production systems; Speech recognition; Tongue; USA Councils; Usability;
Conference_Title :
Intelligent Robots and Systems, 2007. IROS 2007. IEEE/RSJ International Conference on
Conference_Location :
San Diego, CA
Print_ISBN :
978-1-4244-0912-9
Electronic_ISBN :
978-1-4244-0912-9
DOI :
10.1109/IROS.2007.4399475