• DocumentCode
    3379822
  • Title

    Synchronized gesture and speech production for humanoid robots

  • Author

    Ng-Thow-Hing, Victor ; Luo, Pengcheng ; Okita, Sandra

  • Author_Institution
    Honda Res. Inst. USA, Inc., Mountain View, CA, USA
  • fYear
    2010
  • fDate
    18-22 Oct. 2010
  • Firstpage
    4617
  • Lastpage
    4624
  • Abstract
    We present a model that is capable of synchronizing expressive gestures with speech. The model, implemented on a Honda humanoid robot, can generate a full range of gesture types, such as emblems, iconic and metaphoric gestures, deictic pointing and beat gestures. Arbitrary input text is analyzed with a part-of-speech tagger and a text-to-speech engine for timing information of spoken words. In addition, style tags can be optionally added to specify the level of excitement or topic changes. The text, combined with any tags, is then processed by several grammars, one for each gesture type to produce several candidate gestures for each word of the text. The model then selects probabilistically amongst the gesture types based on the desired degree of expressivity. Once a gesture type is selected, it coincides with a particular gesture template, consisting of trajectory curves that define the gesture. Speech timing patterns and style parameters are used to modulate the shape of the curve before it sent to the whole body control system on the robot. Evaluation of the model´s parameters were performed, demonstrating the ability of observers to differentiate varying levels of expressiveness, excitement and speech synchronization. Modification of gesture speed for trajectory tracking found that positive associations like happiness and excitement accompanied faster speeds, with negative associations like sadness or tiredness occurred at slower speeds.
  • Keywords
    gesture recognition; humanoid robots; object tracking; position control; speech processing; synchronisation; gesture synchronization; gesture template; humanoid robots; part-of-speech tagger; speech production; speech synchronization; text-to-speech engine; trajectory tracking;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Robots and Systems (IROS), 2010 IEEE/RSJ International Conference on
  • Conference_Location
    Taipei
  • ISSN
    2153-0858
  • Print_ISBN
    978-1-4244-6674-0
  • Type

    conf

  • DOI
    10.1109/IROS.2010.5654322
  • Filename
    5654322