• DocumentCode
    3244768
  • Title

    Acoustic correlates of user response to error in human-computer dialogues

  • Author

    Kazemzadeh, Abe ; Lee, Sungbok ; Narayanan, Shrikanth

  • Author_Institution
    Univ. of Southern California, Los Angeles, CA, USA
  • fYear
    2003
  • fDate
    30 Nov.-3 Dec. 2003
  • Firstpage
    215
  • Lastpage
    220
  • Abstract
    Using tagged data from the DARPA Communicator Project, we investigate acoustic features of user responses to system errors. We measure acoustic parameters such as energy, fundamental frequency, sub-band energy, ratios of voiced, unvoiced and silent regions of speech, fundamental frequency slope, spectral slope, and spectral center of gravity. We investigate different types of user responses to the errors, including frustration and various types of corrections. It is confirmed that the most prominent acoustic parameter for responses to the errors is fundamental frequency maximum and range, while other features are found to be salient for specific reaction types. More interestingly, acoustic characteristics of user responses to the errors are found to be different depending on whether the responses are the initial or continued responses to the errors. Similarly, normal user responses can differ acoustically depending on whether or not they were preceded by responses to error. We also present results on automatic classification of error response types using these features.
  • Keywords
    acoustics; error analysis; human computer interaction; human factors; interactive systems; natural language interfaces; speech-based user interfaces; DARPA Communicator Project; acoustic energy; acoustic features; fundamental frequency maximum; fundamental frequency range; fundamental frequency slope; human-computer dialogue errors; silent regions; spectral center of gravity; spectral slope; spoken dialogue systems; sub-band energy; tagged data; unvoiced regions; user response; voiced regions; Acoustic measurements; Energy measurement; Error analysis; Error correction; Frequency measurement; Gravity; Length measurement; Robustness; Speech; Switches;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding, 2003. ASRU '03. 2003 IEEE Workshop on
  • Print_ISBN
    0-7803-7980-2
  • Type

    conf

  • DOI
    10.1109/ASRU.2003.1318443
  • Filename
    1318443