• DocumentCode
    312333
  • Title

    On not recognizing disfluencies in dialogue

  • Author

    Lickley, R.J. ; Bard, E.G.

  • Author_Institution
    Human Commun. Res. Centre & Dept. of Linguistics, Edinburgh Univ., UK
  • Volume
    3
  • fYear
    1996
  • fDate
    3-6 Oct 1996
  • Firstpage
    1876
  • Abstract
    Tests the hypothesis that listeners miss disfluencies or fail to transcribe them accurately because disfluencies interfere with the normal relationship between speech sound and linguistic context in human spoken word recognition. In a word-level gating experiment, 16 listeners heard a total of 56 disfluent utterances selected from a corpus of spontaneous speech, 56 length-matched fluent controls and 56 fluent foils. The proportion of words never recognized was greater in disfluent utterances than in the controls. The failures clustered around the point where the disfluency interrupted the utterance, ocurring particularly within the reparanda, but were not found at corresponding locations in uninterrupted controls. Repetition disfluencies, where pre- and post-interruption portions might easily be construed together, allowed more successful word recognitions than recast disfluencies, where reconstruction of a single intended utterance would be difficult, if not impossible. The results have implications both for understanding human speech recognition and for improving the robustness of automatic speech recognition (ASR) systems
  • Keywords
    hearing; natural languages; speech intelligibility; speech recognition; automatic speech recognition; dialogue; disfluent utterances; fluent foils; human speech recognition; human spoken word recognition; interruption; length-matched fluent controls; linguistic context; missed disfluencies; recast disfluencies; reparanda; repetition disfluencies; robustness; speech sound; speech understanding; spontaneous speech corpus; spontaneous speech transcription; uninterrupted controls; word-level gating experiment; Acoustic materials; Acoustic testing; Automatic speech recognition; Context; Delay; Humans; Proportional control; Robustness; Speech processing; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
  • Conference_Location
    Philadelphia, PA
  • Print_ISBN
    0-7803-3555-4
  • Type

    conf

  • DOI
    10.1109/ICSLP.1996.607998
  • Filename
    607998