• DocumentCode
    2979030
  • Title

    Automatic detection of discourse structure for speech recognition and understanding

  • Author

    Jurafsky, Daniel ; Bates, Rebecca ; Coccaro, Noah ; Martin, Rachel ; Meteer, Marie ; Ries, Klaus ; Shriberg, Elizabeth ; Stolcke, Audreas ; Taylor, Paul ; Van Ess-Dykema, Carol

  • fYear
    1997
  • fDate
    14-17 Dec 1997
  • Firstpage
    88
  • Lastpage
    95
  • Abstract
    We describe a new approach for statistical modeling and detection of discourse structure for natural conversational speech. Our model is based on 42 dialog acts (DAs), (question, answer, backchannel, agreement, disagreement, apology, etc.). We labeled 1155 conversations from the Switchboard (SWBD) database (Godfrey et al., 1992) of human-to-human telephone conversations with these 42 types and trained a dialog act detector based on three distinct knowledge sources: sequences of words which characterize a dialog act; prosodic features which characterize a dialog act; and a statistical discourse grammar. Our combined detector, although still in preliminary stages, already achieves a 65% dialog act detection rate based on acoustic waveforms, and 72% accuracy based on word transcripts. Using this detector to switch among the 42 dialog-act-specific trigram LMs also gave us an encouraging but not statistically significant reduction in SWBD word error
  • Keywords
    computational linguistics; grammars; natural language interfaces; speech recognition; statistical analysis; Switchboard database; acoustic waveforms; dialog acts; discourse structure detection; natural conversational speech; prosodic features; speech recognition; speech understanding; statistical discourse grammar; statistical modeling; telephone conversations; word error; word transcripts; Acoustic signal detection; Acoustic waves; Automatic speech recognition; Buildings; Detectors; Error analysis; Spatial databases; Speech recognition; Switches; Telephony;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding, 1997. Proceedings., 1997 IEEE Workshop on
  • Conference_Location
    Santa Barbara, CA
  • Print_ISBN
    0-7803-3698-4
  • Type

    conf

  • DOI
    10.1109/ASRU.1997.658992
  • Filename
    658992