• DocumentCode
    3426982
  • Title

    Automatic classification of question turns in spontaneous speech using lexical and prosodic evidence

  • Author

    Ananthakrishnan, Sankaranarayanan ; Ghosh, Prasanta ; Narayanan, Shrikanth

  • Author_Institution
    Dept. of Electr. Eng., Southern California Univ., Los Angeles, CA
  • fYear
    2008
  • fDate
    March 31 2008-April 4 2008
  • Firstpage
    5005
  • Lastpage
    5008
  • Abstract
    The ability to identify speech acts reliably is desirable in any spoken language system that interacts with humans. Minimally, such a system should be capable of distinguishing between question-bearing turns and other types of utterances. However, this is a non-trivial task, since spontaneous speech tends to have incomplete syntactic, and even ungrammatical, structure and is characterized by disfluencies, repairs and other non-linguistic vocalizations that make simple rule based pattern learning difficult. In this paper, we present a system for identifying question-bearing turns in spontaneous multi-party speech (ICSI Meeting Corpus) using lexical and prosodic evidence. On a balanced test set, our system achieves an accuracy of 71.9% for the binary question vs. non-question classification task. Further, we investigate the robustness of our proposed technique to uncertainty in the lexical feature stream (e.g. caused by speech recognition errors). Our experiments indicate that classification accuracy of the proposed method is robust to errors in the text stream, dropping only about 0.8% for every 10% increase in word error rate (WER).
  • Keywords
    knowledge based systems; learning (artificial intelligence); linguistics; pattern classification; speech processing; speech recognition; lexical evidence; nonlinguistic vocalizations; prosodic evidence; question-bearing turns; rule based pattern learning; spoken language system; spontaneous multiparty speech; word error rate; Acoustic testing; Automatic speech recognition; Data mining; Databases; Error analysis; Humans; Natural languages; Robustness; Speech analysis; System testing; dialog; prosody; question turn; speech act; spontaneous speech;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
  • Conference_Location
    Las Vegas, NV
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-1483-3
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2008.4518782
  • Filename
    4518782