• DocumentCode
    178690
  • Title

    Spoken dialogue grammar induction from crowdsourced data

  • Author

    Palogiannidi, Elisavet ; Klasinas, Ioannis ; Potamianos, Alexandros ; Iosif, Elias

  • Author_Institution
    Sch. of ECE, Tech. Univ. of Crete, Chania, Greece
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    3211
  • Lastpage
    3215
  • Abstract
    We design and evaluate various crowdsourcing tasks for eliciting spoken dialogue data. Task design is based on an array of parameters that quantify the basic characteristics of the elicitation questions, e.g., how open-ended is a question. The crowdsourced data are used for and evaluated on the unsupervised induction of semantic classes for speech understanding grammars. We show that grammar induction performance is significantly affected by the crowdsourcing task parameters, e.g., paraphrasing tasks prime high lexical entrain-ment and result in poor corpus/grammar quality. The task parameters along with perplexity filters are used for corpus selection achieving grammar induction performance that is comparable to that of using in-domain spoken dialogue data.
  • Keywords
    information retrieval; speech processing; unsupervised learning; corpus selection; corpus-grammar quality; crowdsourced data; crowdsourcing task parameters; data elicitation; grammar induction performance; perplexity filters; speech understanding grammars; spoken dialogue data; spoken dialogue grammar induction; task parameters; Cities and towns; Conferences; Context; Crowdsourcing; Grammar; Semantics; Speech; Crowdsourcing; Grammar Induction; Spoken Dialogue Systems;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6854193
  • Filename
    6854193