DocumentCode
178690
Title
Spoken dialogue grammar induction from crowdsourced data
Author
Palogiannidi, Elisavet ; Klasinas, Ioannis ; Potamianos, Alexandros ; Iosif, Elias
Author_Institution
Sch. of ECE, Tech. Univ. of Crete, Chania, Greece
fYear
2014
fDate
4-9 May 2014
Firstpage
3211
Lastpage
3215
Abstract
We design and evaluate various crowdsourcing tasks for eliciting spoken dialogue data. Task design is based on an array of parameters that quantify the basic characteristics of the elicitation questions, e.g., how open-ended is a question. The crowdsourced data are used for and evaluated on the unsupervised induction of semantic classes for speech understanding grammars. We show that grammar induction performance is significantly affected by the crowdsourcing task parameters, e.g., paraphrasing tasks prime high lexical entrain-ment and result in poor corpus/grammar quality. The task parameters along with perplexity filters are used for corpus selection achieving grammar induction performance that is comparable to that of using in-domain spoken dialogue data.
Keywords
information retrieval; speech processing; unsupervised learning; corpus selection; corpus-grammar quality; crowdsourced data; crowdsourcing task parameters; data elicitation; grammar induction performance; perplexity filters; speech understanding grammars; spoken dialogue data; spoken dialogue grammar induction; task parameters; Cities and towns; Conferences; Context; Crowdsourcing; Grammar; Semantics; Speech; Crowdsourcing; Grammar Induction; Spoken Dialogue Systems;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location
Florence
Type
conf
DOI
10.1109/ICASSP.2014.6854193
Filename
6854193
Link To Document