DocumentCode :
178690
Title :
Spoken dialogue grammar induction from crowdsourced data
Author :
Palogiannidi, Elisavet ; Klasinas, Ioannis ; Potamianos, Alexandros ; Iosif, Elias
Author_Institution :
Sch. of ECE, Tech. Univ. of Crete, Chania, Greece
fYear :
2014
fDate :
4-9 May 2014
Firstpage :
3211
Lastpage :
3215
Abstract :
We design and evaluate various crowdsourcing tasks for eliciting spoken dialogue data. Task design is based on an array of parameters that quantify the basic characteristics of the elicitation questions, e.g., how open-ended is a question. The crowdsourced data are used for and evaluated on the unsupervised induction of semantic classes for speech understanding grammars. We show that grammar induction performance is significantly affected by the crowdsourcing task parameters, e.g., paraphrasing tasks prime high lexical entrain-ment and result in poor corpus/grammar quality. The task parameters along with perplexity filters are used for corpus selection achieving grammar induction performance that is comparable to that of using in-domain spoken dialogue data.
Keywords :
information retrieval; speech processing; unsupervised learning; corpus selection; corpus-grammar quality; crowdsourced data; crowdsourcing task parameters; data elicitation; grammar induction performance; perplexity filters; speech understanding grammars; spoken dialogue data; spoken dialogue grammar induction; task parameters; Cities and towns; Conferences; Context; Crowdsourcing; Grammar; Semantics; Speech; Crowdsourcing; Grammar Induction; Spoken Dialogue Systems;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
Type :
conf
DOI :
10.1109/ICASSP.2014.6854193
Filename :
6854193
Link To Document :
بازگشت