DocumentCode :
294551
Title :
Discourse structure for multi-speaker spontaneous spoken dialogs: incorporating heuristics into stochastic RTNs
Author :
Young, Sheryl R.
Author_Institution :
Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
Volume :
1
fYear :
1995
fDate :
9-12 May 1995
Firstpage :
177
Abstract :
In real spoken language applications, where speakers interact spontaneously, there is much seeming unpredictability that makes recognition difficult. Multi-speaker spontaneous dialog where two speakers interact verbally to cooperatively solve a mutual, shared problem is more varied than human-computer interactions. Spontaneous speech is not well structured, exhibiting mid-utterance corrections and restarts in utterances. Discourse contains digressions, clarifications, corrections and topic changes. But, multi-speaker discourse is even more varied, with initiative effects, speakers interacting, planning and responding. This makes it extremely difficult to develop grammars and language models with adequate coverage and reliable stochastic parameters. Perplexity increases and recognition degrades considerably vis-a-vis human-database dialog. In spite of all this, multi-speaker dialogs are structured and predictable when the discourse is appropriately modelled. We have developed heuristics to model spontaneous speech and multi-speaker dialogs. The underlying heuristics have been evaluated and shown to adequately and accurately predict discourse phenomena, as evaluated on a 10,000+ utterance corpus. Generally, the heuristics for computing discourse structure and deriving constraints from it are rule based. We have taken the rules and used them to develop a set of stochastic RTNs that capture both the rules and corpus probabilities. The resulting language model can be used predictively to dynamically generate stochastic utterance predictions or can be incorporated into any recognition/understanding system where a single prior state is maintained
Keywords :
grammars; knowledge based systems; natural languages; speech processing; stochastic processes; clarifications; corpus probabilities; corrections; digressions; discourse structure; grammars; heuristics; human-database dialog; language model; language models; multi-speaker discourse; multi-speaker spontaneous spoken dialogs; rule based system; speech recognition; speech understanding system; spoken language applications; spontaneous speech; stochastic RTN; stochastic parameters; stochastic utterance predictions; topic changes; utterance corpus; Application software; Computer science; Degradation; Laser sintering; Level control; Natural languages; Predictive models; Problem-solving; Speech analysis; Speech recognition; Stochastic processes;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location :
Detroit, MI
ISSN :
1520-6149
Print_ISBN :
0-7803-2431-5
Type :
conf
DOI :
10.1109/ICASSP.1995.479393
Filename :
479393
Link To Document :
بازگشت