مرکز منطقه ای اطلاع رساني علوم و فناوري - Discourse structure for multi-speaker spontaneous spoken dialogs: incorporating heuristics into stochastic RTNs

DocumentCode :

294551

Title :

Discourse structure for multi-speaker spontaneous spoken dialogs: incorporating heuristics into stochastic RTNs

Author :

Young, Sheryl R.

Author_Institution :

Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA

Volume :

fYear :

1995

fDate :

9-12 May 1995

Firstpage :

177

Abstract :

In real spoken language applications, where speakers interact spontaneously, there is much seeming unpredictability that makes recognition difficult. Multi-speaker spontaneous dialog where two speakers interact verbally to cooperatively solve a mutual, shared problem is more varied than human-computer interactions. Spontaneous speech is not well structured, exhibiting mid-utterance corrections and restarts in utterances. Discourse contains digressions, clarifications, corrections and topic changes. But, multi-speaker discourse is even more varied, with initiative effects, speakers interacting, planning and responding. This makes it extremely difficult to develop grammars and language models with adequate coverage and reliable stochastic parameters. Perplexity increases and recognition degrades considerably vis-a-vis human-database dialog. In spite of all this, multi-speaker dialogs are structured and predictable when the discourse is appropriately modelled. We have developed heuristics to model spontaneous speech and multi-speaker dialogs. The underlying heuristics have been evaluated and shown to adequately and accurately predict discourse phenomena, as evaluated on a 10,000+ utterance corpus. Generally, the heuristics for computing discourse structure and deriving constraints from it are rule based. We have taken the rules and used them to develop a set of stochastic RTNs that capture both the rules and corpus probabilities. The resulting language model can be used predictively to dynamically generate stochastic utterance predictions or can be incorporated into any recognition/understanding system where a single prior state is maintained

Keywords :

grammars; knowledge based systems; natural languages; speech processing; stochastic processes; clarifications; corpus probabilities; corrections; digressions; discourse structure; grammars; heuristics; human-database dialog; language model; language models; multi-speaker discourse; multi-speaker spontaneous spoken dialogs; rule based system; speech recognition; speech understanding system; spoken language applications; spontaneous speech; stochastic RTN; stochastic parameters; stochastic utterance predictions; topic changes; utterance corpus; Application software; Computer science; Degradation; Laser sintering; Level control; Natural languages; Predictive models; Problem-solving; Speech analysis; Speech recognition; Stochastic processes;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on

Conference_Location :

Detroit, MI

ISSN :

1520-6149

Print_ISBN :

0-7803-2431-5

Type :

conf

DOI :

10.1109/ICASSP.1995.479393

Filename :

479393

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=294551