مرکز منطقه ای اطلاع رساني علوم و فناوري - Recognition of Dialogue Acts in Multiparty Meetings Using a Switching DBN

DocumentCode :

1144041

Title :

Recognition of Dialogue Acts in Multiparty Meetings Using a Switching DBN

Author :

Dielmann, Alfred ; Renals, Steve

Author_Institution :

Centre for Speech Technol. Res. (CSTR), Univ. of Edinburgh, Edinburgh

Volume :

Issue :

fYear :

2008

Firstpage :

1303

Lastpage :

1314

Abstract :

This paper is concerned with the automatic recognition of dialogue acts (DAs) in multiparty conversational speech. We present a joint generative model for DA recognition in which segmentation and classification of DAs are carried out in parallel. Our approach to DA recognition is based on a switching dynamic Bayesian network (DBN) architecture. This generative approach models a set of features, related to lexical content and prosody, and incorporates a weighted interpolated factored language model. The switching DBN coordinates the recognition process by integrating the component models. The factored language model, which is estimated from multiple conversational data corpora, is used in conjunction with additional task-specific language models. In conjunction with this joint generative model, we have also investigated the use of a discriminative approach, based on conditional random fields, to perform a reclassification of the segmented DAs. We have carried out experiments on the AMI corpus of multimodal meeting recordings, using both manually transcribed speech, and the output of an automatic speech recognizer, and using different configurations of the generative model. Our results indicate that the system performs well both on reference and fully automatic transcriptions. A further significant improvement in recognition accuracy is obtained by the application of the discriminative reranking approach based on conditional random fields.

Keywords :

Bayes methods; belief networks; interpolation; signal classification; speech recognition; AMI corpus; DA classification; DA segmentation; automatic speech recognizer; conditional random fields; dialogue acts recognition; dynamic Bayesian network; joint generative model; manually transcribed speech; multiparty conversational speech; multiparty meetings; switching DBN; weighted interpolated factored language model; AMI corpus; conditional random field (CRF); dialogue act (DA); dynamic Bayesian network (DBN); interpolated factored language model (FLM);

fLanguage :

English

Journal_Title :

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1558-7916

Type :

jour

DOI :

10.1109/TASL.2008.922463

Filename :

4497831

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1144041