An empirical investigation of sparse log-linear models for improved dialogue act classification

Author

Yun-Nung Chen ; Wang, Wei Yu ; Rudnicky, Alexander I.

Author_Institution

Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA

fYear

2013

Firstpage

8317

Lastpage

8321

Abstract

Previous work on dialogue act classification have primarily focused on dense generative and discriminative models. However, since the automatic speech recognition (ASR) outputs are often noisy, dense models might generate biased estimates and overfit to the training data. In this paper, we study sparse modeling approaches to improve dialogue act classification, since the sparse models maintain a compact feature space, which is robust to noise. To test this, we investigate various element-wise frequentist shrinkage models such as lasso, ridge, and elastic net, as well as structured sparsity models and a hierarchical sparsity model that embed the dependency structure and interaction among local features. In our experiments on a real-world dataset, when augmenting N-best word and phone level ASR hypotheses with confusion network features, our best sparse log-linear model obtains a relative improvement of 19.7% over a rule-based baseline, a 3.7% significant improvement over a traditional non-sparse log-linear model, and outperforms a state-of-the-art SVM model by 2.2%.

Keywords

learning (artificial intelligence); speech recognition; SVM model; augmenting N-best word; automatic speech recognition; dense generative model; discriminative model; elastic net; element-wise frequentist shrinkage model; improved dialogue act classification; lasso net; nonsparse log-linear model; phone level ASR hypotheses; ridge net; rule-based baseline; sparse log-linear modeling approach; Accuracy; Computational modeling; Hidden Markov models; Noise measurement; Speech; Support vector machines; Training; Dialogue act classification; discriminative model; log-linear model; maximum entropy; sparsity;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on

Conference_Location

Vancouver, BC

ISSN

1520-6149

Type

conf

DOI

10.1109/ICASSP.2013.6639287

Filename

6639287