• DocumentCode
    1695465
  • Title

    An empirical investigation of sparse log-linear models for improved dialogue act classification

  • Author

    Yun-Nung Chen ; Wang, Wei Yu ; Rudnicky, Alexander I.

  • Author_Institution
    Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
  • fYear
    2013
  • Firstpage
    8317
  • Lastpage
    8321
  • Abstract
    Previous work on dialogue act classification have primarily focused on dense generative and discriminative models. However, since the automatic speech recognition (ASR) outputs are often noisy, dense models might generate biased estimates and overfit to the training data. In this paper, we study sparse modeling approaches to improve dialogue act classification, since the sparse models maintain a compact feature space, which is robust to noise. To test this, we investigate various element-wise frequentist shrinkage models such as lasso, ridge, and elastic net, as well as structured sparsity models and a hierarchical sparsity model that embed the dependency structure and interaction among local features. In our experiments on a real-world dataset, when augmenting N-best word and phone level ASR hypotheses with confusion network features, our best sparse log-linear model obtains a relative improvement of 19.7% over a rule-based baseline, a 3.7% significant improvement over a traditional non-sparse log-linear model, and outperforms a state-of-the-art SVM model by 2.2%.
  • Keywords
    learning (artificial intelligence); speech recognition; SVM model; augmenting N-best word; automatic speech recognition; dense generative model; discriminative model; elastic net; element-wise frequentist shrinkage model; improved dialogue act classification; lasso net; nonsparse log-linear model; phone level ASR hypotheses; ridge net; rule-based baseline; sparse log-linear modeling approach; Accuracy; Computational modeling; Hidden Markov models; Noise measurement; Speech; Support vector machines; Training; Dialogue act classification; discriminative model; log-linear model; maximum entropy; sparsity;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
  • Conference_Location
    Vancouver, BC
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2013.6639287
  • Filename
    6639287