• DocumentCode
    838454
  • Title

    Exploitation of unlabeled sequences in hidden Markov models

  • Author

    Inoue, Masashi ; Ueda, Naonori

  • Author_Institution
    Graduate Sch. of Inf. Sci., Nara Inst. of Sci. & Technol., Japan
  • Volume
    25
  • Issue
    12
  • fYear
    2003
  • Firstpage
    1570
  • Lastpage
    1581
  • Abstract
    This paper presents a method for effectively using unlabeled sequential data in the learning of hidden Markov models (HMMs). With the conventional approach, class labels for unlabeled data are assigned deterministically by HMMs learned from labeled data. Such labeling often becomes unreliable when the number of labeled data is small. We propose an extended Baum-Welch (EBW) algorithm in which the labeling is undertaken probabilistically and iteratively so that the labeled and unlabeled data likelihoods are improved. Unlike the conventional approach, the EBW algorithm guarantees convergence to a local maximum of the likelihood. Experimental results on gesture data and speech data show that when labeled training data are scarce, by using unlabeled data, the EBW algorithm improves the classification performance of HMMs more robustly than the conventional naive labeling (NL) approach.
  • Keywords
    convergence; gesture recognition; hidden Markov models; learning (artificial intelligence); maximum likelihood estimation; pattern classification; speech recognition; classification performance; conventional naive labeling; convergence; extended Baum Welch algorithm; gesture data; hidden Markov models; labeled training data; speech data; unlabeled sequential data; Convergence; Data mining; Hidden Markov models; Iterative algorithms; Labeling; Semisupervised learning; Sequences; Speech recognition; Supervised learning; Training data;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.2003.1251150
  • Filename
    1251150