• DocumentCode
    2865248
  • Title

    Discriminatively trained Markov model for sequence classification

  • Author

    Yakhnenko, Oksana ; Silvescu, Adrian ; Honavar, Vasant

  • Author_Institution
    Dept. of Comput. Sci., Iowa State Univ., Ames, IA, USA
  • fYear
    2005
  • fDate
    27-30 Nov. 2005
  • Abstract
    In this paper, we propose a discriminative counterpart of the directed Markov Models of order k - 1, or MM(k - 1) for sequence classification. MM(k - 1) models capture dependencies among neighboring elements of a sequence. The parameters of the classifiers are initialized to based on the maximum likelihood estimates for their generative counterparts. We derive gradient based update equations for the parameters of the sequence classifiers in order to maximize the conditional likelihood function. Results of our experiments with data sets drawn from biological sequence classification (specifically protein function and subcellular localization) and text classification applications show that the discriminatively trained sequence classifiers outperform their generative counterparts, confirming the benefits of discriminative training when the primary objective is classification. Our experiments also show that the discriminatively trained MM(k - 1) sequence classifiers are competitive with the computationally much more expensive Support Vector Machines trained using k-gram representations of sequences.
  • Keywords
    Markov processes; maximum likelihood estimation; pattern classification; biological sequence classification; conditional likelihood function; directed Markov model; discriminative training; discriminatively trained Markov model; gradient based update equation; maximum likelihood estimation; text classification; Artificial intelligence; Computational intelligence; Computer science; Equations; Laboratories; Learning; Maximum likelihood detection; Maximum likelihood estimation; Proteins; Text categorization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, Fifth IEEE International Conference on
  • ISSN
    1550-4786
  • Print_ISBN
    0-7695-2278-5
  • Type

    conf

  • DOI
    10.1109/ICDM.2005.52
  • Filename
    1565717