• DocumentCode
    51197
  • Title

    A Normalized Statistical Metric Space for Hidden Markov Models

  • Author

    Chen Lu ; Schwier, J.M. ; Craven, R.M. ; Lu Yu ; Brooks, R.R. ; Griffin, Christopher

  • Author_Institution
    Holcombe Dept. of Electr. & Comput. Eng., Clemson Univ., Clemson, SC, USA
  • Volume
    43
  • Issue
    3
  • fYear
    2013
  • fDate
    Jun-13
  • Firstpage
    806
  • Lastpage
    819
  • Abstract
    In this paper, we present a normalized statistical metric space for hidden Markov models (HMMs). HMMs are widely used to model real-world systems. Like graph matching, some previous approaches compare HMMs by evaluating the correspondence, or goodness of match, between every pair of states, concentrating on the structure of the models instead of the statistics of the process being observed. To remedy this, we present a new metric space that compares the statistics of HMMs within a given level of statistical significance. Compared with the Kullback-Leibler divergence, which is another widely used approach for measuring model similarity, our approach is a true metric, can always return an appropriate distance value, and provides a confidence measure on the metric value. Experimental results are given for a sample application, which quantify the similarity of HMMs of network traffic in the Tor anonymization system. This application is interesting since it considers models extracted from a system that is intentionally trying to obfuscate its internal workings. In the conclusion, we discuss applications in less-challenging domains, such as data mining.
  • Keywords
    hidden Markov models; statistics; Kullback-Leibler divergence; Tor anonymization system; data mining; distance value; graph matching; hidden Markov models; metric value; model similarity measurement; network traffic; normalized statistical metric space; Data models; Hidden Markov models; Joints; Markov processes; Mathematical model; Measurement; Probability; Hidden Markov models (HMMs); model comparison; model similarity; Algorithms; Computer Simulation; Data Interpretation, Statistical; Data Mining; Markov Chains; Models, Statistical;
  • fLanguage
    English
  • Journal_Title
    Cybernetics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    2168-2267
  • Type

    jour

  • DOI
    10.1109/TSMCB.2012.2216872
  • Filename
    6320699