DocumentCode
506866
Title
A Method for Determination on HMM Distance Threshold
Author
Duan, Jiangjiao ; Zeng, Jianping ; Zhang, Dongzhan
Author_Institution
Dept. of Comput. Sci., Xiamen Univ., Xiamen, China
Volume
1
fYear
2009
fDate
14-16 Aug. 2009
Firstpage
387
Lastpage
391
Abstract
Hidden Markov model (HMM) is widely used in time series modeling. Usually, it is necessarily to calculate the sequence´s likelihood w.r.t. HMM to evaluate the similarity between the sequence and the HMM. Hence, it is required to provide a method to select a best threshold value that can determine whether the sequence is well approximated by the model or not. However, this process is usually done manually. Here, we provide a method (HTDM) to determine the threshold automatically. Based on likelihood statistic, we conclude that the likelihood is subjected to normal distribution, and then standard deviation of the distribution is estimated. Hence, the distance threshold value can be achieved based on the rule of ¿three sigma¿. In the experiment, we make performance comparison between the HMM-based hierarchical clustering algorithm HHCH using HTDM, and algorithm HBHCTS in which threshold is set by manual. Experiment results show that the proposed method is effective on both syntax dataset and real world dataset.
Keywords
hidden Markov models; normal distribution; pattern clustering; HMM distance threshold; hidden Markov model; hierarchical clustering algorithm; likelihood statistic; normal distribution; three sigma rule; Clustering algorithms; Computer science; Data mining; Fuzzy systems; Gaussian distribution; Hidden Markov models; Performance analysis; Statistical distributions; Stochastic processes; Time series analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems and Knowledge Discovery, 2009. FSKD '09. Sixth International Conference on
Conference_Location
Tianjin
Print_ISBN
978-0-7695-3735-1
Type
conf
DOI
10.1109/FSKD.2009.732
Filename
5358570
Link To Document