Title :
Mining Dependent Frequent Serial Episodes from Uncertain Sequence Data
Author :
Li Wan ; Ling Chen ; Chengqi Zhang
Author_Institution :
Comput. Sci. & Technol. Coll., Chongqing Univ., Chongqing, China
Abstract :
In this paper, we focus on the problem of mining Probabilistic Dependent Frequent Serial Episodes (P-DFSEs) from uncertain sequence data. By observing that the frequentness probability of an episode in an uncertain sequence is a Markov Chain imbeddable variable, we first propose an Embeded Markov Chain-based algorithm that efficiently computes the frequentness probability of an episode by projecting the probability space into a set of limited partitions. To further improve the computation efficiency, we devise an optimized approach that prunes candidate episodes early by estimating the upper bound of their frequentness probabilities.
Keywords :
Markov processes; data mining; P-DFSE mining; embeded Markov chain-based algorithm; episode frequentness probability; probabilistic dependent frequent serial episodes mining; probability space; uncertain sequence data; Automata; Data mining; Electromagnetic compatibility; Heuristic algorithms; Markov processes; Probabilistic logic; Yttrium;
Conference_Titel :
Data Mining (ICDM), 2013 IEEE 13th International Conference on
Conference_Location :
Dallas, TX
DOI :
10.1109/ICDM.2013.35