Title :
Online Induction of Probabilistic Real Time Automata
Author :
Schmidt, J. ; Kramer, S.
Author_Institution :
Tech. Univ. Munchen, Garching, Germany
Abstract :
Probabilistic real time automata (PRTAs) are a representation of dynamic processes arising in the sciences and industry. Currently, the induction of automata is divided into two steps: the creation of the prefix tree acceptor (PTA) and the merge procedure based on clustering of the states. These two steps can be very time intensive when a PRTA is to be induced for massive or even unbounded data sets. The latter one can be efficiently processed, as there exist scalable online clustering algorithms. However, the creation of the PTA still can be very time consuming. To overcome this problem, we propose a genuine online PRTA induction approach that incorporates new instances by first collapsing them and then using a maximum frequent pattern based clustering. The approach is tested against a predefined synthetic automaton and real-world data sets, for which the approach is scalable and stable. Moreover, we present a broad evaluation on a real world disease group data set that shows the applicability of such a model to the analysis of medical processes.
Keywords :
Internet; automata theory; diseases; medical computing; pattern clustering; disease group data set; dynamic process representation; massive data sets; maximum frequent pattern based clustering; medical process analysis; merge procedure; online PRTA induction approach; online clustering algorithms; online induction; prefix tree acceptor creation; probabilistic real time automata; real-world data sets; state clustering; synthetic automaton; unbounded data sets; Automata; Data models; Diseases; History; Merging; Probabilistic logic; Real-time systems; Probabilistic real time automata; maximum frequent pattern based clustering; online induction;
Conference_Titel :
Data Mining (ICDM), 2012 IEEE 12th International Conference on
Conference_Location :
Brussels
Print_ISBN :
978-1-4673-4649-8
DOI :
10.1109/ICDM.2012.121