Title :
Efficient Online Sequence Prediction with Side Information
Author :
Han Xiao ; Eckert, Claudia
Author_Institution :
Inst. of Inf., Tech. Univ. Munchen, Munich, Germany
Abstract :
Sequence prediction is a key task in machine learning and data mining. It involves predicting the next symbol in a sequence given its previous symbols. Our motivating application is predicting the execution path of a process on an operating system in real-time. In this case, each symbol in the sequence represents a system call accompanied with arguments and a return value. We propose a novel online algorithm for predicting the next system call by leveraging both context and side information. The online update of our algorithm is efficient in terms of time cost and memory consumption. Experiments on real-world data sets showed that our method outperforms state-of-the-art online sequence prediction methods in both accuracy and efficiency, and incorporation of side information does significantly improve the predictive accuracy.
Keywords :
data mining; learning (artificial intelligence); operating systems (computers); arguments; context information; data mining; machine learning; memory consumption; online algorithm; online sequence prediction; operating system; process execution path; return value; side information; system call; time cost; Accuracy; Context; Error analysis; Memory management; Prediction algorithms; Predictive models; Vectors; online learning; scalability; sequence predictio; system trace;
Conference_Titel :
Data Mining (ICDM), 2013 IEEE 13th International Conference on
Conference_Location :
Dallas, TX
DOI :
10.1109/ICDM.2013.31