Title :
An Anomaly Detection Algorithm Based on Lossless Compression
Author :
Wang, Nan ; Han, Jizhong ; Fang, Jinyun
Author_Institution :
Inst. of Comput. Technol., Beijing, China
Abstract :
Anomaly detection is essential in network security. It has been researched for decades. Many anomaly detection methods have been proposed. Because of the simplicity of principles, statistical and Markovian methods dominate these approaches. However, their effectiveness is constrained by specific preconditions, which make them work for only appropriate data sets which satisfy their premises. Other than statistical and Markovian model, information theory provides a different perspective about anomaly detection. However, the computation of information theoretic measures is still based on statistics. In this paper, we present a novel, information theoretic anomaly detection framework. Instead of statistics, it employs lossless compression for measuring the information quantity, and detects outliers according to compression result. We also discuss the selection of underlying compression algorithm, and choose a grammar compression for utilizing the structure of data. With grammar compression, our method overcomes the shortcomings of statistical and Markovian methods. In addition, the implementation and operation of our method is even simpler than traditional approaches. We test our method on four data sets about text analyzing, host intrusion detection and bug detection. Experimental results show that, even traditional methods fail in some situations, our simple method works well in all cases.
Keywords :
Markov processes; information theory; security of data; telecommunication security; Markovian method; Markovian model; anomaly detection algorithm; anomaly detection method; bug detection; compression algorithm; data structure; grammar compression; information quantity; information theoretic anomaly detection framework; information theoretic measures; information theory; intrusion detection; lossless compression; network security; Compression algorithms; Entropy; Grammar; Hidden Markov models; Markov processes; Statistical analysis; Training; anomaly detection; data mining; grammar-based compression;
Conference_Titel :
Networking, Architecture and Storage (NAS), 2012 IEEE 7th International Conference on
Conference_Location :
Xiamen, Fujian
Print_ISBN :
978-1-4673-1889-1