DocumentCode
1688472
Title
Evaluating model drift in machine learning algorithms
Author
Nelson, Kevin ; Corbin, George ; Anania, Mark ; Kovacs, Matthew ; Tobias, Jeremy ; Blowers, Misty
Author_Institution
BAE Syst., Rome, NY, USA
fYear
2015
Firstpage
1
Lastpage
8
Abstract
Machine learning is rapidly emerging as a valuable technology thanks to its ability to learn patterns from large data sets and solve problems that are impossible to model using conventional programming logic. As machine learning techniques become more mainstream, they are being applied to a wider range of application domains. These algorithms are now trusted to make critical decisions in secure and adversarial environments such as healthcare, fraud detection, and network security, in which mistakes can be incredibly costly. They are also a critical component to most modern autonomous systems. However, the data driven approach utilized by these machine learning methods can prove to be a weakness if the data on which the models rely are corrupted by either nefarious or accidental means. Models that utilize on-line learning or periodic retraining to learn new patterns and account for data distribution changes are particularly susceptible to corruption through model drift. In modeling this type of scenario, specially crafted data points are added to the training set over time to adversely influence the system, inducing model drift which leads to incorrect classifications. Our work is focused on exploring the resistance of various machine learning algorithms to such an approach. In this paper we present an experimental framework designed to measure the susceptibility of anomaly detection algorithms to model drift. We also exhibit our preliminary results using various machine learning algorithms commonly found in intrusion detection research.
Keywords
learning (artificial intelligence); pattern recognition; security of data; anomaly detection algorithms; autonomous systems; data distribution; data driven approach; intrusion detection research; machine learning algorithms; machine learning techniques; model drift evaluation; on-line learning; periodic retraining; programming logic; Algorithm design and analysis; Data models; Hidden Markov models; Image color analysis; Machine learning algorithms; Security; Training; adversarial machine learning; cyber security; intrusion detection systems; model drift;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Intelligence for Security and Defense Applications (CISDA), 2015 IEEE Symposium on
Conference_Location
Verona, NY
Print_ISBN
978-1-4673-7556-6
Type
conf
DOI
10.1109/CISDA.2015.7208643
Filename
7208643
Link To Document