DocumentCode :
1591300
Title :
Cost-based modeling for fraud and intrusion detection: results from the JAM project
Author :
Stolfo, Salvatore J. ; Fan, Wei ; Lee, Wenke ; Prodromidis, Andreas ; Chan, Philip K.
Author_Institution :
Dept. of Comput. Sci., Columbia Univ., New York, NY, USA
Volume :
2
fYear :
2000
fDate :
6/22/1905 12:00:00 AM
Firstpage :
130
Abstract :
We describe the results achieved using the JAM distributed data mining system for the real world problem of fraud detection in financial information systems. For this domain we provide clear evidence that state-of-the-art commercial fraud detection systems can be substantially improved in stopping losses due to fraud by combining multiple models of fraudulent transaction shared among banks. We demonstrate that the traditional statistical metrics used to train and evaluate the performance of learning systems (ie. statistical accuracy or ROC analysis) are misleading and perhaps inappropriate for this application. Cost-based metrics are more relevant in certain domains, and defining such metrics poses significant and interesting research questions both in evaluating systems and alternative models, and in formalizing the problems to which one may wish to apply data mining technologies. This paper also demonstrates how the techniques developed for fraud detection can be generalized and applied to the important area of intrusion detection in networked information systems. We report the outcome of recent evaluations of our system applied to tcpdump network intrusion data specifically with respect to statistical accuracy. This work involved building additional components of JAM that we have come to call, MADAM ID (Mining Audit Data for Automated Models for Intrusion Detection). However, taking the next step to define cost-based models for intrusion detection poses interesting new research questions. We describe our initial ideas about how to evaluate intrusion detection systems using cost models learned during our work on fraud detection
Keywords :
costing; data mining; distributed databases; financial data processing; fraud; security of data; JAM project; MADAM ID; banks; cost-based metrics; cost-based modeling; distributed data mining system; financial information systems; fraud detection systems; fraudulent transaction; intrusion detection; learning systems; network intrusion data; networked information systems; statistical metrics; Computer science; Cost accounting; Data mining; Ear; Electrical capacitance tomography; Identity-based encryption; Intrusion detection; Java; Statistical analysis; Statistics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
DARPA Information Survivability Conference and Exposition, 2000. DISCEX '00. Proceedings
Conference_Location :
Hilton Head, SC
Print_ISBN :
0-7695-0490-6
Type :
conf
DOI :
10.1109/DISCEX.2000.821515
Filename :
821515
Link To Document :
بازگشت