Title :
Association rules based algorithm for identifying outlier transactions in data stream
Author :
Kao, Li-Jen ; Huang, Yo-Ping
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Hwa Hsia Inst. of Technol., Taipei, Taiwan
Abstract :
Most outlier detection algorithms are proposed to discover outlier patterns from static databases. Those algorithms are infeasible for instant identification of outlier patterns in data streams that continuously arriving and unbounded data serve as the data sources in many applications such as sensor data feeding. In this paper an association rules based method is proposed to find outlier patterns in data streams. The presented work segments transactions from data streams and then finds approximate frequent itemsets with single data scan instead of requiring multiple scans. Based on the derived association rules some transaction can be identified as outliers if their outlier degrees are higher than a predefined threshold. The proposed method not only just finds the outlier patterns but also identifies the most possible items that induce the abnormal transactions in the data streams. Efficiency comparisons with frequent itemsets-based work are also done to verify the effectiveness of the proposed framework.
Keywords :
data mining; database management systems; transaction processing; abnormal transactions; association rules based algorithm; data sources; data streams; frequent itemsets; instant identification; multiple scans; outlier detection algorithms; outlier patterns; outlier transactions; predefined threshold; sensor data feeding; single data scan; static databases; unbounded data; Accuracy; Algorithm design and analysis; Association rules; Dairy products; Itemsets; association rules; data stream; frequent itemsets; outlier detection;
Conference_Titel :
Systems, Man, and Cybernetics (SMC), 2012 IEEE International Conference on
Conference_Location :
Seoul
Print_ISBN :
978-1-4673-1713-9
Electronic_ISBN :
978-1-4673-1712-2
DOI :
10.1109/ICSMC.2012.6378285