DocumentCode :
2447678
Title :
Novelty generating machine
Author :
Li, Shukai
Author_Institution :
Center for Comput. Intell., Nanyang Technol. Univ., Singapore, Singapore
fYear :
2011
fDate :
14-16 Oct. 2011
Firstpage :
90
Lastpage :
95
Abstract :
Novelty detection is one of primary tasks in data mining and machine learning. The task is to differentiate unseen outliers from normal patterns. Though novelty detection has been well-studied for many years and has found a wide range of applications, identifying outliers is still very challenging because of the absence or scarcity of outliers. We observe several characteristics of outliers and normal patterns. First, normal patterns are usually grouped together and form some clusters in high density regions of the data. Second, outliers are very different from the normal patterns, and in turn these outliers are far away from the normal patterns. Third, the number of outliers is very small compared with the size of the dataset. Based on these observations, we can envisage that the decision boundary between outliers and normal patterns usually lies in some low density regions of the data, which is referred to as cluster assumption. The resultant optimization problem is in form of a mixed integer programming. Then, we present a cutting plane algorithm together with multiple kernel learning techniques to solve its convex relaxation. Moreover, we make use of the scarcity of outliers to find a violating solution in cutting plane algorithm.
Keywords :
convex programming; data mining; integer programming; learning (artificial intelligence); pattern clustering; convex relaxation; cutting plane algorithm; data mining; decision boundary; kernel learning technique; machine learning; mixed integer programming; normal pattern; novelty detection; novelty generating machine; outlier; Clustering algorithms; Kernel; Labeling; Linear programming; Optimization; Support vector machines; Vectors; Cluster assumption; Cutting plane algorithm; Mixed integer programming; Novelty detection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Soft Computing and Pattern Recognition (SoCPaR), 2011 International Conference of
Conference_Location :
Dalian
Print_ISBN :
978-1-4577-1195-4
Type :
conf
DOI :
10.1109/SoCPaR.2011.6089101
Filename :
6089101
Link To Document :
بازگشت