Title :
Cluster pca for outliers detection in high-dimensional data
Author :
Stefatos, George ; Hamza, A.Ben
Author_Institution :
Concordia Institute for Information Systems Engineering, Concordia University, Montréal, QC, Canada
Abstract :
We introduce a new method to detect multiple outliers in high-dimensional datasets using the concepts of hierarchical clustering and principal component analysis. The proposed algorithm is computationally fast and robust to outliers detection. A comparative study with existing techniques is performed on both low and high dimensional datasets. Our experimental results demonstrate an improved performance of our algorithm in comparison with existing multivariate outlier detection techniques.
Keywords :
Clustering algorithms; Control charts; Data engineering; Data mining; Information analysis; Information systems; Manufacturing; Principal component analysis; Robustness; Systems engineering and theory;
Conference_Titel :
Systems, Man and Cybernetics, 2007. ISIC. IEEE International Conference on
Conference_Location :
Montreal, QC, Canada
Print_ISBN :
978-1-4244-0990-7
Electronic_ISBN :
978-1-4244-0991-4
DOI :
10.1109/ICSMC.2007.4414244