Title :
Study on online outlier detection method based on principal component analysis and Bayesian classification
Author :
Wang Yalin ; Xie Wenping ; Wang Xiaoli ; Chen Bin
Author_Institution :
Sch. of Inf. Sci. & Eng., Central South Univ., Changsha, China
Abstract :
Outliers detection is an important part of the online model prediction. Due to the difficulty in determining the suitable control limits for traditional PCA method for the outlier detection, an online outlier detection method is presented based on principal component analysis and Bayesian theory. Firstly, principal component analysis (PCA) is used to calculate Q statistics with the training data collected in the normal process. Secondly, using the priori knowledge and the sample data which is updated by sliding window technology, the Q statistic is classified from the normal process and the disturbance process by the Bayesian classification method. If the current sample is from the disturbance process, it should be further determined that the value is caused by the case of the abnormal value or the process changes, which realizes the online outlier detection for the process data. The simulation using the data from the UCI machine learning repository shows that the proposed method has the lower misjudgment rate compared with the traditional PCA method, and it can effectively identify the abnormal values and process changes in the process data. The simulation result verifies the effectiveness of the proposed method.
Keywords :
Bayes methods; learning (artificial intelligence); pattern classification; principal component analysis; Bayesian classification method; Bayesian theory; PCA method; Q-statistics; UCI machine learning repository; abnormal value; control limits; disturbance process; misjudgment rate; normal process; online model prediction; online outlier detection method; principal component analysis; priori-knowledge; process changes; sample data; sliding window technology; training data collection; Abstracts; Analytical models; Bayes methods; Educational institutions; Electronic mail; Information science; Principal component analysis; Bayesian Classification Approach; Outliers detection; Principal Component Analysis; Sliding Window Technique;
Conference_Titel :
Control Conference (CCC), 2013 32nd Chinese
Conference_Location :
Xi´an