Title :
A Comparative Analysis of the Influence of Methods for Outliers Detection on the Performance of Data Driven Models
Author :
Di Bella, A. ; Fortuna, L. ; Graziani, Salvatore ; Napoli, G. ; Xibilia, M.G.
Author_Institution :
Univ. degli Studi di Catania, Catania
Abstract :
In this paper we describe, test, and compare the performance of a number of techniques used for outlier detection to improve modeling capabilities of soft sensors on the basis of the quality of available data. We analyze methods based on standard deviation of population, on residuals of a linear input-output regression, on the structure correlation of the data, on principal components and partial least squares (both linear and nonlinear) in multi dimensional space (2D, 3D, 4D), on Q and T2 statistics, on the distance of each observation from the mean of the data, and on the Mahalanobis distance. We apply techniques for outlier detection both on a fictitious model data and on real data acquired from a sulfur recovery unit of a refinery. We show that outlier removal almost always improves modeling capabilities of considered techniques.
Keywords :
industrial plants; measurement theory; process monitoring; sensors; statistical analysis; Mahalanobis distance; Q statistics; T2 statistics; data driven models; data structure correlation; linear input-output regression; multidimensional space; outlier detection; partial least squares; refinery sulfur recovery unit; soft sensors; Computer industry; Industrial control; Instrumentation and measurement; Least squares methods; Performance analysis; Refining; Sensor phenomena and characterization; Software measurement; Statistics; Testing; empirical nonlinear modeling; outlier detection; refinery; soft sensors;
Conference_Titel :
Instrumentation and Measurement Technology Conference Proceedings, 2007. IMTC 2007. IEEE
Conference_Location :
Warsaw
Print_ISBN :
1-4244-0588-2
DOI :
10.1109/IMTC.2007.379222