Title :
Extending statistical data quality improvement with explicit domain models
Author :
Solomakhina, Nina ; Hubauer, Thomas ; Lamparter, Steffen ; Roshchin, Mikhail ; Grimm, Stephan
Author_Institution :
Tech. Univ. of Vienna, Vienna, Austria
Abstract :
Automatic processing of data for the purpose of determining operating states and identifying faults has become essential for many modern industrial systems. Typical sources of this data include hundreds of sensors mounted at the industrial machinery measuring qualities such as temperature, vibration, pressure, and many more. However, sensors are complex technical devices, which means that they can fail and their readings may contain noise or imprecise values. Such low quality data makes it hard to solve the original task of assessing system and process status. We present an approach which brings together several well-known techniques from computer science and statistics and enhances monitoring of technical systems by improving results of detection and correction of data quality issues in sensor data. The application domain and the dependencies between its objects are represented as a knowledge-based model, while statistics identifies data anomalies, such as outlying or missing values, in sensor measurement data. Combining information from the knowledge-based model and statistical computations allows to validate and improve data analysis results. We demonstrate the proposed approach on a real-world industrial use case from the power generation domain. Our evaluation shows that the combined solution improves precision indexes while maintaining high accuracy and recall values.
Keywords :
ontologies (artificial intelligence); sensor fusion; statistical analysis; application domain; automatic data processing; computer science; data analysis results; data anomalies; data quality issue correction; data quality issue detection; explicit domain models; fault identification; industrial machinery; industrial systems; knowledge-based model; ontology; operating states; power generation domain; pressure measurement; quality measurement; sensor data; sensor measurement data; sensors; statistical computations; statistical data quality improvement; statistics; technical system monitoring; temperature measurement; vibration measurement; Data models; OWL; Ontologies; Sensor phenomena and characterization; Temperature measurement; Turbines; data quality; industrial control; knowledge-based methods; statistics; time series;
Conference_Titel :
Industrial Informatics (INDIN), 2014 12th IEEE International Conference on
Conference_Location :
Porto Alegre
DOI :
10.1109/INDIN.2014.6945602