DocumentCode :
1765844
Title :
A Novel Machine Learning Approach Toward Quality Assessment of Sensor Data
Author :
Rahman, Aminur ; Smith, D.V. ; Timms, G.
Author_Institution :
Comput. Inf., CSIRO, Hobart, TAS, Australia
Volume :
14
Issue :
4
fYear :
2014
fDate :
41730
Firstpage :
1035
Lastpage :
1047
Abstract :
A novel machine learning approach to assess the quality of sensor data using an ensemble classification framework is presented in this paper. The quality of sensor data is indicated by discrete quality flags that indicate the level of uncertainty associated with a sensor reading. Depending on the domain and the problem under consideration, the level of uncertainty is different and thus unsupervised methods like outlier detection fails to match the expectation. The quality flags are normally assigned by domain experts. Considering the volume of sensor data, manual assignment is a laborious task and subject to human error. Given a representative set of labelled data, a supervised classification approach is thus a feasible alternative. The nature of sensor data, however, poses some challenges to the classification task. Data of dubious quality exists in such data sets with very small frequency leading to the class imbalance problem. We thus adopt a cluster oriented sampling approach to address the imbalance issue. In addition, it is beneficial to train multiple classifiers to improve the overall classification accuracy. We thus produce multiple under-sampled training sets using cluster oriented sampling and train base classifiers on each of them. Decisions produced by the base classifiers are fused into a single decision using majority voting. We have evaluated the proposed ensemble classification framework by assessing the quality of marine sensor data obtained from sensors situated at Sullivans Cove, Hobart, Australia. Experimental results reveal that the proposed framework agrees with expert judgement with high accuracy and achieves superior classification performance than other state-of-the-art approaches.
Keywords :
learning (artificial intelligence); pattern classification; sampling methods; sensors; cluster oriented sampling approach; ensemble classification framework; machine learning approach; marine sensor; multiple classifiers; multiple under-sampled training sets; quality assessment; sensor data; train base classifiers; Australia; Biosensors; Conductivity; Quality assessment; Temperature sensors; Training; Sensors; class balancing; ensemble classifier; quality assessment of sensor data; time series classification;
fLanguage :
English
Journal_Title :
Sensors Journal, IEEE
Publisher :
ieee
ISSN :
1530-437X
Type :
jour
DOI :
10.1109/JSEN.2013.2291855
Filename :
6671378
Link To Document :
بازگشت