DocumentCode :
2336326
Title :
Hybrid rebalancing approach to handle imbalanced dataset for fault diagnosis in manufacturing systems
Author :
Teck, Ching Chuen ; Xiang, Li ; Junhong, Zhou ; Xiaoli, Li ; Hong, Cao ; Woon, David
Author_Institution :
Singapore Inst. of Manuf. Technol., A-Star, Singapore, Singapore
fYear :
2012
fDate :
18-20 July 2012
Firstpage :
1224
Lastpage :
1229
Abstract :
In a mature manufacturing system, the occurrence of operating fault conditions is few and far between. Majority of the data collected from such systems typically exhibits normal operating behaviours. This phenomenon inadvertently creates an imbalance between the class distributions of the data. The imbalance ratio may fall in the range of 1:100 to 1:1000 for every fault condition data available. The nature of such datasets thus makes it harder to build reliable models for accurate fault diagnosis in Condition-Based Maintenance (CBM) due to the lack of learning exemplars of the fault class. Conventional machine learning algorithms do not handle imbalanced datasets well and generally would produce poor classification results. To improve the fault diagnosis reliability on class-imbalanced datasets, this paper proposes a hybrid rebalancing approach called Hybrid Support Vector Machine (SVM) under sampling with Mega Trend Diffusion (MTD) oversampling. Our proposed approach rebalances the dataset by (1) Reducing the amount of normal condition data whilst retaining the most informative ones and (2) Boosting the number of fault condition data to match the size of the normal data. This approach is highly applicable to the manufacturing setting as there is a level of predictability to the nature of data, i.e. data of different fault conditions tend to cluster together in the feature space. Thus, manipulating the data at this level is a logical step. As such, learning effectively with the limited available fault data can translate to significantly cost-saving. Our approach is demonstrated and validated with a case study on bearing fault detection. To end, some conclusions and future works are discussed.
Keywords :
fault diagnosis; learning (artificial intelligence); maintenance engineering; manufacturing data processing; manufacturing systems; support vector machines; bearing fault detection; class-imbalanced datasets; condition-based maintenance; fault diagnosis; hybrid SVM; hybrid rebalancing approach; hybrid support vector machine; machine learning; manufacturing systems; mega trend diffusion oversampling; Accuracy; Conferences; Fault diagnosis; Measurement; Support vector machines; Training; Class-Imbalanced dataset Learning; Condition-based Maintenance; Data-Mining; Fault diagnosis; Mega Trend Diffusion; Support Vector Machine;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Industrial Electronics and Applications (ICIEA), 2012 7th IEEE Conference on
Conference_Location :
Singapore
Print_ISBN :
978-1-4577-2118-2
Type :
conf
DOI :
10.1109/ICIEA.2012.6360910
Filename :
6360910
Link To Document :
بازگشت