DocumentCode
3124950
Title
Adaptive neural-fuzzy inference system for classification of rail quality data with bootstrapping-based over-sampling
Author
Yang, Y.Y. ; Mahfouf, M. ; Panoutsos, G. ; Zhang, Q. ; Thornton, S.
Author_Institution
Dept. of Autom. Control & Syst. Eng., Univ. of Sheffield, Sheffield, UK
fYear
2011
fDate
27-30 June 2011
Firstpage
2205
Lastpage
2212
Abstract
An iterative bootstrapping-based data over-sampling strategy is presented in this paper together with an adaptive neural-fuzzy inference system (ANFIS) to deal with a severely imbalanced data modelling problem. As real industrial data are often very large, containing hundreds of process variables and a huge number of data records, the selection of a compact set of input variables becomes critical for any successful modelling and analysis operations. Significant efforts have been devoted to identifying the most relevant input variables through correlation analysis and neural network based forward input selection. An optimal majority to minority class data ratio, which controls the level of data imbalance for model training, is then determined through the iterative bootstrapping process such that the combined sensitivity and specificity performance is optimised. The iterative bootstrapping ANFIS modelling strategy is then applied to a real industrial case study for rail quality classification, with the original data being provided by Tata Steel Europe. Preliminary results show a good overall performance through the iterative bootstrapping data over-sampling ANFIS modelling.
Keywords
adaptive systems; fuzzy reasoning; neural nets; railways; sampling methods; Tata Steel Europe; adaptive neural-fuzzy inference system; analysis operations; bootstrapping-based over-sampling; compact set; correlation analysis; data records; forward input selection; input variables; iterative bootstrapping ANFIS modelling strategy; iterative bootstrapping process; iterative bootstrapping-based data over-sampling strategy; model training; neural network; rail quality classification; rail quality data classification; real industrial data; severely imbalanced data modelling problem; successful modelling; Artificial neural networks; Computational modeling; Correlation; Data models; Rails; Steel; ANFIS; Imbalance data; bootstrapping; data resampling; fuzzy c-means clustering; rail; steel manufacturing;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems (FUZZ), 2011 IEEE International Conference on
Conference_Location
Taipei
ISSN
1098-7584
Print_ISBN
978-1-4244-7315-1
Electronic_ISBN
1098-7584
Type
conf
DOI
10.1109/FUZZY.2011.6007729
Filename
6007729
Link To Document