Title :
A Hybrid Model with a Weighted Voting Scheme for Feature Selection in Machinery Condition Monitoring
Author :
Zhang, Kui ; Ball, Andrew ; Gu, Fengshou ; Li, Yuhua
Author_Institution :
Manchester Univ., Manchester
Abstract :
Feature selection method has become the focus of research in the area of engineering data processing where there exists a large amount of high-dimensional data from the high-frequency acquisition system. For high-dimensional data processing, engineers often resort to feature extraction methods and statistical theories to convert the original features into new features. However, the converted data always lose the engineering meaning of the original features and the choice and use of conversion methods are challenging. In this paper, a hybrid feature selection model is presented to select the most significant input features from all potentially relevant features. The algorithm combines a filter model with a wrapper model. In the filter model, four variable ranking methods are used to pre-rank the candidate features. These four methods including Pearson correlation coefficient, relief algorithm, Fisher score and class separability, measure features from various angles, which leads to different ranking results. Therefore, a weighted voting scheme is introduced to re-rank features based on the degree of significance of the four methods on the classification error rate of radial basis function (RBF) classifier. In wrapper model, a binary search (BS) method and a sequential backward search (SBS) method are utilized to minimize the number of relevant features when promising to keep the classification error rate of RBF classifier below a given threshold. To demonstrate the potential of applying the method to large-scale engineering data processing, a case study is conducted.
Keywords :
condition monitoring; error statistics; feature extraction; mechanical engineering computing; search problems; statistical analysis; Fisher score; Pearson correlation coefficient; binary search method; class separability; classification error rate; engineering data processing; feature extraction methods; filter model; high-dimensional data; high-dimensional data processing; high-frequency acquisition system; hybrid feature selection model; machinery condition monitoring; radial basis function classifier; relief algorithm; sequential backward search method; statistical theories; weighted voting scheme; wrapper model; Automation; Condition monitoring; Data engineering; Data processing; Error analysis; Feature extraction; Filters; Large-scale systems; Machinery; Voting;
Conference_Titel :
Automation Science and Engineering, 2007. CASE 2007. IEEE International Conference on
Conference_Location :
Scottsdale, AZ
Print_ISBN :
978-1-4244-1154-2
Electronic_ISBN :
978-1-4244-1154-2
DOI :
10.1109/COASE.2007.4341697