Title :
Variable and time-lag selection using empirical data
Author :
Souza, Francisco ; Araújo, Rui
Author_Institution :
Dept. of Electr. & Comput. Eng. (DEEC-UC), Univ. of Coimbra, Coimbra, Portugal
Abstract :
The paper proposes a method to select the best variables and respective time-lags for industrial applications when the objective is the estimation of a target variable using the information content of empirical data. No further information is assumed about the process. The problem of jointly selecting the best variables and the respective time-lags is treated as a variable selection problem. This assumption implies an increase of input dimensionality and multicollinearity into input space. Then, a multidimensional mutual information estimator based on the l-nearest neighbor algorithm is used in a forward search procedure to select the best variables and and respective time-lags. To verify the performance of selected variables and delays, the method was successfully applied in two data sets. A least squares support vector machine was used as the main model for the soft sensor in both cases.
Keywords :
data analysis; learning (artificial intelligence); least squares approximations; pattern clustering; process control; process monitoring; production engineering computing; search problems; support vector machines; delays; empirical data; forward search procedure; industrial application; information content; input dimensionality; input multicollinearity; l-nearest neighbor algorithm; least squares support vector machine; machine learning; multidimensional mutual information estimator; process control; process monitoring; process supervision; soft sensor; time-lag selection; variable selection problem; Computational modeling; Correlation; Delay; Estimation; Input variables; Mutual information; Predictive models;
Conference_Titel :
Emerging Technologies & Factory Automation (ETFA), 2011 IEEE 16th Conference on
Conference_Location :
Toulouse
Print_ISBN :
978-1-4577-0017-0
Electronic_ISBN :
1946-0740
DOI :
10.1109/ETFA.2011.6059083