DocumentCode :
2045932
Title :
On node selection for classification in correlated data sets
Author :
Cristescu, Razvan
Author_Institution :
Dept. of Ind. Design, Tech. Univ. of Eindhoven, Eindhoven
fYear :
2008
fDate :
19-21 March 2008
Firstpage :
1064
Lastpage :
1068
Abstract :
Consider a system which can be in a finite number of states. Given a large number of characteristics which are measured, representing the system, we are concerned with the selection of a subset of characteristics of (small) given cardinality, for which the classification of the system according to one of the states in the state set is optimal according to the Rayleigh quotient criterion. This problem is relevant in various scenarios where a few explanatory variables have to be selected from a large set of candidates, including sensor selection in sensor networks, classification in image processing, and feature selection in data mining for bioinformatics applications. We show that the optimization amounts to finding the submatrix of the features covariance matrix for which the sum of elements of the inverse is maximized, and we present bounds which relate this optimization to a similar metric based on elements of the original covariance matrix.
Keywords :
covariance matrices; pattern classification; Rayleigh quotient criterion; correlated data set classification; features covariance matrix; node selection; Bioinformatics; Biomarkers; Biosensors; Covariance matrix; Data mining; Diseases; Image processing; Image sensors; Sensor phenomena and characterization; Sensor systems;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Sciences and Systems, 2008. CISS 2008. 42nd Annual Conference on
Conference_Location :
Princeton, NJ
Print_ISBN :
978-1-4244-2246-3
Electronic_ISBN :
978-1-4244-2247-0
Type :
conf
DOI :
10.1109/CISS.2008.4558676
Filename :
4558676
Link To Document :
بازگشت