DocumentCode :
3305219
Title :
Feature selection for high dimensionality data in chemical sensing
Author :
Pardo, Matteo ; Sberveglieri, Giorgio ; Gardner, Julian W.
Author_Institution :
Sensor Lab., Brescia Univ.
fYear :
2005
fDate :
Oct. 30 2005-Nov. 3 2005
Abstract :
We present the results obtained by applying different feature selection (FS) methods to the analysis of chemical sensing data. Four datasets have been considered: a mass spectrometer (MS) dataset with 505 peaks (features) and three datasets from metal oxide sensor based E-noses with 30 features. For the MS data we first filter features individually and then apply optimal and suboptimal FS search strategies. The three E-nose datasets present different discrimination hardness and different number of data. For every classification problem three classifiers are tested (3NN, LDA, QDA). FS performance is calculated by two nested cross-validation cycles in order to prevent selection bias. FS always increases the correct test set classification ratio (sometimes the increase is substantial) and the discriminative features are recognized. To our knowledge, this is the first benchmarking study evaluating several FS options (classifiers and search strategies, including suboptimal ones) for the classification of different chemical sensing datasets
Keywords :
electronic noses; feature extraction; mass spectrometers; E-nose datasets; chemical sensing data; chemical sensor; electronic nose; feature selection method; high dimensionality data; mass spectrometer dataset; metal oxide sensor; Chemical analysis; Chemical sensors; Data mining; Feature extraction; Laboratories; Linear discriminant analysis; Sensor arrays; Sensor phenomena and characterization; Sensor systems; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Sensors, 2005 IEEE
Conference_Location :
Irvine, CA
Print_ISBN :
0-7803-9056-3
Type :
conf
DOI :
10.1109/ICSENS.2005.1597880
Filename :
1597880
Link To Document :
بازگشت