Title : 
Data representation and feature selection for colorimetric sensor arrays used as explosives detectors
         
        
            Author : 
Alstrøm, Tommy S. ; Larsen, Jan ; Kostesh, Natalie V. ; Jakobsen, Mogens H. ; Boisen, Anja
         
        
            Author_Institution : 
Dept. of Inf. & Math. Modeling, Tech. Univ. of Denmark, Lyngby, Denmark
         
        
        
        
        
        
            Abstract : 
Within the framework of the strategic research project Xsense at the Technical University of Denmark, we are developing a colorimetric sensor array which can be useful for detection of explosives like DNT, TNT, HMX, RDX and TATP and identification of volatile organic compounds in the presence of water vapor in air. In order to analyze colorimetric sensors with statistical methods, the sensory output must be put into numerical form suitable for analysis. We present new ways of extracting features from a colorimetric sensor and determine the quality and robustness of these features using machine learning classifiers. Sensors, and in particular explosive sensors, must not only be able to classify explosives, they must also be able to measure the certainty of the classifier regarding the decision it has made. This means there is a need for classifiers that not only give a decision, but also give a posterior probability about the decision. We will compare K-nearest neighbor, artificial neural networks and sparse logistic regression for colorimetric sensor data analysis. Using the sparse solutions we perform feature selection and feature ranking and compare to Gram-Schmidt orthogonalization.
         
        
            Keywords : 
chemical sensors; colorimeters; data analysis; data structures; explosives; feature extraction; learning (artificial intelligence); maximum likelihood estimation; organic compounds; pattern classification; sensor arrays; statistical analysis; a posterior probability; air; colorimetric sensor arrays; data analysis; data representation; explosives detectors; feature ranking; feature selection; machine learning classifiers; statistical methods; volatile organic compounds; water vapor; Compounds; Data models; Explosives; Feature extraction; Image color analysis; Principal component analysis; Sensor arrays; DNT; Gram-Schmidt orthogonalization; K-nearest neighbor (KNN); TNT; artificial neural networks (ANN); chemo-selective compounds; classification; colorimetric sensor array; explosives detection; feature ranking; sparse logistic regression (SLR);
         
        
        
        
            Conference_Titel : 
Machine Learning for Signal Processing (MLSP), 2011 IEEE International Workshop on
         
        
            Conference_Location : 
Santander
         
        
        
            Print_ISBN : 
978-1-4577-1621-8
         
        
            Electronic_ISBN : 
1551-2541
         
        
        
            DOI : 
10.1109/MLSP.2011.6064615