Title :
Principal discriminants analysis for small-sample-size problems: application to chemical sensing
Author :
Wang, M. ; Perera, A. ; Gutierrez-Osuna, R.
Author_Institution :
Dept. of Comput. Sci., Texas A&M Univ., College Station, TX, USA
Abstract :
Two dimensionality reduction techniques are widely used to analyze data from chemical sensor arrays: Fisher´s linear discriminants analysis (LDA) and principal components analysis (PCA). LDA finds the directions of maximum discrimination in classification problems, but has a tendency to overfit when the ratio of training samples to dimensionality is low, as is commonly the case in chemical sensor array problems. PCA is more robust to overfitting but, being a variance model, fails to capture discriminatory information in low-variance sensors. In this article we propose a hybrid model, termed principal discriminants analysis (PDA), which incorporates both LDA and PCA criteria by means of a regularization parameter. The model is characterized on a synthetic dataset and validated with experimental data from an array of 15 metal-oxide sensors exposed to five varieties of roasted coffee beans. Our results show that PDA provides higher predictive accuracy than LDA or PCA alone. In addition, the model is able to find a trade-off between discriminant- and variance-based projections according to where information is located in the distribution of the data.
Keywords :
array signal processing; chemical sensors; gas sensors; principal component analysis; LDA; PCA; chemical sensing; chemical sensor arrays; classification problems; data distribution; dimensionality reduction techniques; discriminant-based projections; discriminatory information; linear discriminants analysis; low-variance sensors; maximum discrimination directions; metal-oxide sensors; predictive accuracy; principal components analysis; principal discriminants analysis; regularization parameter; roasted coffee beans; small-sample-size problems; synthetic dataset characterized model; training samples; variance model; variance-based projections; Application software; Chemical analysis; Chemical sensors; Data analysis; Eigenvalues and eigenfunctions; Linear discriminant analysis; Personal digital assistants; Principal component analysis; Sensor arrays; Sensor phenomena and characterization;
Conference_Titel :
Sensors, 2004. Proceedings of IEEE
Print_ISBN :
0-7803-8692-2
DOI :
10.1109/ICSENS.2004.1426234