DocumentCode :
1639413
Title :
Structure feature selection for chemical compound classification
Author :
Fei, Hongliang ; Huan, Jun
Author_Institution :
Dept. of Electr. Eng. & Comput. Sci., Univ. of Kansas, Lawrence, KS
fYear :
2008
Firstpage :
1
Lastpage :
6
Abstract :
With the development of highly efficient cheminformatics data collection technology, classification of chemical structure data emerges as an important topic in cheminformatics. Towards building highly accurate predictive models for chemical data, here we present an efficient feature selection method. In our method, we first represent a chemical structure by its 2D connectivity map. We then use frequent subgraph mining to identify structural fragments as features for graph classification. Different from existing methods, we consider the spatial distribution of the subgraph features in the graph data and select those ones that have consistent spatial locations. We have applied our feature selection methods to several cheminformatics benchmarks. Our experimental results demonstrate a significant improvement of prediction as compared to the state-of-the-art feature selection methods.
Keywords :
bioinformatics; chemical structure; chemistry computing; data mining; feature extraction; 2D connectivity map; chemical compound classification; chemical structure data; cheminformatics; data collection technology; graph classification; structure feature selection; subgraph mining; Biological systems; Biology computing; Buildings; Chemical analysis; Chemical compounds; Chemical technology; Feature extraction; Filtering; Kernel; Principal component analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
BioInformatics and BioEngineering, 2008. BIBE 2008. 8th IEEE International Conference on
Conference_Location :
Athens
Print_ISBN :
978-1-4244-2844-1
Electronic_ISBN :
978-1-4244-2845-8
Type :
conf
DOI :
10.1109/BIBE.2008.4696655
Filename :
4696655
Link To Document :
بازگشت