Title :
Biological data classification using rough sets and support vector machines
Author :
Zhao, Yanjun ; Zhang, Yanqing ; Xiong, Naixue
Author_Institution :
Dept. of Comput. Sci., Georgia State Univ., Atlanta, GA, USA
Abstract :
Biological data classification is an important data mining research area in biomedical applications. The current challenge problem is that there is a large number of condition attributes (features) in biological data, with which it is difficult for classification methods to deal. In this paper, a new approach based on rough sets and support vector machines is proposed for biological data classification. Rough sets theory is a good mathematical tool to make attribute reduction by removing redundant condition attributes (features). Furthermore, the new rough support vector machines use the new information entropy of rough sets as uncertainty measurement to reflect the whole uncertainty information. Simulation results demonstrate that this new approach is useful in terms of classification accuracy and the number of attributes.
Keywords :
data mining; medical computing; rough set theory; support vector machines; biological data classification; biomedical applications; data mining; rough sets; support vector machines; uncertainty measurement; Biology computing; Biomedical computing; Data mining; Fuzzy sets; Information entropy; Information systems; Measurement uncertainty; Rough sets; Support vector machine classification; Support vector machines; attribute reduction; biological data classification; information entropy; rough sets; support vector machines;
Conference_Titel :
Fuzzy Information Processing Society, 2009. NAFIPS 2009. Annual Meeting of the North American
Conference_Location :
Cincinnati, OH
Print_ISBN :
978-1-4244-4575-2
Electronic_ISBN :
978-1-4244-4577-6
DOI :
10.1109/NAFIPS.2009.5156445