DocumentCode :
1879580
Title :
Clustering-based multi-class classification of complex disease
Author :
Phongwattana, Thiptanawat ; Engchuan, Worrawat ; Chan, Jonathan H.
Author_Institution :
Data & Knowledge Eng. Lab. (D-Lab.), King Mongkut´s Univ. of Technol. Thonburi, Bangkok, Thailand
fYear :
2015
fDate :
28-31 Jan. 2015
Firstpage :
25
Lastpage :
29
Abstract :
Pathway activity data transformed from gene expression profiles may be used to identify tumors, complex diseases progression, and cellular response to stimuli, and so on. Previous researches utilized data mining techniques on pathway activity data to distinguish subjects or to predict the phenotype outcome of subject directly. However, in the multi-class classification, learning those data mixing with population from different groups may result in contaminated model as excessive information is presented. This research, we use a two-stage approach applying clustering to homogenize training data before building the classification model. Hierarchical Clustering is used as a clustering method and Random Forest is used as classifier for evaluating the performance of the proposed method. The results are promising and show that using a clustering technique before classifying improves classification performance in general.
Keywords :
data mining; diseases; learning (artificial intelligence); medical computing; pattern classification; pattern clustering; classification model; clustering method; clustering-based multiclass classification; complex disease; contaminated model; data mining; data mixing; gene expression profiles; hierarchical clustering; learning; pathway activity data; random forest; training data; Accuracy; Bioinformatics; Cancer; Diseases; Euclidean distance; Gene expression; Training; DNA Microarray; Hierarchical Clustering; Pathway Activities; Random Forest; Two-stage Multi-class Analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Knowledge and Smart Technology (KST), 2015 7th International Conference on
Conference_Location :
Chonburi
Print_ISBN :
978-1-4799-6048-4
Type :
conf
DOI :
10.1109/KST.2015.7051475
Filename :
7051475
Link To Document :
بازگشت