Title :
A novel data mining approach for differential genes identification in small cancer expression data
Author :
Al-Watban, Abdullatif ; Yang, Zi Hua ; Everson, Richard ; Yang, Zheng Rong
Author_Institution :
Sch. of Biosci., Univ. of Exeter, Exeter, UK
Abstract :
The simple t test is the standard approach for differential gene identification but is not suited to data with low replication. Here, we propose using a multi-scale Gaussian (MSG) to improve the detection accuracy of differential cancerous genes in low replicate microarray experiment. By modelling the gene expression densities as Gaussian scale mixtures, the differential genes are then identified using the estimated density function. We use simulated data and data from GEO to demonstrate that the new algorithm compares favourably to four benchmark algorithms for cancer gene expression data with low replicate.
Keywords :
Gaussian processes; biology computing; cancer; data mining; genetics; GEO; Gaussian scale mixture; MSG; cancer gene expression data; data mining; density function estimation; detection accuracy; differential cancerous genes; differential genes identification; gene expression density modelling; low replicate microarray experiment; multiscale Gaussian; Algorithm design and analysis; Cancer; Clustering algorithms; Gene expression; Prediction algorithms; Sensitivity; Standards;
Conference_Titel :
Health Informatics and Bioinformatics (HIBIT), 2012 7th International Symposium on
Conference_Location :
Nevsehir
Print_ISBN :
978-1-4673-0879-3
DOI :
10.1109/HIBIT.2012.6209033