DocumentCode :
1840977
Title :
On relations between Genes and metagenes obtained via gradient-based matrix factorization
Author :
Huang, Tian-Hsiang ; Nikulin, Vladimir ; McLachlan, Geoffrey J.
Author_Institution :
Dept. of Math., Univ. of Queensland, Brisbane, QLD, Australia
fYear :
2010
fDate :
13-15 July 2010
Firstpage :
17
Lastpage :
22
Abstract :
The high dimensionality of microarray data, the expressions of thousands of genes in a much smaller number of samples, presents challenges that affect the applicability of the analytical results. In principle, it would be better to describe the data in terms of a small number of metagenes, derived as a result of matrix factorization, which could reduce noise while still capturing the essential features of the data. Our system represents a two-step procedure. Firstly, using a gradient-based matrix factorization (GMF) proposed in our previous study, we reduce a given microarray to a few metagenes. Secondly, we demonstrate the sensitivity of the system using a linear support vector machine (SVM). We conducted experiments in this paper on three real datasets. The standard leave-one-out (LOO) scheme was employed in order to evaluate the quality of the system. The evaluation with LOO misclassification rates (LMR) demonstrates that metagenes acquired as an outcome of our method can capture the important biological features of the data. In addition, we considered links between our model and the gene ontology GO taking into account the pathway records of the Kyoto Encyclopedia of Genes and Genomes (KEGG) extracted from candidate gene sets according to absolute value of their correlations with the metagenes. This knowledge may be particularly useful in order to improve interpretability of the results presented to biologists.
Keywords :
biology computing; genetics; genomics; matrix decomposition; molecular biophysics; support vector machines; gene ontology; genes; genomes; gradient-based matrix factorization; matrix factorization; metagenes; microarray data; misclassification rates; Colon; Correlation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Complex Medical Engineering (CME), 2010 IEEE/ICME International Conference on
Conference_Location :
Gold Coast, QLD
Print_ISBN :
978-1-4244-6841-6
Type :
conf
DOI :
10.1109/ICCME.2010.5558880
Filename :
5558880
Link To Document :
بازگشت