Title :
Identifying Differentially Expressed Genes via Weighted Rank Aggregation
Author :
Fang, Qiong ; Feng, Jianlin ; Ng, Wilfred
Author_Institution :
Dept. of Comput. Sci. & Eng., HKUST, Hong Kong, China
Abstract :
Identifying differentially expressed genes is an important problem in gene expression analysis, since these genes, exhibiting sufficiently different expression levels under distinct experiment conditions, could be critical for tracing the progression of a disease. In a micro array study, genes are usually sorted in terms of their differentiation abilities with the more differentially expressed genes being ranked higher in the list. As more micro array studies are conducted, rank aggregation becomes an important means to combine such ranked gene lists in order to discover more reliable differentially expressed genes. In this paper, we study a novel weighted gene rank aggregation problem whose complexity is at least NP-hard. To tackle the problem, we develop a new Markov-chain based rank aggregation method called Weighted MC (WMC). The WMC algorithm makes use of rank-based weight information to generate the transition matrix. Extensive experiments on the real biological datasets show that our approach is more efficient in aggregating long gene lists. Importantly, the WMC method is much more robust for identifying biologically significant genes compared with the state-of-the-art methods.
Keywords :
Markov processes; computational complexity; diseases; genetics; lab-on-a-chip; Markov-chain based rank aggregation method; NP-hard problem; differentially expressed gene identification; disease progression tracing; gene expression analysis; transition matrix; weighted gene rank aggregation problem; Complexity theory; Diseases; Gene expression; Itemsets; Markov processes; Reliability; Markov chain; differential expression; existence disagreement; ordering disagreement; rank aggregation;
Conference_Titel :
Data Mining (ICDM), 2011 IEEE 11th International Conference on
Conference_Location :
Vancouver,BC
Print_ISBN :
978-1-4577-2075-8
DOI :
10.1109/ICDM.2011.77