Title :
An ensemble rank learning approach for gene prioritization
Author :
Po-Feng Lee ; Von-Wun Soo
Author_Institution :
Dept. of Comput. Sci., Nat. Tsing Hua Univ., Hsinchu, Taiwan
Abstract :
Several different computational approaches have been developed to solve the gene prioritization problem. We intend to use the ensemble boosting learning techniques to combine variant computational approaches for gene prioritization in order to improve the overall performance. In particular we add a heuristic weighting function to the Rankboost algorithm according to: 1) the absolute ranks generated by the adopted methods for a certain gene, and 2) the ranking relationship between all gene-pairs from each prioritization result. We select 13 known prostate cancer genes in OMIM database as training set and protein coding gene data in HGNC database as test set. We adopt the leave-one-out strategy for the ensemble rank boosting learning. The experimental results show that our ensemble learning approach outperforms the four gene-prioritization methods in ToppGene suite in the ranking results of the 13 known genes in terms of mean average precision, ROC and AUC measures.
Keywords :
cancer; genetics; genomics; heuristic programming; learning (artificial intelligence); medical computing; proteins; AUC measure; HGNC database; OMIM database; ROC measure; Rankboost algorithm; ToppGene suite; absolute ranks; ensemble rank boosting learning techniques; gene-pairs; gene-prioritization method; heuristic weighting function; leave-one-out strategy; mean average precision; prostate cancer genes; protein coding gene data; ranking relationship; training set; variant computational approaches; Algorithm design and analysis; Bioinformatics; Boosting; Databases; Diseases; Markov processes; Training;
Conference_Titel :
Engineering in Medicine and Biology Society (EMBC), 2013 35th Annual International Conference of the IEEE
Conference_Location :
Osaka
DOI :
10.1109/EMBC.2013.6610298