DocumentCode :
3074325
Title :
A New Method to Combine Heterogeneous Data Sources for Candidate Gene Prioritization
Author :
Li, Yongjin ; Patra, Jagdish C. ; Sun, Jiabao
Author_Institution :
Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore
fYear :
2009
fDate :
22-24 June 2009
Firstpage :
123
Lastpage :
129
Abstract :
How to effectively integrate heterogeneous data sources is becoming extremely challenging, because many useful but noisy data sources are available for the problem at hand.In this paper, for disease gene prioritization problem, we investigated multiple kernels learning (MKL) and N dimensional order statistics (NDOS) method, but found that neither could effectively extract useful information from noisy data. Especially, in MKL algorithm, ineffective data source may be given more weight,which downgrades the effectiveness of the combined kernel. We proposed an improved procedure based on NDOS. We first use cross validation to evaluate each individual data source, and only effective data sources are used in the prioritizations of candidate genes.
Keywords :
bioinformatics; diseases; genetics; learning (artificial intelligence); statistical analysis; N dimensional order statistics; candidate gene prioritization; disease; heterogeneous data sources; multiple kernels learning; Bioinformatics; Biomedical engineering; Diseases; Genomics; Kernel; Proteins; Statistics; Support vector machine classification; Support vector machines; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and BioEngineering, 2009. BIBE '09. Ninth IEEE International Conference on
Conference_Location :
Taichung
Print_ISBN :
978-0-7695-3656-9
Type :
conf
DOI :
10.1109/BIBE.2009.42
Filename :
5211304
Link To Document :
بازگشت