• DocumentCode
    3074325
  • Title

    A New Method to Combine Heterogeneous Data Sources for Candidate Gene Prioritization

  • Author

    Li, Yongjin ; Patra, Jagdish C. ; Sun, Jiabao

  • Author_Institution
    Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore
  • fYear
    2009
  • fDate
    22-24 June 2009
  • Firstpage
    123
  • Lastpage
    129
  • Abstract
    How to effectively integrate heterogeneous data sources is becoming extremely challenging, because many useful but noisy data sources are available for the problem at hand.In this paper, for disease gene prioritization problem, we investigated multiple kernels learning (MKL) and N dimensional order statistics (NDOS) method, but found that neither could effectively extract useful information from noisy data. Especially, in MKL algorithm, ineffective data source may be given more weight,which downgrades the effectiveness of the combined kernel. We proposed an improved procedure based on NDOS. We first use cross validation to evaluate each individual data source, and only effective data sources are used in the prioritizations of candidate genes.
  • Keywords
    bioinformatics; diseases; genetics; learning (artificial intelligence); statistical analysis; N dimensional order statistics; candidate gene prioritization; disease; heterogeneous data sources; multiple kernels learning; Bioinformatics; Biomedical engineering; Diseases; Genomics; Kernel; Proteins; Statistics; Support vector machine classification; Support vector machines; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and BioEngineering, 2009. BIBE '09. Ninth IEEE International Conference on
  • Conference_Location
    Taichung
  • Print_ISBN
    978-0-7695-3656-9
  • Type

    conf

  • DOI
    10.1109/BIBE.2009.42
  • Filename
    5211304