• DocumentCode
    2319591
  • Title

    An integrative bioinformatics approach for identifying subtypes and subtype-specific drivers in cancer

  • Author

    Chen, Peikai ; Hung, Y.S. ; Fan, Yubo ; Wong, Stephen T -C

  • Author_Institution
    Univ. of Hong Kong, Hong Kong, China
  • fYear
    2012
  • fDate
    9-12 May 2012
  • Firstpage
    169
  • Lastpage
    176
  • Abstract
    Cancer is a complex disease and within a cancer, subtypes of patients with distinct behaviors often exist. The subtypes might have been caused by different hits, such as copy number aberrations (CNAs) and point mutations, on different pathways/cells-of-origin in a common tissue/organ. Identifying the subtypes with subtype-specific drivers, i.e., hits, is key to the understanding of cancer and development of novel treatments. Here, we report the development of an integrative method to identify the subtypes of cancer. Specifically, we consider CNAs and their impact on gene expressions. Based on these relations, we propose an iterative approach that alternates between kernel based gene expression clustering and gene signature selection. We applied the method to datasets of the pediatric cancer medulloblastoma (MB). The consensus number of clusters quickly converges to three; and for each of these three subtypes, the signature detection also converges to a consistent set of a few hundred highly functionally related genes. For each of the subtypes, we correlate its signature with the set of within-subtype recurrent CNA-affected genes for identifying drivers. The top-ranked driver candidates are found to be enriched with known pathways in certain subtypes of MB as well as containing novel genes that might reveal new understandings for other subtypes.
  • Keywords
    bioinformatics; cancer; data mining; genetics; iterative methods; medical computing; molecular biophysics; molecular configurations; paediatrics; pattern clustering; cancer subtype identification; copy number aberrations; gene expressions; gene signature selection; integrative bioinformatics approach; iterative approach; kernel based gene expression clustering; medulloblastoma; pediatric cancer; point mutations; recurrent CNA affected genes; subtype specific driver identification; Biological systems; Cancer; Clustering algorithms; Gene expression; Measurement; Smoothing methods; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence in Bioinformatics and Computational Biology (CIBCB), 2012 IEEE Symposium on
  • Conference_Location
    San Diego, CA
  • Print_ISBN
    978-1-4673-1190-8
  • Type

    conf

  • DOI
    10.1109/CIBCB.2012.6217227
  • Filename
    6217227