• DocumentCode
    814810
  • Title

    Jointly analyzing gene expression and copy number data in breast cancer using data reduction models

  • Author

    Berger, J.A. ; Hautaniemi, S. ; Mitra, S.K. ; Astola, J.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., California Univ., Santa Barbara, CA
  • Volume
    3
  • Issue
    1
  • fYear
    2006
  • Firstpage
    2
  • Lastpage
    16
  • Abstract
    With the growing surge of biological measurements, the problem of integrating and analyzing different types of genomic measurements has become an immediate challenge for elucidating events at the molecular level. In order to address the problem of integrating different data types, we present a framework that locates variation patterns in two biological inputs based on the generalized singular value decomposition (GSVD). In this work, we jointly examine gene expression and copy number data and iteratively project the data on different decomposition directions defined by the projection angle thetas in the GSVD. With the proper choice of thetas, we locate similar and dissimilar patterns of variation between both data types. We discuss the properties of our algorithm using simulated data and conduct a case study with biologically verified results. Ultimately, we demonstrate the efficacy of our method on two genome-wide breast cancer studies to identify genes with large variation in expression and copy number across numerous cell line and tumor samples. Our method identifies genes that are statistically significant in both input measurements. The proposed method is useful for a wide variety of joint copy number and expression-based studies. Supplementary information is available online, including software implementations and experimental data
  • Keywords
    biological organs; cancer; cellular biophysics; data reduction; genetics; gynaecology; medical computing; molecular biophysics; singular value decomposition; tumours; breast cancer; cell line; copy number data; data reduction models; gene expression; generalized singular value decomposition; tumor samples; variation patterns; Bioinformatics; Biological information theory; Biological system modeling; Breast cancer; DNA; Gene expression; Genomics; Humans; Proteins; Singular value decomposition; CGH microarray data; DNA copy numbers; Generalized singular value decomposition; breast cancer.; cDNA microarray data; gene expression; Breast Neoplasms; Cell Line, Tumor; Databases, Genetic; Gene Dosage; Gene Expression; Gene Expression Profiling; Genetic Markers; Humans; Information Storage and Retrieval; Models, Genetic; Neoplasm Proteins; Oligonucleotide Array Sequence Analysis; Reproducibility of Results; Sensitivity and Specificity; Tumor Markers, Biological;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2006.10
  • Filename
    1588842