DocumentCode
2039235
Title
Accurate identification of significant aberrations in contaminated cancer genome
Author
Xuchu Hou ; Guoqiang Yu ; Bai Zhang ; Ie-Ming Shih ; Zhen Zhang ; Xiguo Yuan ; Clarke, Roger ; Madhavan, S.
Author_Institution
Bradley Dept. of Electr. & Comput. Eng., Virginia Polytech. Inst. & State Univ., Arlington, VA, USA
fYear
2012
fDate
2-4 Dec. 2012
Firstpage
74
Lastpage
77
Abstract
Somatic Copy Number Alterations (CNAs) are quite common in human cancers. Identifying CNAs and Significant Copy number Aberrations (SCAs) in cancer genomes is a critical task in searching for cancer-associated genes. The advanced genomic technologies, such as SNP array technology, facilitate copy number study at a genome-wide scale with high resolution. However, in reality, due to normal tissue contamination, the observed intensity signals are actually the mixture of copy number signals contributed from both tumor cells and normal cells. This genetic heterogeneity could significantly affect the subsequent copy number analysis and SCAs detection. In order to accurately identify significant aberrations in contaminated cancer genome, we devise an approach including two major steps. We first use a statistical method, Bayesian Analysis of Copy number Mixtures (BACOM) to estimate the normal tissue contamination fraction and recover the “true” copy number profile. Then, based on the recovered profiles, we detect SCAs using Genome-wide Identification of Significant Aberrations in Cancer Genome (SAIC). We comprehensively evaluate the performance of the proposed algorithm on a large number of simulation data. The results show that the algorithm has higher detection power than peer methods including the most popular GISTIC. We then apply the method to the real copy number data of Glioblastoma Multiforme and successfully identified majority of SCAs reported by GISTIC, and some novel SCAs that contain some cancer-associated genes.
Keywords
Bayes methods; cancer; cellular biophysics; data analysis; genetics; genomics; identification; medical computing; statistical analysis; tumours; Bayesian Analysis of Copy number Mixtures; GISTIC; Genome-wide Identification; Glioblastoma Multiforme; SCA detection; SNP array technology; Significant Aberrations in Cancer Genome; Significant Copy number Aberration identification; Somatic Copy Number Alterations; advanced genomic technologies; cancer genomes; cancer-associated genes; copy number analysis; copy number signal mixture; genetic heterogeneity; genome-wide scale; human cancer; intensity signal; normal cell; normal tissue contamination fraction; peer methods; significant aberration; simulation data; statistical method; true copy number profile; tumor cell; copy number alterations; normal tissue contamination; significant copy number aberrations;
fLanguage
English
Publisher
ieee
Conference_Titel
Genomic Signal Processing and Statistics, (GENSIPS), 2012 IEEE International Workshop on
Conference_Location
Washington, DC
ISSN
2150-3001
Print_ISBN
978-1-4673-5234-5
Type
conf
DOI
10.1109/GENSIPS.2012.6507730
Filename
6507730
Link To Document