DocumentCode :
1220
Title :
Common Copy Number Variation Detection From Multiple Sequenced Samples
Author :
Junbo Duan ; Hong-Wen Deng ; Yu-Ping Wang
Author_Institution :
Dept. of Biomed. Eng., Xi´an Jiaotong Univ., Xi´an, China
Volume :
61
Issue :
3
fYear :
2014
fDate :
Mar-14
Firstpage :
928
Lastpage :
937
Abstract :
Common copy number variations (CNVs) are small regions of genomic variations at the same loci across multiple samples, which can be detected with high resolution from next-generation sequencing (NGS) technique. Multiple sequencing data samples are often available from genomic studies; examples include sequences from multiple platforms and sequences from multiple individuals. By integrating complementary information from multiple data samples, detection power can be potentially improved. However, most of current CNV detection methods often process an individual sequence sample, or two samples in an abnormal versus matched normal study; researches on detecting common CNVs across multiple samples have been very limited but are much needed. In this paper, we propose a novel method to detect common CNVs from multiple sequencing samples by exploiting the concurrency of genomic variations in read depth signals derived from multiple NGS data. We use a penalized sparse regression model to fit multiple read depth profiles, based on which common CNV identification is formulated as a change-point detection problem. Finally, we validate the proposed method on both simulation and real data, showing that it can give both higher detection power and better break point estimation over several published CNV detection methods.
Keywords :
diseases; genomics; patient diagnosis; physiological models; regression analysis; CNV detection methods; break point estimation; complementary information; copy number variation detection; genomic variations; multiple NGS data; multiple data samples; multiple read depth profiles; multiple sequenced samples; multiple sequencing data samples; next-generation sequencing technique; read depth signals; sparse regression model; Bioinformatics; Dispersion; Estimation; Genomics; Matching pursuit algorithms; Sequential analysis; Silicon carbide; $ell$-0 norm penalty; Copy number variation (CNV); Schur complement; model selection; next generation sequencing (NGS); structured sparse modeling; the 1000 genomes project;
fLanguage :
English
Journal_Title :
Biomedical Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9294
Type :
jour
DOI :
10.1109/TBME.2013.2292588
Filename :
6675802
Link To Document :
بازگشت