Title :
Integrated genotyping of structural variation
Author :
Xian Fan ; Nakhleh, Luay ; Ken Chen
Author_Institution :
Dept. of Comput. Sci., Rice Univ., Houston, TX, USA
Abstract :
Discovering genotype of structural variations (SV) is a new and challenging topic. To the best of our knowledge, estimation of variant allele frequency (VAF) of an SV from both read depth and read alignment has not been done. In this study, we propose BreakDown, a new statistical model that integrates read depth, discordant and split paired-end read alignment to accurately estimate SVs´ genotypes and VAFs. Application to 43 low coverage samples in the 1000 Genomes Project shows that BreakDown produces highly accurate genotypes, especially on heterozygous deletions. Estimation of VAFs on both low- and high-coverage data is shown to converge on expected values with small divergence.
Keywords :
bioinformatics; genetics; genomics; statistical analysis; 1000 Genomes Project; BreakDown statistical model; SV genotype estimation; VAF estimation; discordant; heterozygous deletions; high-coverage data; integrated genotyping; low-coverage data; read depth; split paired-end read alignment; structural variations; variant allele frequency estimation; Accuracy; Bioinformatics; Electric breakdown; Estimation; Genomics; Sociology; Statistics;
Conference_Titel :
Global Conference on Signal and Information Processing (GlobalSIP), 2013 IEEE
Conference_Location :
Austin, TX
DOI :
10.1109/GlobalSIP.2013.6736808