Title :
Optimal haplotype assembly from high-throughput mate-pair reads
Author :
Govinda M. Kamath;Eren Şaşoğlu;David Tse
Author_Institution :
Department of Electrical Engineering, Stanford University, USA
Date :
6/1/2015
Abstract :
Humans have 23 pairs of homologous chromosomes. The two chromosomes in each homologous pair are identical except at certain documented positions called single nucleotide polymorphisms (SNPs). A haplotype of an individual is the pair of sequences of SNPs on the two homologous chromosomes. In this paper, we study the problem of inferring the haplotypes of an individual from mate-pair reads of their genome. We give a simple formula for the coverage needed for haplotype assembly under a generative model. The analysis leverages connections between this problem and the decoding of convolutional codes.
Keywords :
"Biological cells","Bioinformatics","Assembly","Genomics","Silicon","Reliability","Random variables"
Conference_Title :
Information Theory (ISIT), 2015 IEEE International Symposium on
Electronic_ISBN :
2157-8117
DOI :
10.1109/ISIT.2015.7282588