Title :
HIV Haplotype Inference Using a Propagating Dirichlet Process Mixture Model
Author :
Prabhakaran, Suraj ; Rey, Melanie ; Zagordi, Osvaldo ; Beerenwinkel, Niko ; Roth, Volker
Author_Institution :
Dept. of Math. & Comput. Sci., Univ. of Basel, Basel, Switzerland
Abstract :
This paper presents a new computational technique for the identification of HIV haplotypes. HIV tends to generate many potentially drug-resistant mutants within the HIV-infected patient and being able to identify these different mutants is important for efficient drug administration. With the view of identifying the mutants, we aim at analyzing short deep sequencing data called reads. From a statistical perspective, the analysis of such data can be regarded as a nonstandard clustering problem due to missing pairwise similarity measures between non-overlapping reads. To overcome this problem we propagate a Dirichlet Process Mixture Model by sequentially updating the prior information from successive local analyses. The model is verified using both simulated and real sequencing data.
Keywords :
bioinformatics; data analysis; diseases; drugs; genetics; inference mechanisms; medical computing; mixture models; molecular biophysics; molecular configurations; sequences; statistical analysis; HIV haplotype identification; HIV haplotype inference; HIV-infected patient; drug administration; drug-resistant mutant identification; missing pairwise similarity measures; nonoverlapping reads; nonstandard clustering problem; propagating Dirichlet process mixture model; sequencing reads; sequential information updating; short deep sequencing data analysis; statistical analysis; Analytical models; Data models; Drugs; Human immunodeficiency virus; Sequential analysis; 454 sequencing reads; HIV; MCMC; haplotype inference;
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
DOI :
10.1109/TCBB.2013.145