Title : 
Poster: ViSpA: Viral spectrum assembling method
         
        
            Author : 
Astrovskaya, Irina ; Tork, Bassam ; Mangul, Serghei ; Westbrooks, Kelly ; Mandoiu, Ion ; Balfe, Peter ; Zelikovsky, Alex
         
        
            Author_Institution : 
Dept. of Comput. Sci., Georgia State Univ., Atlanta, GA, USA
         
        
        
        
        
        
            Abstract : 
Like many RNA viruses, Hepatitis C virus (HCV) exists as a set of closely related sequences (quasispecies). The diversity of the quasispecies sequences can explain vaccines failures and virus resistance to existing therapies. Since the original software of next-generation sequencing systems assumes a single genome, there is a need for a new assembler that infers viral population in a host. Thus, the paper focuses on Quasispecies Spectrum Reconstruction (QSR) Problem: given a collection of 454 pyrosequencing reads taken from a sample quasispecies population, reconstruct the quasispecies spectrum, i.e., the set of sequences and the relative frequency of each sequence in the sample population.This poster introduces the ViSpA method that significantly extends previous approach by handling contaminated reads and overlaps with partial agreement between reads, by assembling haplotypes from per-vertex max-bandwidth paths via mutation-based clustering, and by estimating assemblies´ frequencies via EM. A procedure to fix systematic 454 errors in homopolymers if they happen in the coding region is suggested.
         
        
            Keywords : 
diseases; macromolecules; medical computing; microorganisms; molecular biophysics; molecular configurations; polymers; RNA virus; ViSpA; closely related sequences; coding region; haplotypes; hepatitis C virus; homopolymers; mutation-based clustering; next-generation sequencing systems; per-vertex max-bandwidth paths; pyrosequencing; quasispecies; quasispecies spectrum reconstruction; vaccines failures; viral population; viral spectrum assembling method; virus resistance; Assembly; Computer science; Electronic mail; Frequency estimation; Proteins; Software; Systematics; Next-generation sequencing; expectation maximization; viral assembling;
         
        
        
        
            Conference_Titel : 
Computational Advances in Bio and Medical Sciences (ICCABS), 2011 IEEE 1st International Conference on
         
        
            Conference_Location : 
Orlando, FL
         
        
            Print_ISBN : 
978-1-61284-851-8
         
        
        
            DOI : 
10.1109/ICCABS.2011.5729888