Title :
A low-complexity probabilistic genome assembly based on hashing functions with SNP detection
Author :
Mounsef, Naji ; Karam, Lina ; Lacroix, Zoe ; Legendre, Christophe
Author_Institution :
Dept. of Electr. Eng., Arizona State Univ., Tempe, AZ
Abstract :
This paper presents an efficient low-complexity genome assembly algorithm with the ability to detect bit errors (SNPs). A hashing function is used to reduce the complexity of the assembly process. The algorithm is tested against genomic sequences of different lengths. Its performance in terms of completeness, accuracy, and efficiency (time and space) is evaluated against Phrap, a well-known sequence assembly tool. It is shown that the proposed assembly algorithm outperforms Phrap in terms of accuracy, time, and memory.
Keywords :
DNA; biology computing; error statistics; genetics; probability; Phrap comparison; SNP detection; accuracy terms; bit errors detection; genomic sequences; hashing functions; low-complexity probabilistic genome assembly algorithm; memory aspects; performance evaluation; sequence assembly tool; time terms; Assembly; Bioinformatics; Cells (biology); DNA; Genomics; Humans; Organisms; Pharmaceuticals; Sequences; Testing;
Conference_Titel :
Genomic Signal Processing and Statistics, 2008. GENSiPS 2008. IEEE International Workshop on
Conference_Location :
Phoenix, AZ
Print_ISBN :
978-1-4244-2371-2
Electronic_ISBN :
978-1-4244-2372-9
DOI :
10.1109/GENSIPS.2008.4555676