DocumentCode
1991959
Title
A low-complexity probabilistic genome assembly based on hashing functions with SNP detection
Author
Mounsef, Naji ; Karam, Lina ; Lacroix, Zoe ; Legendre, Christophe
Author_Institution
Dept. of Electr. Eng., Arizona State Univ., Tempe, AZ
fYear
2008
fDate
8-10 June 2008
Firstpage
1
Lastpage
4
Abstract
This paper presents an efficient low-complexity genome assembly algorithm with the ability to detect bit errors (SNPs). A hashing function is used to reduce the complexity of the assembly process. The algorithm is tested against genomic sequences of different lengths. Its performance in terms of completeness, accuracy, and efficiency (time and space) is evaluated against Phrap, a well-known sequence assembly tool. It is shown that the proposed assembly algorithm outperforms Phrap in terms of accuracy, time, and memory.
Keywords
DNA; biology computing; error statistics; genetics; probability; Phrap comparison; SNP detection; accuracy terms; bit errors detection; genomic sequences; hashing functions; low-complexity probabilistic genome assembly algorithm; memory aspects; performance evaluation; sequence assembly tool; time terms; Assembly; Bioinformatics; Cells (biology); DNA; Genomics; Humans; Organisms; Pharmaceuticals; Sequences; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Genomic Signal Processing and Statistics, 2008. GENSiPS 2008. IEEE International Workshop on
Conference_Location
Phoenix, AZ
Print_ISBN
978-1-4244-2371-2
Electronic_ISBN
978-1-4244-2372-9
Type
conf
DOI
10.1109/GENSIPS.2008.4555676
Filename
4555676
Link To Document