DocumentCode :
2036361
Title :
Lossless Segment Based DNA Compression
Author :
Mridula, T.V. ; Samuel, Philip
Author_Institution :
Dept. of Comput. Sci., Cochin Univ. of Sci. & Technol., Cochin, India
Volume :
2
fYear :
2011
fDate :
8-10 April 2011
Firstpage :
298
Lastpage :
302
Abstract :
This paper introduces a new Lossless Segment Based DNA Compression (LSBD) method for compressing the DNA sequences. It stores the individual gene position in a compressed file. Since LSBD method performs a gene wise compression, further processing of compressed data reduces memory usage. The biggest advantage of this algorithm is that it enables part by part decompression and can work on any sized data. Here the method identifies individual gene location and then constructs triplets that are mapped to an eight bit number. The individual gene information is stored in a pointer table and a pointer is provided to corresponding location in the compressed file. The LSBD technique appropriately compresses the non-base characters and performs well on repeating sequences.
Keywords :
biocomputing; data compression; DNA sequences; gene repetition; lossless segment based DNA compression; Algorithm design and analysis; Approximation algorithms; Bioinformatics; Compression algorithms; DNA; Genomics; Memory management; DNA compression; Factor repetition; Gene repetition; LSBD Method; Non-base characters;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Electronics Computer Technology (ICECT), 2011 3rd International Conference on
Conference_Location :
Kanyakumari
Print_ISBN :
978-1-4244-8678-6
Electronic_ISBN :
978-1-4244-8679-3
Type :
conf
DOI :
10.1109/ICECTECH.2011.5941705
Filename :
5941705
Link To Document :
بازگشت