DocumentCode
267014
Title
Optimizing DNA sequences using Tetra-nucleotide RankList
Author
Ajwad, Rasif ; Hossain, Syed Nayem ; Hasan, M. Anwar
Author_Institution
Dept. of Comput. Sci. & Eng., Islamic Univ. of Technol., Gazipur, Bangladesh
fYear
2014
fDate
10-12 April 2014
Firstpage
1
Lastpage
4
Abstract
Recent advancement in the field of life science has caused the generation of massive amount of genomic data. Storing such huge amount of data require a lot of memory. However, by using efficient algorithm we can optimize the size of the dataset and save memory storage. In this paper we have proposed an algorithm which uses Tetra-nucleotide RankList for optimizing the storage requirement for storing DNA sequences in database which can be easily retrieved in a time efficient manner. Tetra-nucleotide RankList has been generated by testing several DNA sequences to confirm the uniformity of the RankList over all DNA sequences. The algorithm has been applied on the genome sequences of different bacteria and viruses and the space for storing those DNA sequences has been reduced up to 30%.
Keywords
DNA; biocomputing; genomics; storage allocation; Tetra-nucleotide RankList; genome sequences; genomic data; memory storage; optimizing DNA sequences; Algorithm design and analysis; Bioinformatics; Biological cells; DNA; Databases; Genomics; Training; DNA sequences; RankList; memory storage; tetra-nucleotide;
fLanguage
English
Publisher
ieee
Conference_Titel
Electrical Engineering and Information & Communication Technology (ICEEICT), 2014 International Conference on
Conference_Location
Dhaka
Print_ISBN
978-1-4799-4820-8
Type
conf
DOI
10.1109/ICEEICT.2014.6919125
Filename
6919125
Link To Document