Title :
Optimizing DNA sequences using Tetra-nucleotide RankList
Author :
Ajwad, Rasif ; Hossain, Syed Nayem ; Hasan, M. Anwar
Author_Institution :
Dept. of Comput. Sci. & Eng., Islamic Univ. of Technol., Gazipur, Bangladesh
Abstract :
Recent advances in the life sciences have generated massive amounts of genomic data, and storing data at this scale requires a large amount of memory. An efficient algorithm can reduce the size of the dataset and save storage. In this paper we propose an algorithm that uses a Tetra-nucleotide RankList to reduce the storage required for DNA sequences in a database, while keeping the sequences easy to retrieve in a time-efficient manner. The Tetra-nucleotide RankList was generated by testing several DNA sequences to confirm the uniformity of the RankList across all DNA sequences. The algorithm has been applied to the genome sequences of different bacteria and viruses, and the space needed to store those DNA sequences has been reduced by up to 30%.
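The abstract does not spell out how the RankList is built or applied, so the following is only a minimal sketch of one plausible reading: count tetra-nucleotides (4-base words) over a set of training sequences, rank all 256 of them by frequency, and store each 4-base word as its rank. The function names, the non-overlapping windowing, the alphabetical tie-break, and the one-byte-per-rank encoding are all assumptions made for illustration, not the authors' method.

```python
from collections import Counter
from itertools import product


def build_ranklist(training_seqs):
    """Rank all 256 tetra-nucleotides by frequency (assumed scheme)."""
    counts = Counter()
    for seq in training_seqs:
        seq = seq.upper()
        # Assumption: non-overlapping 4-base windows; trailing bases are skipped.
        for i in range(0, len(seq) - 3, 4):
            counts[seq[i:i + 4]] += 1
    all_tetra = ["".join(p) for p in product("ACGT", repeat=4)]
    # Most frequent first; ties broken alphabetically so the order is stable.
    return sorted(all_tetra, key=lambda t: (-counts[t], t))


def encode(seq, ranklist):
    """Replace each 4-base word with its rank (one byte); keep leftovers as text."""
    rank = {t: i for i, t in enumerate(ranklist)}
    seq = seq.upper()
    n = len(seq) - len(seq) % 4
    body = bytes(rank[seq[i:i + 4]] for i in range(0, n, 4))
    return body, seq[n:]


def decode(body, tail, ranklist):
    """Invert encode(): look each rank up in the RankList and re-append the tail."""
    return "".join(ranklist[b] for b in body) + tail


if __name__ == "__main__":
    sample = "ACGTACGTTTGAAC"
    ranks = build_ranklist([sample])
    body, tail = encode(sample, ranks)
    print(len(sample), "bases ->", len(body), "bytes + tail", repr(tail))
    assert decode(body, tail, ranks) == sample
```

This sketch only handles the bases A, C, G and T, and it does not attempt to reproduce the reduction reported in the abstract. A frequency ranking would matter most if it were followed by a variable-length code that gives shorter codes to higher-ranked words; whether the paper does this is not stated here.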
Keywords :
DNA; biocomputing; genomics; storage allocation; Tetra-nucleotide RankList; genome sequences; genomic data; memory storage; optimizing DNA sequences; Algorithm design and analysis; Bioinformatics; Biological cells; Databases; Training; DNA sequences; RankList; tetra-nucleotide
Conference_Title :
Electrical Engineering and Information & Communication Technology (ICEEICT), 2014 International Conference on
Conference_Location :
Dhaka
Print_ISBN :
978-1-4799-4820-8
DOI :
10.1109/ICEEICT.2014.6919125