DocumentCode :
3752260
Title :
A novel DNA sequence compression scheme using both intra and inter sequences correlation
Author :
K. O. Cheng;N. F. Law;W. C. Siu
Author_Institution :
Centre for Signal Processing, Department of Electronic and Information Engineering, The Hong Kong Polytechnic University, Hong Kong
fYear :
2015
Firstpage :
237
Lastpage :
241
Abstract :
Classical DNA sequence compression algorithms consider only intra-sequence similarity, i.e., similar subsequences within the same DNA sequence are found and encoded together. In this work, in addition to intra-sequence similarity, we exploit inter-sequence similarity: similar subsequences are searched for both within the DNA sequence itself and in other reference sequences. Hence, highly similar sequences from the same population, or partially similar chromosome sequences of the same species, can be compressed together to reduce storage space. Experimental results show that the proposed scheme achieves good compressibility for both partially similar chromosome sequences and highly similar population sequences.
Keywords :
"Biological cells","DNA","Encoding","Sociology","Statistics","Decoding","Compression algorithms"
Publisher :
IEEE
Conference_Title :
Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2015 Asia-Pacific
Type :
conf
DOI :
10.1109/APSIPA.2015.7415512
Filename :
7415512