Title : 
Random-access compression of annotated DNA sequences
         
        
            Author : 
Korodi, Gergely ; Tabus, Ioan
         
        
            Author_Institution : 
Inst. of Signal Process., Tampere Univ. of Technol., Tampere
         
        
        
        
        
        
            Abstract : 
This article investigates the efficiency of randomly accessible coding for annotated genome files and compares it to universal coding. The result is an encoder which has excellent compression efficiency on annotated genome sequences, provides instantaneous access to functional elements in the file, and thus it serves as a basis for further applications, such as indexing and searching for specified feature entries.
         
        
            Keywords : 
DNA; biology computing; data compression; encoding; file organisation; random processes; sequences; encoder; functional element; random-access DNA sequence compression; Bioinformatics; Biological information theory; Biomedical signal processing; DNA; Genomics; Indexing; Information retrieval; Probability distribution; Sequences; Training data;
         
        
        
        
            Conference_Titel : 
Genomic Signal Processing and Statistics, 2006. GENSIPS '06. IEEE International Workshop on
         
        
            Conference_Location : 
College Station, TX
         
        
            Print_ISBN : 
1-4244-0384-7
         
        
            Electronic_ISBN : 
1-4244-0385-5
         
        
        
            DOI : 
10.1109/GENSIPS.2006.353160