DocumentCode
48748
Title
Detection of Replication Origin Sites in Herpesvirus Genomes by Clustering and Scoring of Palindromes with Quadratic Entropy Measures
Author
Rizvi, Ahsan Z. ; Bhattacharya, C.
Author_Institution
Indian Inst. of Technol., Indore, Indore, India
Volume
11
Issue
6
fYear
2014
fDate
Nov.-Dec. 1 2014
Firstpage
1108
Lastpage
1118
Abstract
Replication in herpesvirus genomes is a major concern of public health as they multiply rapidly during the lytic phase of infection that cause maximum damage to the host cells. Earlier research has established that sites of replication origin are dominated by high concentration of rare palindrome sequences of DNA. Computational methods are devised based on scoring to determine the concentration of palindromes. In this paper, we propose both extraction and localization of rare palindromes in an automated manner. Discrete Cosine Transform (DCT-II), a widely recognized image compression algorithm is utilized here to extract palindromic sequences based on their reverse complimentary symmetry property of existence. We formulate a novel approach to localize the rare palindrome clusters by devising a Minimum Quadratic Entropy (MQE) measure based on the Renyi´s Quadratic Entropy (RQE) function. Experimental results over a large number of herpesvirus genomes show that the RQE based scoring of rare palindromes have higher order of sensitivity, and lesser false alarm in detecting concentration of rare palindromes and thereby sites of replication origin.
Keywords
DNA; bioinformatics; cellular biophysics; data compression; discrete cosine transforms; diseases; entropy; feature extraction; genomics; image sequences; microorganisms; molecular biophysics; molecular configurations; pattern clustering; DNA; RQE based scoring; Renyi quadratic entropy function; computational methods; discrete cosine transform; herpesvirus genome detection; herpesvirus genome replication origin sites; host cells; infection; lytic phase; maximum damage; minimum quadratic entropy; palindromes clustering; palindromes concentration; palindromes scoring; palindromic sequences; public health; quadratic entropy measures; rare palindrome clusters; rare palindrome sequences; rare palindromes extraction; rare palindromes localization; recognized image compression algorithm; reverse complimentary symmetry property; Bioinformatics; Clustering algorithms; Computational biology; DNA; Entropy; Genomics; Renyi???s quadratic entropy (RQE); Replication origin sites; discrete cosine transform (DCT-II); herpesvirus; minimum quadratic entropy (MQE); sensitivity; specificity;
fLanguage
English
Journal_Title
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher
ieee
ISSN
1545-5963
Type
jour
DOI
10.1109/TCBB.2014.2330622
Filename
6832521
Link To Document