• DocumentCode
    49025
  • Title

    Fast Entropic Profiler: An Information Theoretic Approach for the Discovery of Patterns in Genomes

  • Author

    Comin, Matteo ; Antonello, Morris

  • Author_Institution
    Dept. of Inf. Eng., Univ. of Padova, Padua, Italy
  • Volume
    11
  • Issue
    3
  • fYear
    2014
  • fDate
    May-June 2014
  • Firstpage
    500
  • Lastpage
    509
  • Abstract
    Information theory has been used for quite some time in the area of computational biology. In this paper we present a pattern discovery method, named Fast Entropic Profiler, that is based on a local entropy function that captures the importance of a region with respect to the whole genome. The local entropy function has been introduced by Vinga and Almeida in , here we discuss and improve the original formulation. We provide a linear time and linear space algorithm called Fast Entropic Profiler ( FastEP), as opposed to the original quadratic implementation. Moreover we propose an alternative normalization that can be also efficiently implemented. We show that FastEP is suitable for large genomes and for the discovery of patterns with unbounded length. FastEP is available at http://www.dei.unipd.it/~ciompin/main/FastEP.html.
  • Keywords
    bioinformatics; entropy; genomics; pattern recognition; Fast Entropic Profiler; FastEP; computational biology; genome pattern discovery; information theoretic approach; linear space algorithm; linear time algorithm; local entropy function; normalization; pattern discovery method; Bioinformatics; Computational biology; DNA; Entropy; Genomics; Information theory; Pattern discovery; computational biology; information theory; local entropy;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2013.2297924
  • Filename
    6702482