• DocumentCode
    3230408
  • Title

    An IDC-based algorithm for efficient homology filtration with guaranteed seriate coverage

  • Author

    Lee, Hsiao Ping ; Tsai, Yin Te ; Shih, Ching Hua ; Sheu, Tzu Fang ; Tang, Chuan Yi

  • Author_Institution
    Dept. of Comput. Sci., Nat. Tsing Hua Univ., Taiwan
  • fYear
    2004
  • fDate
    19-21 May 2004
  • Firstpage
    395
  • Lastpage
    402
  • Abstract
    Homology search within genomic databases is fundamental to biological knowledge discovery. As databases grow exponentially in size and access volume, the filtration approach, which discards impossible homology candidates to reduce the time spent on homology verification, becomes increasingly important in bioinformatics. Most known gram-based filtration approaches in the literature, such as QUASAR, have limited error tolerance and may produce more false positives. In this paper, we present an IDC-based lossless filtration algorithm with guaranteed seriate coverage and error tolerance for efficient homology discovery. In our method, the original task of extracting homologies at requested seriate coverage and error levels is transformed into a longest increasing subsequence problem with range constraints, for which we propose an efficient algorithm. The experimental results show that the method significantly outperforms QUASAR. At some comparable sensitivity levels, our homology filter makes discovery more than three orders of magnitude faster than QUASAR and more than four orders of magnitude faster than exhaustive search.
  • Keywords
    biology computing; data mining; database indexing; genetics; molecular biophysics; proteins; IDC-based algorithm; biological knowledge discovery; error levels; genomic databases; homology extraction; homology filtration; seriate coverage; Bioinformatics; Computer science; DNA; Databases; Evolution (biology); Filters; Filtration; Genomics; Proteins; Sequences;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Bioengineering, 2004. BIBE 2004. Proceedings. Fourth IEEE Symposium on
  • Print_ISBN
    0-7695-2173-8
  • Type

    conf

  • DOI
    10.1109/BIBE.2004.1317370
  • Filename
    1317370
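
The abstract describes a reduction to a longest increasing subsequence (LIS) problem with range constraints. As a point of reference only, the sketch below shows the classic unconstrained LIS computed in O(n log n) time with binary search. It is not the paper's algorithm: the range constraints are omitted, and the function name and sample input are hypothetical.

```python
from bisect import bisect_left


def longest_increasing_subsequence(values):
    """Return one longest strictly increasing subsequence of `values`.

    Classic patience-style method: tail_vals[k] holds the smallest
    possible last value of an increasing subsequence of length k + 1.
    """
    tail_idx = []               # index in `values` of each current tail
    tail_vals = []              # the tail values themselves, kept sorted
    prev = [-1] * len(values)   # back-pointers used for reconstruction

    for i, v in enumerate(values):
        k = bisect_left(tail_vals, v)    # first tail that is >= v
        if k == len(tail_vals):
            tail_idx.append(i)           # v extends the longest run so far
            tail_vals.append(v)
        else:
            tail_idx[k] = i              # v is a smaller tail for length k + 1
            tail_vals[k] = v
        prev[i] = tail_idx[k - 1] if k > 0 else -1

    # Walk the back-pointers from the tail of the longest subsequence.
    result = []
    i = tail_idx[-1] if tail_idx else -1
    while i != -1:
        result.append(values[i])
        i = prev[i]
    return result[::-1]


if __name__ == "__main__":
    # Hypothetical input: database offsets of matched grams, in query order.
    print(longest_increasing_subsequence([3, 1, 4, 1, 5, 9, 2, 6]))
```

A range-constrained variant would presumably also restrict which elements may follow one another, which this unconstrained sketch does not attempt.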