• DocumentCode
    3097127
  • Title

    Intron Identification Approaches Based on Weighted Features and Fuzzy Decision Trees

  • Author

    Huang, Yin-Fu ; Liang, Ching Ping ; Liou, Sing-Wu

  • Author_Institution
    Grad. Sch. of Comput. Sci. & Inf. Eng., Nat. Yunlin Univ. of Sci. & Technol. Touliu, Touliu, Taiwan
  • fYear
    2010
  • fDate
    18-20 June 2010
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Current computational predictions of splice sites largely depend on the sequence patterns of known intronic sequence features (ISFs) described in the classical intron definition model (IDM). The computation-oriented IDM (CO-IDM) clearly provides more specific and concrete information for describing intron flanks of splice sites (IFSSs). In the paper, we proposed a novel approach of fuzzy decision trees (FDTs) which utilize 1) weighted ISFs of twelve uni-frame patterns (UFPs) and forty-five multi-frame patterns (MFPs) and 2) gain ratios to improve the performances in identifying an intron. First, we fuzzified extracted features from genomic sequences using membership functions with an unsupervised self-organizing map (SOM) technique. Then, we brought in different viewpoints of globally weighting and crossly referring in generating fuzzy rules which are interpretable and useful for biologists to verify whether a sequence is an intron or not. Finally, the experimental results revealed the effectiveness of the proposed method in improving the identification accuracy. Besides, we also implemented an on-line intronic identifier to infer an unknown genomic sequence.
  • Keywords
    biology computing; decision trees; feature extraction; fuzzy set theory; genomics; macromolecules; pattern classification; self-organising feature maps; computational prediction; features extraction; fuzzy decision tree; fuzzy rules generation; genomic sequence; intron identification approach; intronic sequence feature; membership function; multiframe pattern; splice site; uniframe pattern; unsupervised self organizing map technique; weighted feature; Bioinformatics; Biology computing; Computational modeling; Computer science; Concrete; Decision trees; Diversity reception; Genomics; Predictive models; Pulse width modulation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedical Engineering (iCBBE), 2010 4th International Conference on
  • Conference_Location
    Chengdu
  • ISSN
    2151-7614
  • Print_ISBN
    978-1-4244-4712-1
  • Electronic_ISBN
    2151-7614
  • Type

    conf

  • DOI
    10.1109/ICBBE.2010.5515314
  • Filename
    5515314