• DocumentCode
    1362392
  • Title

    What is Unequal among the Equals? Ranking Equivalent Rules from Gene Expression Data

  • Author

    Cai, Ruichu ; Tung, Anthony K H ; Zhang, Zhenjie ; Hao, Zhifeng

  • Author_Institution
    Fac. of Comput. Sci., Guangdong Univ. of Technol., Guangzhou, China
  • Volume
    23
  • Issue
    11
  • fYear
    2011
  • Firstpage
    1735
  • Lastpage
    1747
  • Abstract
    In previous studies, association rules have been proven to be useful in classification problems over high dimensional gene expression data. However, due to the nature of such data sets, it is often the case that millions of rules can be derived such that many of them are covered by exactly the same set of training tuples and thus have exactly the same support and confidence. Ranking and selecting useful rules from such equivalent rule groups remain an interesting and unexplored problem. In this paper, we look at two interestingness measures for ranking the interestingness of rules within equivalent rule group: Max-Subrule-Conf and Min-Subrule-Conf. Based on these interestingness measures, an incremental Apriori-like algorithm is designed to select more interesting rules from the lower bound rules of the group. Moreover, we present an improved classification model to fully exploit the potential of the selected rules. Our empirical studies on our proposed methods over five gene expression data sets show that our proposals improve both the efficiency and effectiveness of the rule extraction and classifier construction over gene expression data sets.
  • Keywords
    biology computing; data mining; pattern classification; Max-Subrule-Conf; Min-Subrule-Conf; association rules; classification problems; classifier construction; equivalent rule ranking; gene expression data; incremental apriori like algorithm; rule extraction; rule groups; Accuracy; Association rules; Gene expression; Itemsets; Lattices; Upper bound; Association rules; gene expression data; incremental mining framework; robust classification.;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2010.207
  • Filename
    5611519