• DocumentCode
    3620792
  • Title

    Finding reducts without building the discernibility matrix

  • Author

    M. Korzen;S. Jaroszewicz

  • Author_Institution
    Fac. of Comput. Sci. & Inf. Syst., Tech. Univ. Szczecin, Poland
  • fYear
    2005
  • fDate
    6/27/1905 12:00:00 AM
  • Firstpage
    450
  • Lastpage
    455
  • Abstract
    We present algorithms for fast generation of short reducts which avoid building the discernibility matrix explicitly. We show how information obtained from this matrix can be obtained based only on the distributions of attribute values. Since the size of discernibility matrix is quadratic in the number of data records, not building the matrix explicitly gives a very significant speedup and makes it possible to find reducts even in very large databases. Algorithms are given for both absolute and relative reducts. Experiments show that our approach outperforms other reduct finding algorithms. Furthermore it is shown that many heuristic reduct finding algorithms using the discernibility matrix in fact select attributes based on their Gini index. A new definition of conditional Gini index is presented, motivated by reduct finding heuristics.
  • Keywords
    "Information systems","Databases","Heuristic algorithms","Computer science","Intelligent systems"
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Systems Design and Applications, 2005. ISDA ´05. Proceedings. 5th International Conference on
  • Print_ISBN
    0-7695-2286-6
  • Type

    conf

  • DOI
    10.1109/ISDA.2005.45
  • Filename
    1578826