• DocumentCode
    888584
  • Title

    A fuzzy approach to partitioning continuous attributes for classification

  • Author

    Au, Wai-Ho ; Chan, Keith C C ; Wong, Andrew K.C.

  • Author_Institution
    Microsoft Corp., Redmond, WA, USA
  • Volume
    18
  • Issue
    5
  • fYear
    2006
  • fDate
    5/1/2006 12:00:00 AM
  • Firstpage
    715
  • Lastpage
    719
  • Abstract
    Classification is an important topic in data mining research. To better handle continuous data, fuzzy sets are used to represent interval events in the domains of continuous attributes, allowing continuous data lying on the interval boundaries to partially belong to multiple intervals. Since the membership functions of fuzzy sets can profoundly affect the performance of the models or rules discovered, the determination of membership functions or fuzzy partitioning is crucial. In this paper, we present a new method to determine the membership functions of fuzzy sets directly from data to maximize the class-attribute interdependence and, hence, improve the classification results. In other words, it forms a fuzzy partition of the input space automatically, using an information-theoretic measure to evaluate the interdependence between the class membership and an attribute as the objective function for fuzzy partitioning. To find the optimum of the measure, it employs fractional programming. To evaluate the effectiveness of the proposed method, several real-world data sets are used in our experiments. The experimental results show that this method outperforms other well-known discretization and fuzzy partitioning approaches.
  • Keywords
    data mining; fuzzy set theory; nonlinear programming; pattern classification; data classification; data mining; discretization approach; fractional programming; fuzzy partitioning; fuzzy set; information-theoretic measure; membership function; rule discovery; Biological cells; Clustering algorithms; Data mining; Dynamic programming; Fuzzy neural networks; Fuzzy sets; Genetic algorithms; Information theory; Neural networks; Partitioning algorithms; Fuzzy partition; classification; data mining.; discretization; fuzzy sets; membership functions;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2006.70
  • Filename
    1613872