• DocumentCode
    3262922
  • Title

    A majority rules approach to data mining

  • Author

    Roiger, Richard J. ; Azarbod, Cyrus ; Sant, Rajiv R.

  • Author_Institution
    Dept. of Comput. Sci., Mankato State Univ., MN, USA
  • fYear
    35765
  • fDate
    8-10 Dec1997
  • Firstpage
    100
  • Lastpage
    107
  • Abstract
    Knowledge discovery in databases (KDD) offers a methodology for developing tools to extract meaningful knowledge from large volumes of data. We propose a generalized KDD model for supervised training. A main step in this process, data mining, involves the creation of a classification structure that is representative of the concept classes identified in the data set. Data mining incorporates learning which may be supervised or unsupervised and often uses statistical as well as heuristic (machine learning) techniques. Previous research has shown that different supervised models perform better under certain conditions. We tested the extent of overlap of instance classifications between five supervised models in two real world domains. Experimental results showed that in one domain all five models classified 75.8% of the instances identically, correct or incorrect. In the second domain, the corresponding figure was 63.3%. The amount of agreement between models can be used to help determine the nature of the domain and the applicability of a supervised learning approach. We extend the above experimental result and propose a multi model majority rules (MR) data mining technique to learn about the nature of a given domain. We conclude with directions for future work
  • Keywords
    deductive databases; knowledge acquisition; knowledge based systems; learning (artificial intelligence); classification structure; data mining; generalized KDD model; instance classifications; knowledge discovery in databases; knowledge extraction; machine learning; majority rules approach; multi model majority rules; real world domains; supervised models; supervised training; Computer science; Couplings; Data mining; Data warehouses; Databases; Finance; Machine learning; Supervised learning; Testing; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Information Systems, 1997. IIS '97. Proceedings
  • Conference_Location
    Grand Bahama Island
  • Print_ISBN
    0-8186-8218-3
  • Type

    conf

  • DOI
    10.1109/IIS.1997.645197
  • Filename
    645197