DocumentCode
3262922
Title
A majority rules approach to data mining
Author
Roiger, Richard J. ; Azarbod, Cyrus ; Sant, Rajiv R.
Author_Institution
Dept. of Comput. Sci., Mankato State Univ., MN, USA
fYear
35765
fDate
8-10 Dec1997
Firstpage
100
Lastpage
107
Abstract
Knowledge discovery in databases (KDD) offers a methodology for developing tools to extract meaningful knowledge from large volumes of data. We propose a generalized KDD model for supervised training. A main step in this process, data mining, involves the creation of a classification structure that is representative of the concept classes identified in the data set. Data mining incorporates learning which may be supervised or unsupervised and often uses statistical as well as heuristic (machine learning) techniques. Previous research has shown that different supervised models perform better under certain conditions. We tested the extent of overlap of instance classifications between five supervised models in two real world domains. Experimental results showed that in one domain all five models classified 75.8% of the instances identically, correct or incorrect. In the second domain, the corresponding figure was 63.3%. The amount of agreement between models can be used to help determine the nature of the domain and the applicability of a supervised learning approach. We extend the above experimental result and propose a multi model majority rules (MR) data mining technique to learn about the nature of a given domain. We conclude with directions for future work
Keywords
deductive databases; knowledge acquisition; knowledge based systems; learning (artificial intelligence); classification structure; data mining; generalized KDD model; instance classifications; knowledge discovery in databases; knowledge extraction; machine learning; majority rules approach; multi model majority rules; real world domains; supervised models; supervised training; Computer science; Couplings; Data mining; Data warehouses; Databases; Finance; Machine learning; Supervised learning; Testing; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Information Systems, 1997. IIS '97. Proceedings
Conference_Location
Grand Bahama Island
Print_ISBN
0-8186-8218-3
Type
conf
DOI
10.1109/IIS.1997.645197
Filename
645197
Link To Document