DocumentCode
3463674
Title
Rule extraction based on rough set theory and its application to medical data analysis
Author
Nakayama, Hirotaka ; Hattori, Yuichi ; Ishii, Renichi
Author_Institution
Dept. of Appl. Math., Konan Univ., Kobe, Japan
Volume
5
fYear
1999
fDate
1999
Firstpage
924
Abstract
Knowledge discovery from databases is an important theme not only in medical data analysis but also in many other practical fields. Recently, rough set theory has been attracting many researchers´ attention as an effective method for knowledge discovery. The main idea of rough set theory is to obtain rules which are as simple as possible from the given database by reducing the database while holding the original degree of consistency. To this end, two kinds of approximation sets to the original rough set are introduced: the lower approximation provides an inevitable rule, while the upper approximation provides a possible rule. In addition, a method is suggested for reducing the number of attributes while keeping the degree of consistency of the database. The aim of the paper is to apply such techniques to medical data analysis. Traditional rough set theory can treat only categorical data. Unfortunately, however, many medical data have continuous numerical values. In order to convert continuous numerical data into categorical data, we apply an ID3-like technique on the basis of information quantity. In addition, an idea for utilizing inconsistent data is suggested by defining the quality of boundary. This provides us with more information on which attributes are important, and simpler rules from databases. Finally, those techniques are applied for finding rules which cause MA (macroangiopathy) to NIDDM (Non-Insulin Dependent Diabetes Mellitus) patients
Keywords
data analysis; data mining; deductive databases; medical information systems; rough set theory; ID3-like technique; MA; NIDDM; Non-Insulin Dependent Diabetes Mellitus patients; approximation sets; categorical data; continuous numerical values; inconsistent data; inevitable rule; information quantity; knowledge discovery; knowledge discovery from databases; lower approximation; macroangiopathy; medical data analysis; possible rule; rough set theory; rule extraction; upper approximation; Data analysis; Data mining; Diabetes; Mathematics; Rough sets; Set theory;
fLanguage
English
Publisher
ieee
Conference_Titel
Systems, Man, and Cybernetics, 1999. IEEE SMC '99 Conference Proceedings. 1999 IEEE International Conference on
Conference_Location
Tokyo
ISSN
1062-922X
Print_ISBN
0-7803-5731-0
Type
conf
DOI
10.1109/ICSMC.1999.815677
Filename
815677
Link To Document