Title :
Temporal knowledge discovery in time-series medical databases based on fuzzy-rough reasoning
Author :
Tsumoto, Shusaku
Author_Institution :
Dept. of Med. Inf., Shimane Med. Univ., Izumo City, Japan
Abstract :
Since hospital information systems have been introduced in large hospitals, a large amount of data, including laboratory examinations, have been stored as temporal databases. The characteristics of these temporal databases are outlined. (1) Each record is inhomogeneous with respect to time-series, including short-term effects and long-term effects. (2) Each record has more than 1000 attributes when a patient is followed for more than one year. (3) When a patient is admitted for a long time, a large amount of data is stored in a very short term. Even medical experts cannot deal with these large databases, and the interest in mining some useful information from the data are growing. The article introduces a combination of extended moving average method and rule induction method, called CEARI to discover new knowledge in temporal databases. An extended moving average method is used for preprocessing, to deal with the irregularity of temporal data. Using several parameters for time-scaling, given by users, this moving average method generates a new database for each time scale with summarized attributes. Then, the rule induction method is applied to each new database with summarized attributes. This CEARI is applied to three medical datasets, the results of which show that interesting knowledge is discovered from each database
Keywords :
data mining; fuzzy set theory; inference mechanisms; medical expert systems; medical information systems; moving average processes; temporal databases; time series; uncertainty handling; very large databases; CEARI; data mining; extended moving average method; fuzzy-rough reasoning; hospital information systems; laboratory examinations; large databases; large hospitals; long-term effects; medical datasets; medical experts; new knowledge discovery; rule induction method; short-term effects; summarized attributes; temporal databases; temporal knowledge discovery; time-scaling; time-series medical databases; Biomedical informatics; Cities and towns; Data mining; Database systems; Hospitals; Induction generators; Information retrieval; Information systems; Laboratories;
Conference_Titel :
IFSA World Congress and 20th NAFIPS International Conference, 2001. Joint 9th
Conference_Location :
Vancouver, BC
Print_ISBN :
0-7803-7078-3
DOI :
10.1109/NAFIPS.2001.944370