DocumentCode :
3316068
Title :
Rough Set Theory for the Treatment of Incomplete Data
Author :
Nelwamondo, Fulufhelo V. ; Marwala, Tshilidzi
Author_Institution :
Univ. of the Witwatersrand, Witwatersrand
fYear :
2007
fDate :
23-26 July 2007
Firstpage :
1
Lastpage :
6
Abstract :
This paper proposes an algorithm based on rough set theory for missing data estimation and, for the first time, applies a rough set technique for missing data estimation to a large, real database. The premise of this work is that in large databases, missing values are more likely to be correlated with other variables observed elsewhere in the same data. Instead of approximating missing data, it might be cheaper to identify indiscernibility relations between the observed data instances and those that contain missing attributes. Results obtained using the HIV database are acceptable, with accuracies ranging from 74.7% to 100%. One drawback of this method is that it performs no extrapolation or interpolation and, as a result, can only be used if the case with missing values is similar or related to another case with more observations.
Keywords :
decision tables; rough set theory; very large databases; large database; missing data estimation; observed data instance; Data acquisition; Data communication; Data engineering; Databases; Extrapolation; GSM; Human immunodeficiency virus; Interpolation; Set theory; Transmission line measurements
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Fuzzy Systems Conference, 2007. FUZZ-IEEE 2007. IEEE International
Conference_Location :
London
ISSN :
1098-7584
Print_ISBN :
1-4244-1209-9
Electronic_ISBN :
1098-7584
Type :
conf
DOI :
10.1109/FUZZY.2007.4295389
Filename :
4295389