Title :
Improving Relational Database Quality Based on Adaptive Learning Method for Estimating Null Value
Author :
Cheng, Ching-Hsue ; Wei, Liang-Ying ; Lin, Tzu-Cheng
Author_Institution :
Nat. Yunlin Univ. of Sci. & Technol., Touliu
Abstract :
The poor data quality can cause a host of negative results, including lost information, operational inefficiencies in a database system and much more. Therefore, for many applications in the area of data analysis processing, data preprocessing is an essential step. In addition, the handling of null values is the major task of the data quality. In this paper, we present a new method for estimating null values in relational database systems based on adaptive learning techniques. The proposed method utilizes clustering algorithms to cluster data, and calculate coefficient values between different attributes by generating minimum average error. To verify our method, this paper utilizes two database ( Waugh ´s database and Employee relational database), and Mean of Absolute Error Rate (MAER) as evaluation criterion to compare with the listing methods; it is shown that our proposed method is better than the listing methods for estimating null values in relational database systems.
Keywords :
adaptive systems; data analysis; learning systems; relational databases; Waugh database; adaptive learning method; clustering algorithms; data analysis processing; database system; employee relational database; estimating null value; mean of absolute error rate; poor data quality; relational database quality; Clustering algorithms; Data mining; Data preprocessing; Database systems; Error analysis; Learning systems; Null value; Partitioning algorithms; Relational databases; Remuneration;
Conference_Titel :
Innovative Computing, Information and Control, 2007. ICICIC '07. Second International Conference on
Conference_Location :
Kumamoto
Print_ISBN :
0-7695-2882-1
DOI :
10.1109/ICICIC.2007.350