DocumentCode
3234528
Title
Research of error data detection algorithm based on rules
Author
Zhong-Bin Zhang ; Yu-Hua Zhou ; Yong-zhi Liu
Author_Institution
Dept. of Equip. Command & Manage., Acad. of the Armored Force Eng., Beijing, China
fYear
2011
fDate
27-29 May 2011
Firstpage
159
Lastpage
163
Abstract
Data entry errors, improper integration, data environment changes, etc., will affect the quality of the data. Among them, the error data is the most serious data quality problems. To clean up the error data, to play the role of information systems and improve the quality of the data, the detection method of error data based on rules is studied, the detection process is analyzed, a common set of detection rules is established, how the SQL statements into the rules is discussed, the detection algorithm is achieved and carried out a series of optimization. This method is easy, its rules are simple, and the efficiency and the false discovery rate are high after optimization. Therefore, this approach may well be a good method of data cleaning.
Keywords
SQL; error handling; information systems; optimisation; SQL statement; data cleaning; data entry error; data quality; detection rules; error data detection algorithm; false discovery rate; information system; optimization; Optimization; data Detection; data cleaning; error data;
fLanguage
English
Publisher
ieee
Conference_Titel
Communication Software and Networks (ICCSN), 2011 IEEE 3rd International Conference on
Conference_Location
Xi´an
Print_ISBN
978-1-61284-485-5
Type
conf
DOI
10.1109/ICCSN.2011.6014412
Filename
6014412
Link To Document