• DocumentCode
    3234528
  • Title

    Research of error data detection algorithm based on rules

  • Author

    Zhong-Bin Zhang ; Yu-Hua Zhou ; Yong-zhi Liu

  • Author_Institution
    Dept. of Equip. Command & Manage., Acad. of the Armored Force Eng., Beijing, China
  • fYear
    2011
  • fDate
    27-29 May 2011
  • Firstpage
    159
  • Lastpage
    163
  • Abstract
    Data entry errors, improper integration, data environment changes, etc., will affect the quality of the data. Among them, the error data is the most serious data quality problems. To clean up the error data, to play the role of information systems and improve the quality of the data, the detection method of error data based on rules is studied, the detection process is analyzed, a common set of detection rules is established, how the SQL statements into the rules is discussed, the detection algorithm is achieved and carried out a series of optimization. This method is easy, its rules are simple, and the efficiency and the false discovery rate are high after optimization. Therefore, this approach may well be a good method of data cleaning.
  • Keywords
    SQL; error handling; information systems; optimisation; SQL statement; data cleaning; data entry error; data quality; detection rules; error data detection algorithm; false discovery rate; information system; optimization; Optimization; data Detection; data cleaning; error data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communication Software and Networks (ICCSN), 2011 IEEE 3rd International Conference on
  • Conference_Location
    Xi´an
  • Print_ISBN
    978-1-61284-485-5
  • Type

    conf

  • DOI
    10.1109/ICCSN.2011.6014412
  • Filename
    6014412