Title :
An open data cleaning framework based on semantic rules for Continuous Auditing
Author :
Ye, Huanzhuo ; Wu, Di ; Chen, Shuai
Author_Institution :
Dept. of Inf., Zhongnan Univ. of Econ. & Law, Wuhan, China
Abstract :
Continuous Auditing (CA) is an important form of computer-assisted audit techniques (CAATs), which is also an active research domain in audit field. Because of the strict requirements for data quality for Continuous Auditing, a semantic rule-based open data cleaning framework (ODCF) with self-learning function is designed in this paper, which can improve the accuracy and adaptability of data cleaning. The semantic rules were used in the framework to recognize the hierarchy semantic and dependence among the fields. Firstly, introduce the structure, components and workflow of the framework in detail. And then describe the cooperation of various components, which improves cleaning efficiency, through the processing of various types of dirty data. Finally, analyze the performance of the framework, and point out its adaptability and universality to use. This is an open framework for data cleaning with good scalability, which will becomes more and more perfect and powerful with the success of data cleaning practice in different fields.
Keywords :
auditing; data handling; cleaning efficiency; computer-assisted audit techniques; continuous auditing; data quality; hierarchy semantic; open data cleaning framework; open framework; self-learning function; semantic rules; Cleaning; Companies; Data analysis; Data processing; Expert systems; Industrial relations; Libraries; Performance analysis; Power generation economics; Scalability; continuous auditing; data cleaning; semantic rule warehouse;
Conference_Titel :
Computer Engineering and Technology (ICCET), 2010 2nd International Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4244-6347-3
DOI :
10.1109/ICCET.2010.5485262