DocumentCode
2646287
Title
An open data cleaning framework based on semantic rules for Continuous Auditing
Author
Ye, Huanzhuo ; Wu, Di ; Chen, Shuai
Author_Institution
Dept. of Inf., Zhongnan Univ. of Econ. & Law, Wuhan, China
Volume
2
fYear
2010
fDate
16-18 April 2010
Abstract
Continuous Auditing (CA) is an important form of computer-assisted audit techniques (CAATs), which is also an active research domain in audit field. Because of the strict requirements for data quality for Continuous Auditing, a semantic rule-based open data cleaning framework (ODCF) with self-learning function is designed in this paper, which can improve the accuracy and adaptability of data cleaning. The semantic rules were used in the framework to recognize the hierarchy semantic and dependence among the fields. Firstly, introduce the structure, components and workflow of the framework in detail. And then describe the cooperation of various components, which improves cleaning efficiency, through the processing of various types of dirty data. Finally, analyze the performance of the framework, and point out its adaptability and universality to use. This is an open framework for data cleaning with good scalability, which will becomes more and more perfect and powerful with the success of data cleaning practice in different fields.
Keywords
auditing; data handling; cleaning efficiency; computer-assisted audit techniques; continuous auditing; data quality; hierarchy semantic; open data cleaning framework; open framework; self-learning function; semantic rules; Cleaning; Companies; Data analysis; Data processing; Expert systems; Industrial relations; Libraries; Performance analysis; Power generation economics; Scalability; continuous auditing; data cleaning; semantic rule warehouse;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Engineering and Technology (ICCET), 2010 2nd International Conference on
Conference_Location
Chengdu
Print_ISBN
978-1-4244-6347-3
Type
conf
DOI
10.1109/ICCET.2010.5485262
Filename
5485262
Link To Document