DocumentCode :
2734258
Title :
A Comprehensive Data Quality Methodology for Web and Structured Data
Author :
Batini, Carlo ; Cabitza, Federico ; cappiello, cinzia ; Francalanci, C.
Author_Institution :
Univ. degli Studi di Milano Bicocca, Milano
fYear :
2006
fDate :
6-6 Dec. 2006
Firstpage :
448
Lastpage :
456
Abstract :
Measuring and improving data quality in an organization or in a group of interacting organizations is a complex task. Several methodologies have been developed in the past providing a basis for the definition of a complete data quality program applying assessment and improvement techniques in order to guarantee high data quality levels. Since the main limitation of existing approaches is their specialization on specific issues or contexts, this paper presents the comprehensive data quality (CDQ) methodology that aims at integrating and enhancing the phases, techniques and tools proposed by previous approaches. CDQ methodology is conceived to be at the same time complete, flexible and simple to apply. Completeness is achieved by considering existing techniques and tools and integrating them in a framework that can work in both intra and inter organizational contexts, and can be applied to all types of data. The methodology is flexible since it supports the user in the selection of the most suitable techniques and tools within each phase and in any context. Finally, CDQ is simple since it is organized in phases and each phase is characterized by a specific goal and techniques to apply. The methodology is explained by means of a running example.
Keywords :
Internet; data analysis; data integrity; World Wide Web; business data processing; comprehensive data quality methodology; data quality program; structured data; Cost benefit analysis; Databases; Design methodology; Error correction; Information analysis; Information systems; Manufacturing processes; Phase measurement; Quality management; Web pages;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Digital Information Management, 2006 1st International Conference on
Conference_Location :
Bangalore
Print_ISBN :
1-4244-0682-X
Type :
conf
DOI :
10.1109/ICDIM.2007.369236
Filename :
4221928
Link To Document :
بازگشت