DocumentCode :
1694496
Title :
Characterization and Resolution of Incompleteness in (World-Wide-Web) Information Extraction
Author :
Feilmayr, Christina
Author_Institution :
Inst. of Applic. Oriented Knowledge Process. (FAW), Johannes Kepler Univ. Linz, Linz, Austria
fYear :
2012
Firstpage :
241
Lastpage :
245
Abstract :
Low information quality is one of the reasons why information extraction initiatives fail. Incomplete information has a pervasive negative impact on downstream processing steps. This work addresses this problem with a novel information extraction approach, which integrates data mining and information extraction methods into a single complementary approach in order to benefit from their respective advantages and reduce incompleteness in information extraction. In this context, various types of incompleteness are identified and an approach to their automatic detection is presented. Further, a prototype generic framework that incorporates the complementarity approach is proposed.
Keywords :
Internet; data mining; information retrieval; World-Wide-Web; complementarity approach; complementary approach; data mining; downstream processing steps; incompleteness characterization; incompleteness resolution; information extraction approach; low information quality; prototype generic framework; Accuracy; Conferences; Context; Data mining; Information retrieval; Null value; Semantics; Data Mining; Information Extraction; Information Integration; Information Quality;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Database and Expert Systems Applications (DEXA), 2012 23rd International Workshop on
Conference_Location :
Vienna
ISSN :
1529-4188
Print_ISBN :
978-1-4673-2621-6
Type :
conf
DOI :
10.1109/DEXA.2012.35
Filename :
6327433
Link To Document :
بازگشت