DocumentCode :
2080026
Title :
Fuzzy matching of Web queries to structured data
Author :
Cheng, Tao ; Lauw, Hady W. ; Paparizos, Stelios
Author_Institution :
Univ. of Illinois, Urbana, IL, USA
fYear :
2010
fDate :
1-6 March 2010
Firstpage :
713
Lastpage :
716
Abstract :
Recognizing the alternative ways people use to reference an entity, is important for many Web applications that query structured data. In such applications, there is often a mismatch between how content creators describe entities and how different users try to retrieve them. In this paper, we consider the problem of determining whether a candidate query approximately matches with an entity. We propose an off-line, data-driven, bottom-up approach that mines query logs for instances where Web content creators and Web users apply a variety of strings to refer to the same Web pages. This way, given a set of strings that reference entities, we generate an expanded set of equivalent strings for each entity. The proposed method is verified with experiments on real-life data sets showing that we can dramatically increase the queries that can be matched.
Keywords :
Internet; data mining; fuzzy set theory; query processing; Web pages; Web queries; candidate query; fuzzy matching; query log mining; structured data; Africa; Content based retrieval; Databases; Earth Observing System; Motion pictures; Search engines; Skull; Uniform resource locators; Web pages; Web search;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering (ICDE), 2010 IEEE 26th International Conference on
Conference_Location :
Long Beach, CA
Print_ISBN :
978-1-4244-5445-7
Electronic_ISBN :
978-1-4244-5444-0
Type :
conf
DOI :
10.1109/ICDE.2010.5447817
Filename :
5447817
Link To Document :
بازگشت