DocumentCode
2734143
Title
Approximate Keyword Search in Web Search Engines
Author
Wu, Sun ; Chang, Hsien-Tsung ; Hsu, Ting-Chao ; Liu, Pei-Shin
Author_Institution
Dept. of Comput. Sci. & Inf. Eng., Nat. Chung Cheng Univ., Min-Hsiung
fYear
2006
fDate
6-6 Dec. 2006
Firstpage
404
Lastpage
411
Abstract
We present a new index method to provide approximate keyword search in search engines. Our approximate keyword matching adopts a new similarity measurement called Listance model, which is a variation of the LCS (longest common subsequence) model. Two keywords are considered approximately matched, if their Listance is no more than a predefined parameter k. Suppose the length of keywords A and B are m and n respectively, the Listance between A and B is defined to be max(m, n) - LCS(A, B). The index method uses a new data structure called LBS index (listance bounded subsequence index), which was designed to allow for very fast approximate keyword matching. In the index phase, a collection of keywords is used as a reference dictionary. We transform keywords in the Web pages into a special form to be indexed if they match one of the keywords approximately in the reference dictionary. During the query processing, a similar keyword transformation is conducted to search the approximate index. The experimental result shows that our approach is efficient and can provide approximate keyword search capability that could be practically interesting.
Keywords
data structures; search engines; Web search engines; approximate keyword search; data structure; longest common subsequence model; query processing; similarity measurement; Computer science; Dictionaries; Keyword search; Los Angeles Council; Pattern matching; Query processing; Search engines; Sun; Web pages; Web search;
fLanguage
English
Publisher
ieee
Conference_Titel
Digital Information Management, 2006 1st International Conference on
Conference_Location
Bangalore
Print_ISBN
1-4244-0682-X
Type
conf
DOI
10.1109/ICDIM.2007.369229
Filename
4221921
Link To Document