DocumentCode :
3106300
Title :
Toward DB-IR Integration: Per-Document Basis Transactional Index Maintenance
Author :
Kim, Kwangyoung ; Jin, Du-Seok ; Choi, Yunsoo ; Jeong, Chang-Hoo ; Kim, Kwangyoung ; Choi, Sung-Pil ; Lee, Minho ; Cho, Min-Hee ; Choe, Ho-Seop ; Yoon, Hwa-Mook ; Seo, Jeong-Hyun
fYear :
2007
fDate :
22-24 Aug. 2007
Firstpage :
452
Lastpage :
462
Abstract :
While information retrieval (IR) and databases (DB) have developed independently, there is an emerging requirement that information systems such as health care systems, bulletin boards, XML data management systems, and digital libraries support both data management and efficient text retrieval simultaneously. DB-IR integration has recently emerged as a research issue. The great divide between DB and IR has led to different approaches to index maintenance for newly arriving documents. While DB has extended its SQL layer to cope with text fields because it lacks a built-in mechanism for building an IR-like index, IR usually treats a block of new documents as the logical unit of index maintenance, since it has no concept of integrity constraints. Toward DB-IR integration, however, a transaction that adds or updates a document should include maintenance of the postings lists associated with that document; hence per-document basis transactional index maintenance. This paper evaluates the performance of several strategies for per-document basis transactions for inserting documents: direct index update, stand-alone auxiliary index, and pulsing auxiliary index. Results on KRISTAL-IRMS show that the pulsing auxiliary strategy, in which long postings lists in the auxiliary index are updated in place into the main index whereas short lists are updated directly in the auxiliary index, can be a competitive candidate for text field indexing in DB-IR integration.
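The pulsing auxiliary strategy described in the abstract can be illustrated with a minimal in-memory sketch. This is not the KRISTAL-IRMS implementation: the class name `PulsingIndex`, the `threshold` parameter that decides when a postings list counts as "long", and the use of plain Python lists are all assumptions made for illustration. Transaction handling and on-disk layout are not modeled.

```python
from collections import defaultdict


class PulsingIndex:
    """Toy model of a main index plus an auxiliary index (hypothetical names)."""

    def __init__(self, threshold=4):
        # Assumption: a postings list is "long" once it holds `threshold` entries.
        self.threshold = threshold
        self.main = defaultdict(list)  # term -> postings already in the main index
        self.aux = defaultdict(list)   # term -> short postings kept in the auxiliary index

    def add_document(self, doc_id, terms):
        """Index one document; each call stands in for one per-document update."""
        for term in sorted(set(terms)):
            # Short lists are updated directly in the auxiliary index.
            self.aux[term].append(doc_id)
            # Once a list grows long, its postings are appended to the main index
            # and the auxiliary entry is dropped.
            if len(self.aux[term]) >= self.threshold:
                self.main[term].extend(self.aux.pop(term))

    def postings(self, term):
        """Return the full postings list, merging main and auxiliary entries."""
        return self.main.get(term, []) + self.aux.get(term, [])


if __name__ == "__main__":
    idx = PulsingIndex(threshold=2)
    idx.add_document(1, ["db", "ir"])
    idx.add_document(2, ["db"])
    print(idx.postings("db"), idx.postings("ir"))
```

In this sketch a query must consult both structures, which is the usual cost of keeping an auxiliary index alongside the main one; the threshold trades that lookup cost against how often the main index is touched.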
Keywords :
Conference management; Crawlers; Databases; Information retrieval; Information technology; Management information systems; Middleware; Search engines; Web search; XML; DB-IR integration; dynamic index maintenance; stand-alone auxiliary index; pulsing auxiliary index
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Advanced Language Processing and Web Information Technology, 2007. ALPIT 2007. Sixth International Conference on
Conference_Location :
Luoyang, Henan, China
Print_ISBN :
978-0-7695-2930-1
Type :
conf
DOI :
10.1109/ALPIT.2007.15
Filename :
4460683