DocumentCode :
2832726
Title :
A Universal Full Text Index with Access Control and Annotation Driven Information Retrieval
Author :
Chávez, Edgar ; Téllez, Eric Sadit
Author_Institution :
Univ. Michoacana
fYear :
2006
fDate :
Nov. 2006
Firstpage :
135
Lastpage :
140
Abstract :
Full text databases are tightly linked to the application layer. Currently IR projects must be integrated in the back-end using, at best, a general-purpose language-independent API. This architecture limits and precludes the rapid prototyping. In this paper we present a new approach, a very simple architecture, towards the development of a general purpose full-text database. We implemented a standard inverted file index, providing various extra capabilities. For each document stored we simply added a set of qualifiers, MD5 hashes and keywords, algorithmic ally unrelated to the document content. This allows to hierarchically control access to the document, iteratively improve document categorization, add and delete annotations, and document versions. All transactions are done through a standard Web service interface. This feature facilitates system integration, and testing. We describe a set of applications where our concept can be useful. The universe of applications for our concept encompass those areas where document annotations are relevant. Once stored and annotated (with qualifiers), the documents can be retrieved by a combination of qualifiers and document content. Additionally, we show our prototype in action, explaining how can be extended to support retrieval and storage models appeared in some popular sites recently
Keywords :
full-text databases; indexing; information retrieval; MD5 hashes; annotation driven information retrieval; document access control; document categorization; full-text database; standard inverted file index; universal full text index; Access control; Computer architecture; Content based retrieval; Databases; Information retrieval; Prototypes; Software prototyping; System testing; Web search; Web services;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computing, 2006. CIC '06. 15th International Conference on
Conference_Location :
Mexico City
Print_ISBN :
0-7695-2708-6
Type :
conf
DOI :
10.1109/CIC.2006.17
Filename :
4023800
Link To Document :
بازگشت