Title :
Integrating a query language for structured and semi-structured data and IR techniques
Author :
Heuer, Andreas ; Priebe, Denny
Author_Institution :
Database Res. Group, Rostock Univ., Germany
Abstract :
The authors describe the basic ideas and concepts behind the Information Retrieval Query Language (IRQL), which is used as one of the back-ends in the GETESS project. The front-end provides a user interface which is embedded in a dialogue system. This dialogue system allows queries to be formulated in a user-friendly (i.e. exploiting a limited range of natural language) and interactive way. Access to the analyzed data is provided by IRQL. The principal focus of IRQL development is the integration of concepts from information retrieval, database query languages, and query languages for semi-structured data. As a result, we can exploit the structure of documents, if known, and can additionally use information retrieval techniques regardless of whether the structure is known or not. Our approach develops a query language that is compatible with the recently adopted SQL99 standard and includes information retrieval clauses (e.g. Boolean retrieval). Our data model extends the object-relational model and additionally supports an abstraction of attributes. That is, we can use attribute-independent queries as well as attribute-dependent ones, as in RDBMSs. We evaluate IRQL queries by mapping them to queries supported by existing systems such as object-relational DBMSs, full-text DBMSs, or conventional search engines, and post-processing the results supplied by these systems, if necessary.
Keywords :
data models; document handling; full-text databases; information retrieval; object-oriented databases; query languages; relational databases; Boolean retrieval; GETESS project; IR techniques; IRQL development; IRQL queries; Information Retrieval Query Language; RDBMSs; SQL99 standard; attribute-independent queries; back-ends; conventional search engines; data access; data model; database query languages; dialogue system; full-text DBMSs; information retrieval; information retrieval clauses; information retrieval techniques; object-relational DBMSs; object-relational model; post processing; semi-structured data; structured data; user interface; Abstracts; Computer science; Content based retrieval; Database languages; Information retrieval; Natural languages; Search engines; Spatial databases; Standards development; World Wide Web;
Conference_Title :
Database and Expert Systems Applications, 2000. Proceedings. 11th International Workshop on
Conference_Location :
London
Print_ISBN :
0-7695-0680-1
DOI :
10.1109/DEXA.2000.875102