DocumentCode :
2605993
Title :
Concept hierarchy based text database categorization in a metasearch engine environment
Author :
Wang, Wenxian ; Meng, Weiyi ; Yu, Clement
Author_Institution :
Dept. of Comput. Sci., State Univ. of New York, Binghamton, NY, USA
Volume :
1
fYear :
2000
fDate :
2000
Firstpage :
283
Abstract :
Document categorization, as a technique to improve the retrieval of useful documents, has been extensively investigated. One important issue in a large-scale meta-search engine is to select text databases that are likely to contain useful documents for a given query. We believe that database categorization can be a potentially effective technique for good database selection, especially in the Internet environment, where short queries are usually submitted. In this paper, we propose and evaluate several database categorization algorithms. This study indicates that, while some document categorization algorithms could be adopted for database categorization, algorithms that take into consideration the special characteristics of databases may be more effective. Preliminary experimental results are provided to compare the proposed database categorization algorithms
Keywords :
Internet; full-text databases; information retrieval system evaluation; search engines; semantic networks; Internet; concept hierarchy; database selection; document categorization; document retrieval; large-scale meta-search engine; short queries; text database categorization algorithms; useful documents; Computer science; Databases; Indexes; Information retrieval; Internet; Large-scale systems; Metasearch; Scattering; Search engines; Telecommunication traffic;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Web Information Systems Engineering, 2000. Proceedings of the First International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
0-7695-0577-5
Type :
conf
DOI :
10.1109/WISE.2000.882403
Filename :
882403
Link To Document :
بازگشت