Title :
The categorisation of hidden Web databases through concept specificity and coverage
Author :
Hedley, Yih-Ling ; Younas, Muhammad ; James, Anne
Author_Institution :
Sch. of Math. & Inf. Sci., Coventry Univ., UK
Abstract :
Hidden Web databases maintain a collection of specialised documents, which are dynamically generated in response to users´ queries. The categorisation of such databases into a set of predefined categories has been widely employed to assist users in their information searches. In this paper we present a technique that automatically categorises a document database through its content summary and concepts described by their specificity and coverage. Experimental results show that our approach categorises databases with a larger number of relevant categories.
Keywords :
classification; content management; content-based retrieval; document handling; information retrieval; information retrieval systems; concept coverage; concept specificity; content summary; document database; hidden Web databases; information search; Data mining; Databases; Frequency; Information retrieval; Sampling methods; Web pages;
Conference_Titel :
Advanced Information Networking and Applications, 2005. AINA 2005. 19th International Conference on
Print_ISBN :
0-7695-2249-1
DOI :
10.1109/AINA.2005.323