Title :
Towards creating a knowledge base for World-Wide Web documents
Author :
Lambrix, Patrick ; Shahmehri, Nahid ; Åberg, Johan
Author_Institution :
Dept. of Comput. & Inf. Sci., Linköping Univ., Sweden
Abstract :
The lack of organization of information on the Web makes information retrieval inefficient. Several approaches for improvement have been suggested. We propose using a document knowledge base that contains semantic and structural information about the retrievable documents, extracted from the documents themselves. We show that such a knowledge base offers a number of advantages, including advanced query functionality. We also discuss how such a knowledge base can be created and, in particular, show how structural information can be extracted automatically from HTML documents and added to the document knowledge base.
Keywords :
Internet; deductive databases; document handling; knowledge based systems; page description languages; query processing; HTML documents; World Wide Web documents; advanced query functionality; document knowledge base; information retrieval; semantic information; structural information; Data mining; Databases; HTML; Information science; Search engines; Text analysis; Web search; Web sites
Conference_Title :
Intelligent Information Systems, 1997. IIS '97. Proceedings
Conference_Location :
Grand Bahama Island
Print_ISBN :
0-8186-8218-3
DOI :
10.1109/IIS.1997.645367