Title :
SVD: a novel content-based representation technique for Web documents
Author :
Chue, W.L. ; Chen, L.H.
Author_Institution :
Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore, Singapore
Abstract :
Users typically express their information needs to search engines as short queries, and they often have to sift manually through the relevance-ranked search results, which makes relevance judgement time-consuming. The structure of the Web is increasingly being used to improve the organisation, search and analysis of information on the Web. In this paper, we describe how the Web structure, together with our novel summarisation techniques, can be applied to better represent knowledge in actual Web documents via the proposed semantic virtual documents (SVD). We also outline our experimental design for evaluating the effectiveness of the proposed SVD with a prototype system called iSEARCH (intelligent search and review of cluster hierarchy) for Web content retrieval and mining. The experimental results confirm that the novel technique shows promising qualities in content-based representation of Web documents for enhancing Web content retrieval and mining.
Keywords :
Internet; content-based retrieval; data mining; virtual reality; iSEARCH; SVD; Web content mining; Web content retrieval; Web documents; communications techniques; content-based representation technique; intelligent search and review of cluster hierarchy; semantic virtual documents; Design for experiments; Information analysis; Information retrieval; Intelligent systems; Knowledge representation; Prototypes; Search engines; Visualization
Conference_Title :
Proceedings of the 2003 Joint Conference of the Fourth International Conference on Information, Communications and Signal Processing, 2003 and Fourth Pacific Rim Conference on Multimedia
Print_ISBN :
0-7803-8185-8
DOI :
10.1109/ICICS.2003.1292785