Title :
Based on semantic web similarity
Author :
Li Ruijie ; Yang Weidong ; Haowei Jiang
Author_Institution :
Coll. of Inf. Sci. & Eng., Henan Univ. of Technol., Zhengzhou, China
Abstract :
In this paper, based on a web page to determine similarity is proposed which is based on semantic web similarity judgments. First on the web page of text extraction, is text segmentation, also taking stop words to go out, Similarity judgments for the Web page provides a basis. Then based on Vector Space Model using TF-IDF method similar to the Chinese website text judgment, In the process of judging, We are adopting a dictionary Words of Synonyms thesaurus, the unity of the key words on the page synonymous, And replacement of pages without the text a synonym for comparison, Experimental results show, The text on the page replacing the unity of synonyms significantly improved the accuracy of the web page similarity judgments.
Keywords :
Web sites; dictionaries; semantic Web; text analysis; thesauri; Chinese website text judgment; TF-IDF method; dictionary Words; page synonymous; semantic web similarity judgments; synonyms thesaurus; text extraction; text segmentation; vector space model; web page; Accuracy; Indexes; Laboratories; Manganese; Navigation; Semantics; Wireless sensor networks; page similarity; semantic; synonyms; text;
Conference_Titel :
Computer Science and Information Technology (ICCSIT), 2010 3rd IEEE International Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4244-5537-9
DOI :
10.1109/ICCSIT.2010.5564990