DocumentCode :
2498450
Title :
Recognizing and Filtering Web Documents with Using Ontology
Author :
Abadi, Reza Mohamadi Bahram ; Yektaie, Mohammadi Hossein ; Abbasi, Mashallah
Author_Institution :
Azad Univ. Of Oloum-va-Tahghighat, Ahwaz, Iran
fYear :
2010
fDate :
23-25 April 2010
Firstpage :
524
Lastpage :
530
Abstract :
In so far as users can have access to required data among piles of documents in World Wide Web, there is need for systematic methods of search. This paper puts to evaluation the extent of relationship between a semi structured HTML and ontology using some statistical techniques. For this purpose, having considered the existing data in ontology, what be calculated is the expected density and expected value for an ontology-related document. For giving an idea on a sample document, we need to count lexical objects viewed in the document conforming to lexical objects within ontology and upon using it, viewed density is put to calculation for the document. Then, having used lemma Pearson, there is a need to set an acceptable limitation for comparing expected density and expected value with view density and view value. If calculations for the two cases of expected value with density and view value are within the required range, then ontology would be related. According to experimental Results within a 95% reliable range, shows that the recommended method´s ability to achieve value recall 100% and precision 83% is able.
Keywords :
Internet; hypermedia markup languages; information filtering; ontologies (artificial intelligence); statistical analysis; Web document filtering; Web document recognition; World Wide Web; lemma Pearson; ontology related document; semistructured HTML; statistical techniques; systematic methods; Computer networks; Data mining; Decision trees; HTML; Information filtering; Information filters; Ontologies; Search methods; Web pages; Web sites; Information filtering; Web documents; application- ontology;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer and Network Technology (ICCNT), 2010 Second International Conference on
Conference_Location :
Bangkok
Print_ISBN :
978-0-7695-4042-9
Electronic_ISBN :
978-1-4244-6962-8
Type :
conf
DOI :
10.1109/ICCNT.2010.95
Filename :
5474444
Link To Document :
بازگشت