Title :
Classification of durian characteristics for semantic representation from web documents
Author :
Bakar, Z.A. ; Ismail, Khairul Nurmazianna
Author_Institution :
Fac. of Comput. & Math. Sci., Dept. of Comput. Sci., Univ. Teknol. MARA (UiTM), Shah Alam, Malaysia
Abstract :
The Web contains enormous size of information that is represented in various document structures. The information is scattered and redundant. Currently, search engine is the main medium for retrieving this information. Yet, the most popular search engine cannot satisfy user query. Alternatively, semantic technology can alleviate this problem. In this paper, only relevant web HTML documents on durian also known as king of fruits are chosen. The characteristics of durian will be extracted from those HTML documents. These characteristics are then employed in semantic representation and stored along with their Uniform Resource Identifier (URI) in Resource Description Framework (RDF). The RDF provides the ontology link to many other web documents on durian. Experiment on 40 HTML documents provides eleven new characteristics of durian that can be represent in RDF for semantic search engine.
Keywords :
Internet; agricultural products; document handling; food products; hypermedia markup languages; information retrieval; pattern classification; search engines; semantic Web; RDF; URI; Web HTML documents; durian characteristics; fruits; information retrieval; ontology link; resource description framework; search engine; semantic representation; semantic technology; uniform resource identifier; Agriculture; Government; HTML; Ontologies; Resource description framework; Semantics; Durian; HTML; RDF; semantic;
Conference_Titel :
E-Learning, E-Management and E-Services (IS3e), 2012 IEEE Symposium on
Conference_Location :
Kuala Lumpur
Print_ISBN :
978-1-4673-2390-1
DOI :
10.1109/IS3e.2012.6414956