DocumentCode :
2123935
Title :
Deriving Semantic Sessions from Semantic Clusters
Author :
Safarkhani, Banafsheh ; Talabeigi, Mojde ; Mohsenzadeh, Mehran ; Meybodi, Mohammad Reza
Author_Institution :
Dept. of Comput. Eng., Islamic Azad Univ., Tehran
fYear :
2009
fDate :
3-5 April 2009
Firstpage :
523
Lastpage :
528
Abstract :
An important phase in any Web personalization system is transaction identification. Recently, a number of studies have sought to incorporate the semantics of a website into the representation of transactions. Building a hierarchy of concepts manually, however, is time-consuming and expensive. In this paper we address these shortcomings. Our contribution is a mechanism that automatically improves the representation of the user in the website using a comprehensive lexical semantic resource and semantic clusters. We use Wikipedia, the largest encyclopedia to date, as a rich lexical resource to enhance the automatic construction of a vector-model representation of user sessions. We cluster Web pages based on their content with hierarchical unsupervised fuzzy clustering algorithms, which are effective methods for exploring the structure of complex real data where grouping of overlapping and vague elements is necessary. Entries in Web server logs are used to identify users and visit sessions, while Web pages or resources in the site are clustered based on their content and semantics. These clusters of Web documents are used to scrutinize the discovered Web sessions in order to identify what we call sub-sessions. Each sub-session has a consistent goal. This process improves the derivation of semantic sessions from website users' page views. Our experiments show that the proposed system significantly improves the quality of the Web personalization process.
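The sub-session idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract describes hierarchical fuzzy clustering, whereas this sketch assumes each page has already been reduced to a single dominant cluster label. The function name `split_into_subsessions`, the `page_cluster` mapping, and the example URLs are all hypothetical.

```python
from typing import Dict, List, Optional


def split_into_subsessions(
    session: List[str], page_cluster: Dict[str, int]
) -> List[List[str]]:
    """Split one visit session into sub-sessions of consecutive pages
    that share the same cluster label.

    Assumption: `page_cluster` maps each page to one hard cluster id.
    Pages missing from the mapping get the label None, so consecutive
    unknown pages are grouped together.
    """
    subsessions: List[List[str]] = []
    previous_cluster: Optional[int] = None
    for page in session:
        cluster = page_cluster.get(page)
        if not subsessions or cluster != previous_cluster:
            # Cluster changed (or first page): start a new sub-session.
            subsessions.append([page])
        else:
            # Same cluster as the previous page: extend the current one.
            subsessions[-1].append(page)
        previous_cluster = cluster
    return subsessions


# Hypothetical session and cluster assignment for illustration only.
example_session = ["/a", "/b", "/x", "/y", "/c"]
example_clusters = {"/a": 0, "/b": 0, "/x": 1, "/y": 1, "/c": 0}
example_result = split_into_subsessions(example_session, example_clusters)
```

With fuzzy memberships, one plausible variant would compare membership vectors of consecutive pages against a similarity threshold instead of testing label equality; the abstract does not specify which rule the paper uses.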
Keywords :
Web sites; document handling; fuzzy set theory; pattern clustering; personal computing; user interfaces; Web documents; Web pages; Web personalization system; Web server logs; Web site; Wikipedia; hierarchical unsupervised fuzzy clustering algorithms; semantic clusters; semantic resource; semantic sessions; transaction identification; Clustering algorithms; Data mining; Encyclopedias; Feature extraction; Information management; Ontologies; Taxonomy; Web server; Semantic cluster; Semantic sub-session; Semantic vectors
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Information Management and Engineering, 2009. ICIME '09. International Conference on
Conference_Location :
Kuala Lumpur
Print_ISBN :
978-0-7695-3595-1
Type :
conf
DOI :
10.1109/ICIME.2009.131
Filename :
5077090