DocumentCode
3014613
Title
Web document clustering approach using wordnet lexical categories and fuzzy clustering
Author
Gharib, T.F. ; Fouad, Mohammed M. ; Aref, MostafaM
Author_Institution
Fac. of Comput. & Inf. Sci., Ain Shams Univ., Cairo
fYear
2008
fDate
24-27 Dec. 2008
Firstpage
48
Lastpage
55
Abstract
Web mining is defined as applying data mining techniques to the content, structure, and usage of Web resources. The three areas of Web mining are commonly distinguished: content mining, structure mining, and usage mining. In all these areas, a wide range of general data mining techniques, in particular association rule discovery, clustering, classification, and sequence mining, are employed and developed further to reflect the specific structures of Web resources and the specific questions posed in Web mining. In this paper, we introduced a Web document clustering approach that uses WordNet lexical categories and fuzzy c-means algorithm to improve the performance of clustering problem for Web document. Experiments show that fuzzy c-means algorithm achieves great performance optimization with comparison with the recent algorithms for document clustering.
Keywords
Internet; content management; data mining; document handling; fuzzy set theory; pattern clustering; Web document clustering; Web mining; Web resource; WordNet lexical category; association rule discovery; content mining; data mining; fuzzy c-means algorithm; fuzzy clustering; sequence mining; structure mining; usage mining; Artificial intelligence; Association rules; Clustering algorithms; Clustering methods; Computer science; Data mining; Information science; Optimization; Text mining; Web mining;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer and Information Technology, 2008. ICCIT 2008. 11th International Conference on
Conference_Location
Khulna
Print_ISBN
978-1-4244-2135-0
Electronic_ISBN
978-1-4244-2136-7
Type
conf
DOI
10.1109/ICCITECHN.2008.4803109
Filename
4803109
Link To Document