DocumentCode
2315384
Title
Web Data Clustering Using FCM and Proximity Hints from Text as well as Hyperlink-Structure
Author
Agrawal, Deepak
Author_Institution
Inst. of Technol, Banahas Hindu Univ., Varanasi
fYear
2008
fDate
16-18 July 2008
Firstpage
1104
Lastpage
1108
Abstract
In this study, we use FCM clustering along with proximity hints (P-FCM) to the Web pages for clustering. We provide proximity hints using a new approach of combining textual information, hyperlink structure and co-citation relations into a single similarity metric. We provide the result of Web-based experiments to show the significance of proximity hints during P-FCM functioning. These observations suggest that with the combination of textual and hyperlink-structure information we can improve the clustering done by FCM. We also show that the correlation value of human clustering and our approach is very high, showing thereby the efficiency over the existing FCM algorithm.
Keywords
Internet; fuzzy set theory; search engines; FCM; Web data clustering; cocitation relations; fuzzy C-mean algorithm; hyperlink-structure; proximity hints; textual information; Clustering algorithms; Data engineering; Data mining; Equations; Fuzzy logic; Humans; Search engines; Web and internet services; Web pages; Web search; Fuzzy C-mean algorithm; Fuzzy logic; Humancomputer interaction; Search engines; Similarity;
fLanguage
English
Publisher
ieee
Conference_Titel
Emerging Trends in Engineering and Technology, 2008. ICETET '08. First International Conference on
Conference_Location
Nagpur, Maharashtra
Print_ISBN
978-0-7695-3267-7
Electronic_ISBN
978-0-7695-3267-7
Type
conf
DOI
10.1109/ICETET.2008.31
Filename
4580068
Link To Document