DocumentCode :
2315384
Title :
Web Data Clustering Using FCM and Proximity Hints from Text as well as Hyperlink-Structure
Author :
Agrawal, Deepak
Author_Institution :
Inst. of Technol, Banahas Hindu Univ., Varanasi
fYear :
2008
fDate :
16-18 July 2008
Firstpage :
1104
Lastpage :
1108
Abstract :
In this study, we use FCM clustering along with proximity hints (P-FCM) to the Web pages for clustering. We provide proximity hints using a new approach of combining textual information, hyperlink structure and co-citation relations into a single similarity metric. We provide the result of Web-based experiments to show the significance of proximity hints during P-FCM functioning. These observations suggest that with the combination of textual and hyperlink-structure information we can improve the clustering done by FCM. We also show that the correlation value of human clustering and our approach is very high, showing thereby the efficiency over the existing FCM algorithm.
Keywords :
Internet; fuzzy set theory; search engines; FCM; Web data clustering; cocitation relations; fuzzy C-mean algorithm; hyperlink-structure; proximity hints; textual information; Clustering algorithms; Data engineering; Data mining; Equations; Fuzzy logic; Humans; Search engines; Web and internet services; Web pages; Web search; Fuzzy C-mean algorithm; Fuzzy logic; Human–computer interaction; Search engines; Similarity;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Emerging Trends in Engineering and Technology, 2008. ICETET '08. First International Conference on
Conference_Location :
Nagpur, Maharashtra
Print_ISBN :
978-0-7695-3267-7
Electronic_ISBN :
978-0-7695-3267-7
Type :
conf
DOI :
10.1109/ICETET.2008.31
Filename :
4580068
Link To Document :
بازگشت