• DocumentCode
    2315384
  • Title
    Web Data Clustering Using FCM and Proximity Hints from Text as well as Hyperlink-Structure
  • Author
    Agrawal, Deepak
  • Author_Institution
    Inst. of Technol., Banaras Hindu Univ., Varanasi
  • fYear
    2008
  • fDate
    16-18 July 2008
  • Firstpage
    1104
  • Lastpage
    1108
  • Abstract
    In this study, we cluster Web pages using fuzzy C-means (FCM) clustering guided by proximity hints (P-FCM). We derive the proximity hints with a new approach that combines textual information, hyperlink structure, and co-citation relations into a single similarity metric. We report the results of Web-based experiments that show the significance of proximity hints in the operation of P-FCM. These observations suggest that combining textual and hyperlink-structure information improves the clustering produced by FCM. We also show that the correlation between human clustering and our approach is very high, indicating that it is more effective than the existing FCM algorithm.
  • Keywords
    Internet; fuzzy set theory; search engines; FCM; Web data clustering; cocitation relations; fuzzy C-mean algorithm; hyperlink-structure; proximity hints; textual information; Clustering algorithms; Data engineering; Data mining; Equations; Fuzzy logic; Humans; Web and internet services; Web pages; Web search; Human–computer interaction; Similarity
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Emerging Trends in Engineering and Technology, 2008. ICETET '08. First International Conference on
  • Conference_Location
    Nagpur, Maharashtra
  • Print_ISBN
    978-0-7695-3267-7
  • Electronic_ISBN
    978-0-7695-3267-7
  • Type
    conf
  • DOI
    10.1109/ICETET.2008.31
  • Filename
    4580068