• DocumentCode
    1728484
  • Title

    An Adaptive Page Clustering Based Weighting Method for Information Retrieval

  • Author

    Yi-Xian Lin ; Hung-Yu Kao

  • Author_Institution
    Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
  • fYear
    2013
  • Firstpage
    199
  • Lastpage
    204
  • Abstract
    With the coming of the era of information explosion, using Internet to obtain information has become the most convenient pipeline information flow. However, the found information mostly based on keyword matching through the search engines, and the search engines do not generally conduct filtering and screening in order to enhance the returns. If the web pages pass a systematic arrangement and are divided into multiple categories or clusters, the users will be guided to obtain real help of information. In this paper, we propose an adaptive web pages clustering algorithm to perform this task. It extracts features to reduce feature dimensions, then filters automatically web pages into its appropriate cluster and enhances the features of the pages to site features for different coefficients to improve the effect. Finally, providing users a more accurate search data model. The experimental results show that compared to the traditional TF-IDF, the proposed approach can find the needed web pages and the topics of the web pages in the corresponding cluster that are highly similar.
  • Keywords
    Internet; Web sites; document handling; information retrieval; pattern clustering; search engines; Internet; TF-IDF; Web pages clustering algorithm; adaptive page clustering based weighting method; information explosion; information retrieval; keyword matching; pipeline information flow; search engines; systematic arrangement; Clustering algorithms; Clustering methods; Feature extraction; Keyword search; Search engines; Web pages; Correlation Coefficient; Information Retrieval; Page Clustering; Topic Feature;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Technologies and Applications of Artificial Intelligence (TAAI), 2013 Conference on
  • Conference_Location
    Taipei
  • Print_ISBN
    978-1-4799-2528-5
  • Type

    conf

  • DOI
    10.1109/TAAI.2013.48
  • Filename
    6783867