• DocumentCode
    2651997
  • Title
    An Improved Centroid-Based Approach for Multi-label Classification of Web Pages by Genre
  • Author
    Jebari, Chaker
  • Author_Institution
    Coll. of Appl. Sci., Ibri, Oman
  • fYear
    2011
  • fDate
    7-9 Nov. 2011
  • Firstpage
    889
  • Lastpage
    890
  • Abstract
    In this paper, we propose an improved multi-label approach to classifying web pages by genre. Our approach provides a multi-label classification scheme in which a web page can be assigned to more than one genre. To deal with the rapid evolution of web genres, it implements an incremental centroid-based classification scheme. Experiments conducted on a multi-labeled corpus of web pages show that our approach provides good results.
  • Keywords
    Internet; pattern classification; Web genre; Web page; improved centroid-based approach; incremental centroid-based classification scheme; multilabel classification; multilabeled corpus; Classification algorithms; Complexity theory; Educational institutions; Noise; Support vector machines; Training; Web pages; centroid adjustment; genre centroid; incremental classification; multi-label classification; noise web page
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Tools with Artificial Intelligence (ICTAI), 2011 23rd IEEE International Conference on
  • Conference_Location
    Boca Raton, FL
  • ISSN
    1082-3409
  • Print_ISBN
    978-1-4577-2068-0
  • Electronic_ISBN
    1082-3409
  • Type
    conf
  • DOI
    10.1109/ICTAI.2011.142
  • Filename
    6103433