• DocumentCode
    3312644
  • Title

    A novel Voting Algorithm of multi-class SVM for web page classification

  • Author

    Thamrongrat, Pornpon ; Preechaveerakul, Ladda ; Wettayaprasit, Wiphada

  • Author_Institution
    Comput. Sci. Dept., Prince of Songkla Univ., Songkla, Thailand
  • fYear
    2009
  • fDate
    8-11 Aug. 2009
  • Firstpage
    327
  • Lastpage
    331
  • Abstract
    The increasing numbers of Web pages on the cyber world result to the less effectiveness of document retrieval that matches the need of users. The classification of Web pages is one of the solutions to solve this problem. This paper proposes VAMSVM_WPC model which is a novel voting algorithm for classifying the Web pages, which uses a multi-class SVM method. First, feature is generated from text and title, and then reduces the number of features by two feature selection techniques. Use these two types of features to give input to multi-class SVM. Finally, on the output of SVM, a voting algorithm is used to determine the category of the Web pages. Results on CMU benchmark dataset show that using text and title feature with 1vsAll_Voting Algorithm gives the highest F-measure value.
  • Keywords
    Internet; information retrieval; pattern classification; support vector machines; text analysis; Web page classification; document retrieval; feature selection; multiclass support vector machine; text analysis; Artificial intelligence; Computer science; Equations; Laboratories; Performance gain; Support vector machine classification; Support vector machines; Testing; Voting; Web pages; feature selection; support vector machine; web page classification voting;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Information Technology, 2009. ICCSIT 2009. 2nd IEEE International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4244-4519-6
  • Electronic_ISBN
    978-1-4244-4520-2
  • Type

    conf

  • DOI
    10.1109/ICCSIT.2009.5234603
  • Filename
    5234603