DocumentCode
2651997
Title
An Improved Centroid-Based Approach for Multi-label Classification of Web Pages by Genre
Author
Jebari, Chaker
Author_Institution
Coll. of Appl. Sci., Ibri, Oman
fYear
2011
fDate
7-9 Nov. 2011
Firstpage
889
Lastpage
890
Abstract
In this paper, we propose an improved multi-label approach to classify web pages by genre. Our approach provides a multi-label classification scheme in which a web page can be assigned to more than one genre. To deal with the rapid evolution of web genres, our approach implements an incremental centroid-based classification scheme. Conducted experiments on a multi-labeled corpus of web pages show that our approach provides good results.
Keywords
Internet; pattern classification; Web genre; Web page; improved centroid-based approach; incremental centroid-based classification scheme; multilabel classification; multilabeled corpus; Classification algorithms; Complexity theory; Educational institutions; Noise; Support vector machines; Training; Web pages; centroid adjustement; genre centroid; incremental classification; multi-label classification; noise web page;
fLanguage
English
Publisher
ieee
Conference_Titel
Tools with Artificial Intelligence (ICTAI), 2011 23rd IEEE International Conference on
Conference_Location
Boca Raton, FL
ISSN
1082-3409
Print_ISBN
978-1-4577-2068-0
Electronic_ISBN
1082-3409
Type
conf
DOI
10.1109/ICTAI.2011.142
Filename
6103433
Link To Document