DocumentCode :
2651997
Title :
An Improved Centroid-Based Approach for Multi-label Classification of Web Pages by Genre
Author :
Jebari, Chaker
Author_Institution :
Coll. of Appl. Sci., Ibri, Oman
fYear :
2011
fDate :
7-9 Nov. 2011
Firstpage :
889
Lastpage :
890
Abstract :
In this paper, we propose an improved multi-label approach to classify web pages by genre. Our approach provides a multi-label classification scheme in which a web page can be assigned to more than one genre. To deal with the rapid evolution of web genres, our approach implements an incremental centroid-based classification scheme. Conducted experiments on a multi-labeled corpus of web pages show that our approach provides good results.
Keywords :
Internet; pattern classification; Web genre; Web page; improved centroid-based approach; incremental centroid-based classification scheme; multilabel classification; multilabeled corpus; Classification algorithms; Complexity theory; Educational institutions; Noise; Support vector machines; Training; Web pages; centroid adjustement; genre centroid; incremental classification; multi-label classification; noise web page;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Tools with Artificial Intelligence (ICTAI), 2011 23rd IEEE International Conference on
Conference_Location :
Boca Raton, FL
ISSN :
1082-3409
Print_ISBN :
978-1-4577-2068-0
Electronic_ISBN :
1082-3409
Type :
conf
DOI :
10.1109/ICTAI.2011.142
Filename :
6103433
Link To Document :
بازگشت