DocumentCode :
3764639
Title :
Classifying web hierarchically using multi label tree classifier
Author :
Daya Gupta;Harsh Tripathi;Mayukh Maitra
Author_Institution :
Department of Computer Science and Software Engineering, Delhi Technological University, New Delhi, India
fYear :
2015
Firstpage :
1
Lastpage :
6
Abstract :
Classification and extraction of web finds its applications in semantic web, searching and information extraction. The first part of the paper deals with the problem of classifying web pages, according to their content. Further, the methodology to classify web pages hierarchically in order to achieve topic-wise modeling of websites using multi label tree classifier, a variant of classification where instances may belong to multiple classes at the same time. Data from an implementation of multi label tree classifier shows marked improvements in processing multi-class classification in comparison to conventional hierarchical classification techniques.
Keywords :
"Web pages","Training","Support vector machines","Feature extraction","Dictionaries","Multimedia communication","Classification algorithms"
Publisher :
ieee
Conference_Titel :
India Conference (INDICON), 2015 Annual IEEE
Electronic_ISBN :
2325-9418
Type :
conf
DOI :
10.1109/INDICON.2015.7443337
Filename :
7443337
Link To Document :
بازگشت