DocumentCode
197369
Title
Clasificación multi-etiqueta utilizando computación distribuida
Author
Rodriguez, Juan Manuel ; Zunino, Alejandro ; Godoy, Daniela ; Mateos, Cristian
Author_Institution
ISISTAN, UNICEN-CONICET, Campus Universitario - Paraje Arroyo Seco, Tandil, Buenos Aires, Argentina
fYear
2014
fDate
11-13 June 2014
Firstpage
90
Lastpage
95
Abstract
Multi-label classification techniques have been developed for problems where objects can be associated to several disjoint labels, such as the scientific topics covered by a paper. However, these techniques tend to be computationally complex, which makes it difficult to use them in practice. Therefore, they might be unsuitable for large problems. This paper presents an approach to accelerate a well-know multi-label classification technique, called Binary Relevance, by using small computational clusters. In this classification technique, the training times grow linearly with the number of labels. In particular, this work aims at reducing the times required for training a Binary Relevance classifier. This approach was tested using 7 data-sets with 81 associated labels and more than a quarter million training instances. Experimental results shown a linear increment on the speed-up when computational nodes are added to the cluster.
fLanguage
English
Publisher
ieee
Conference_Titel
Biennial Congress of Argentina (ARGENCON), 2014 IEEE
Conference_Location
Bariloche, Argentina
Print_ISBN
978-1-4799-4270-1
Type
conf
DOI
10.1109/ARGENCON.2014.6868477
Filename
6868477
Link To Document