DocumentCode :
1733804
Title :
Linear Online Learning over Structured Data with Distributed Tree Kernels
Author :
Filice, S. ; Croce, Daniele ; Basili, Roberto ; Zanzotto, Fabio Massimo
Author_Institution :
Elettron. S.p.A., Rome, Italy
Volume :
1
fYear :
2013
Firstpage :
123
Lastpage :
128
Abstract :
Online algorithms are an important class of learning machines because they are extremely simple and computationally efficient. Their kernelized versions can handle structured data, such as trees, and achieve state-of-the-art performance. However, kernelized versions of online learning algorithms slow down when the number of support vectors becomes large. The traditional way to cope with this problem is to introduce budgets that set the maximum number of support vectors. In this paper, we investigate Distributed Trees (DTs) as an efficient way to use structured data in online learning. DTs effectively embed the huge feature space of tree fragments into small vectors, thus enabling the use of linear versions of kernel machines over tree-structured data. We experiment with the Passive-Aggressive (PA) algorithm, comparing its linear and kernelized versions. A massive dataset of tree-structured data is employed; it originates from a natural language processing task, Boundary Detection in the context of Semantic Role Labeling over FrameNet. Results on a sample of the final data show that DTs with the linear PA algorithm and the Tree Kernel with the Budgeted PA achieve comparable results in terms of F1-measure. Finally, exploring the full dataset allows the former to improve classification performance with respect to the latter.
Keywords :
learning (artificial intelligence); support vector machines; tree data structures; FrameNet; boundary detection; distributed tree kernel; learning machine; linear online learning; natural language processing task; online algorithm; passive-aggressive algorithm; semantic role labeling; structured data; support vector; Complexity theory; Kernel; Semantics; Support vector machines; Syntactics; Training; Vectors; Distributed Trees; Online Learning; Tree Kernels
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Machine Learning and Applications (ICMLA), 2013 12th International Conference on
Conference_Location :
Miami, FL
Type :
conf
DOI :
10.1109/ICMLA.2013.28
Filename :
6784598