Title :
Efficient algorithms for linear summed error structural SVMs
Author :
Balamurugan, P. ; Shevade, Shirish ; Babu, T. Ravindra
Author_Institution :
Comput. Sci. & Autom., Indian Inst. of Sci., Bangalore, India
Abstract :
Structural Support Vector Machines (SSVMs) have become a popular tool in machine learning for predicting structured objects like parse trees, Part-of-Speech (POS) label sequences and image segments. Various efficient algorithmic techniques have been proposed for training SSVMs for large datasets. The typical SSVM formulation contains a regularizer term and a composite loss term. The loss term is usually composed of the Linear Maximum Error (LME) associated with the training examples. Other alternatives for the loss term are yet to be explored for SSVMs. We formulate a new SSVM with Linear Summed Error (LSE) loss term and propose efficient algorithms to train the new SSVM formulation using primal cutting-plane method and sequential dual coordinate descent method. Numerical experiments on benchmark datasets demonstrate that the sequential dual coordinate descent method is faster than the cutting-plane method and reaches the steady-state generalization performance faster. It is thus a useful alternative for training SSVMs when linear summed error is used.
Keywords :
learning (artificial intelligence); support vector machines; LME; LSE; POS label sequences; SSVM formulation; algorithmic techniques; composite loss term; image segments; linear maximum error; linear summed error structural SVM; machine learning; parse trees; part-of-speech label sequences; primal cutting-plane method; regularizer term; sequential dual coordinate descent method; structural support vector machines; Hidden Markov models; Labeling; Optimization; Silicon; Support vector machines; Training; Vectors; Cutting-plane method; Dual method; Structural SVMs;
Conference_Titel :
Neural Networks (IJCNN), The 2012 International Joint Conference on
Conference_Location :
Brisbane, QLD
Print_ISBN :
978-1-4673-1488-6
Electronic_ISBN :
2161-4393
DOI :
10.1109/IJCNN.2012.6252830