Title :
Multitask Coupled Logistic Regression and its Fast Implementation for Large Multitask Datasets
Author :
Xin Gu ; Fu-Lai Chung ; Ishibuchi, Hisao ; Shitong Wang
Author_Institution :
Sch. of Digital Media, Jiangnan Univ., Wuxi, China
Abstract :
When facing multitask-learning problems, it is desirable that the learning method could find the correct input-output features and share the commonality among multiple domains and also scale-up for large multitask datasets. We introduce the multitask coupled logistic regression (LR) framework called LR-based multitask classification learning algorithm (MTC-LR), which is a new method for generating each classifier for each task, capable of sharing the commonality among multitask domains. The basic idea of MTC-LR is to use all individual LR based classifiers, each one appropriate for each task domain, but in contrast to other support vector machine (SVM)-based proposals, learning all the parameter vectors of all individual classifiers by using the conjugate gradient method, in a global way and without the use of kernel trick, and being easily extended into its scaled version. We theoretically show that the addition of a new term in the cost function of the set of LRs (that penalizes the diversity among multiple tasks) produces a coupling of multiple tasks that allows MTC-LR to improve the learning performance in a LR way. This finding can make us easily integrate it with a state-of-the-art fast LR algorithm called dual coordinate descent method (CDdual) to develop its fast version MTC-LR-CDdual for large multitask datasets. The proposed algorithm MTC-LR-CDdual is also theoretically analyzed. Our experimental results on artificial and real-datasets indicate the effectiveness of the proposed algorithm MTC-LR-CDdual in classification accuracy, speed, and robustness.
Keywords :
conjugate gradient methods; pattern classification; regression analysis; support vector machines; LR-based multitask classification learning algorithm; MTC-LR-CDdual method; SVM-based proposal; dual coordinate descent method; multitask coupled logistic regression; support vector machine; support vector machine-based proposal; Accuracy; Cost function; Couplings; Educational institutions; Gradient methods; Training; Vectors; Dual coordinate descent method (CDdual); logistic regression (LR); multitask classification learning (MTC); posterior probability;
Journal_Title :
Cybernetics, IEEE Transactions on
DOI :
10.1109/TCYB.2014.2362771