مرکز منطقه ای اطلاع رساني علوم و فناوري - Improving Classifier Performance Using Data with Different Taxonomies

DocumentCode :

1330843

Title :

Improving Classifier Performance Using Data with Different Taxonomies

Author :

Iwata, Tomoharu ; Tanaka, Toshiyuki ; Yamada, Takeshi ; Ueda, Naonori

Author_Institution :

NTT Commun. Sci. Labs., Keihanna Science City, Japan

Volume :

Issue :

fYear :

2011

Firstpage :

1668

Lastpage :

1677

Abstract :

We propose a framework for improving classifier performance by effectively using auxiliary samples. The auxiliary samples are labeled not in terms of the target taxonomy according to which we wish to classify samples, but according to classification schemes or taxonomies that are different from the target taxonomy. Our method finds a classifier by minimizing a weighted error over the target and auxiliary samples. The weights are defined so that the weighted error approximates the expected error when samples are classified into the target taxonomy. Experiments using synthetic and text data show that our method significantly improves the classifier performance in most cases compared to conventional data augmentation methods.

Keywords :

error analysis; pattern classification; auxiliary samples; classifier performance; data augmentation methods; synthetic data; taxonomies; text data; weighted error minimization; Accuracy; Computational modeling; Correlation; Estimation; Taxonomy; Training; Web pages; Transfer learning; semisupervised learning; text classification.;

fLanguage :

English

Journal_Title :

Knowledge and Data Engineering, IEEE Transactions on

Publisher :

ieee

ISSN :

1041-4347

Type :

jour

DOI :

10.1109/TKDE.2010.170

Filename :

5582091

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1330843