DocumentCode :
2210655
Title :
Passive Sampling for Regression
Author :
Yu, Hwanjo ; Kim, Sungchul
Author_Institution :
Pohang Univ. of Sci. & Technol. (POSTECH), Pohang, South Korea
fYear :
2010
fDate :
13-17 Dec. 2010
Firstpage :
1151
Lastpage :
1156
Abstract :
Active sampling (also called active learning or selective sampling) has been extensively researched for classification and rank learning methods, which is to select the most informative samples from unlabeled data such that, once the samples are labeled, the accuracy of the function learned from the samples is maximized. While active sampling methods require learning a function at each iteration to find the most informative samples, this paper proposes passive sampling techniques for regression, which find the informative samples not based on the learned function but based on the samples´ geometric characteristics in the feature space. Passive sampling is more efficient than active sampling, as it does not require, at each iteration, learning and validating the regression functions and evaluating the unlabeled data using the function. For regression, passive sampling is also more effective, Active sampling for regression suffers from serious performance fluctuations in practice, because it selects the samples of highest regression errors and such samples are likely noisy. Passive sampling, on the other hand, shows more stable performance. We observe from our extensive experiments that our passive sampling methods perform even better than the ``omniscient´´ active sampling that knows the labels of unlabeled data.
Keywords :
learning (artificial intelligence); regression analysis; sampling methods; active learning; active sampling; passive sampling; rank learning method; regression; selective sampling; active learning; active sampling; passive sampling; regression; selective sampling;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining (ICDM), 2010 IEEE 10th International Conference on
Conference_Location :
Sydney, NSW
ISSN :
1550-4786
Print_ISBN :
978-1-4244-9131-5
Electronic_ISBN :
1550-4786
Type :
conf
DOI :
10.1109/ICDM.2010.9
Filename :
5694100
Link To Document :
بازگشت