DocumentCode :
427108
Title :
A new analysis of the value of unlabeled data in semi-supervised learning for image retrieval
Author :
Qi Tian ; Yu, Jie ; Xue, Qing ; Sebe, Nicu
Author_Institution :
Dept. of Comput. Sci., Texas Univ., San Antonio, TX
Volume :
2
fYear :
2004
fDate :
30-30 June 2004
Firstpage :
1019
Abstract :
There has been an increasing interest in using unlabeled data in semi-supervised learning for various classification problems. Previous work shows that unlabeled data can improve or degrade the classification performance depending on whether the model assumption matches the ground-truth data distribution, and also on the complexity of the classifier compared with the size of the labeled training set. In this paper, we provide a new analysis on the value of unlabeled data by considering different distributions of the labeled and unlabeled data and showing the migrating effect for semi-supervised learning. Extensive experiments have been performed in the context of image retrieval applications. Our approach evaluates the value of unlabeled data from a new aspect and is aimed to provide a guideline on how unlabeled data should be used
Keywords :
image classification; image retrieval; learning (artificial intelligence); statistical distributions; classification performance; classifier complexity; ground-truth data distribution; image retrieval; probabilistic distribution migrating effect; semi-supervised learning; unlabeled data value; Computer science; Degradation; Guidelines; Image analysis; Image retrieval; Information retrieval; Maximum likelihood estimation; Semisupervised learning; Text categorization; Web search;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo, 2004. ICME '04. 2004 IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
0-7803-8603-5
Type :
conf
DOI :
10.1109/ICME.2004.1394376
Filename :
1394376
Link To Document :
بازگشت