Title :
Fast Pairwise Query Selection for Large-Scale Active Learning to Rank
Author :
Buyue Qian ; Xiang Wang ; Jun Wang ; Hongfei Li ; Nan Cao ; Weifeng Zhi ; Davidson, Ian
Author_Institution :
IBM T. J. Watson Res., Yorktown Heights, NY, USA
Abstract :
Pair wise learning to rank algorithms (such as Rank SVM) teach a machine how to rank objects given a collection of ordered object pairs. However, their accuracy is highly dependent on the abundance of training data. To address this limitation and reduce annotation efforts, the framework of active pair wise learning to rank was introduced recently. However, in such a framework the number of possible query pairs increases quadratic ally with the number of instances. In this work, we present the first scalable pair wise query selection method using a layered (two-step) hashing framework. The first step relevance hashing aims to retrieve the strongly relevant or highly ranked points, and the second step uncertainty hashing is used to nominate pairs whose ranking is uncertain. The proposed framework aims to efficiently reduce the search space of pair wise queries and can be used with any pair wise learning to rank algorithm with a linear ranking function. We evaluate our approach on large-scale real problems and show it has comparable performance to exhaustive search. The experimental results demonstrate the effectiveness of our approach, and validate the efficiency of hashing in accelerating the search of massive pair wise queries.
Keywords :
file organisation; learning (artificial intelligence); query processing; active pair wise learning; algorithm ranking; fast pairwise query selection; large-scale active learning; layered hashing framework; scalable pair wise query selection method; search space reduction; uncertainty hashing; Accuracy; Artificial neural networks; Databases; Search problems; Training data; Uncertainty; Vectors; Active Learning; Hashing; Learning to Rank;
Conference_Titel :
Data Mining (ICDM), 2013 IEEE 13th International Conference on
Conference_Location :
Dallas, TX
DOI :
10.1109/ICDM.2013.54