DocumentCode :
1302909
Title :
Tree-Based Ranking Methods
Author :
Clémençon, Stéphan ; Vayatis, Nicolas
Author_Institution :
LTCI UMR Inst. Telecom, Telecom Paristech (TSI), Paris, France
Volume :
55
Issue :
9
fYear :
2009
Firstpage :
4316
Lastpage :
4336
Abstract :
This paper investigates how recursive partitioning methods can be adapted to the bipartite ranking problem. In ranking, the pursued goal is global: based on past data, define an order on the whole input space X, so that positive instances take up the top ranks with maximum probability. The most natural way to order all instances consists of projecting the input data onto the real line through a real-valued scoring function s and use the natural order on R. The accuracy of the ordering induced by a candidate s is classically measured in terms of the ROC curve or the AUC. Here we discuss the design of tree-structured scoring functions obtained by recursively maximizing the AUC criterion. The connection with recursive piecewise linear approximation of the optimal ROC curve both in the L1-sense and in the Linfin-sense is highlighted. A novel tree-based algorithm for ranking, called TreeRank, is proposed. Consistency results and generalization bounds of functional nature are established for this ranking method, when considering either the L1 or Linfin distance. We also describe committee-based learning procedures using TreeRank as a ldquobase ranker,rdquo in order to overcome obvious drawbacks of such a top-down partitioning technique. Simulation results on artificial data are also displayed.
Keywords :
approximation theory; decision trees; learning (artificial intelligence); piecewise linear techniques; tree data structures; bipartite ranking problem; committee-based learning procedures; recursive partitioning methods; recursive piecewise linear approximation; tree-based algorithm; tree-based ranking methods; tree-structured scoring functions; Calibration; Decision trees; Helium; Information retrieval; Machine learning; Medical diagnosis; Piecewise linear approximation; Search engines; Statistical learning; Telecommunications; $ {rm AUC}$ criterion; $ {rm ROC}$ curve; Adaptive piecewise linear approximation; bipartite ranking problem; decision Tree;
fLanguage :
English
Journal_Title :
Information Theory, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9448
Type :
jour
DOI :
10.1109/TIT.2009.2025558
Filename :
5208572
Link To Document :
بازگشت