DocumentCode :
3717123
Title :
Towards scalable quantile regression trees
Author :
Harish S. Bhat;Nitesh Kumar;Garnet J. Vaz
Author_Institution :
Applied Mathematics Unit, UC Merced, Merced, USA
fYear :
2015
Firstpage :
53
Lastpage :
60
Abstract :
We provide an algorithm to build quantile regression trees in O(N log N) time, where N is the number of instances in the training set. Quantile regression trees are regression trees that model conditional quantiles of the response variable, rather than the conditional expectation as in standard regression trees. We build quantile regression trees by using the quantile loss function in our node splitting criterion. The performance of our algorithm stems from new online update procedures for both the quantile function and the quantile loss function. We test the quantile tree algorithm in three ways, comparing its running time against implementations of standard regression trees, demonstrating its ability to recover a known set of nonlinear quantile functions, and showing that quantile trees yield smaller test set errors (computed using mean absolute deviation) than standard regression trees. The tests include training sets with up to 16 million instances. Overall, our results enable future use of quantile regression trees for large-scale data mining.
Keywords :
"Regression tree analysis","Vegetation","Prediction algorithms","Yttrium","Standards","Approximation methods","Training"
Publisher :
ieee
Conference_Titel :
Big Data (Big Data), 2015 IEEE International Conference on
Type :
conf
DOI :
10.1109/BigData.2015.7363741
Filename :
7363741
Link To Document :
بازگشت