DocumentCode :
2916173
Title :
Estimating End-to-End Performance by Collaborative Prediction with Active Sampling
Author :
Rish, Irina ; Tesauro, Gerald
Author_Institution :
T.J. Watson Res. Center, IBM, Hawthorne, NY
fYear :
2007
fDate :
May 21 2007-May 25 2007
Firstpage :
294
Lastpage :
303
Abstract :
Accurately estimating end-to-end performance in distributed systems is essential both for monitoring compliance with service-level agreements (SLAs) and for performance optimization (e.g., choosing the highest-bandwidth server for a download request in a content-distribution system). Because exhaustive pairwise measurements are infeasible, a natural alternative is to predict unobserved end-to-end performance from available historical data, with minimal additional measurements. In this paper we present an approach to this problem based on Collaborative Prediction (CP), an estimation method designed to work with sparse data that has enjoyed much success in other domains (e.g., product recommendation systems) and that obviates the need for the landmark nodes commonly assumed in other approaches. Specifically, we use Max-Margin Matrix Factorization (MMMF), a linear factor model for CP that has outperformed state-of-the-art CP techniques. Moreover, our approach readily admits active sampling based on prediction confidence, and we further propose a novel active-sampling CP approach that yields even higher predictive accuracy while allowing a flexible trade-off between "exploration" (choosing suboptimal samples to improve estimation accuracy) and "exploitation" (choosing the node with the best estimated performance). We demonstrate successful empirical results on a variety of practical problems, including network latency prediction (NLANR-AMP, P2PSim and PlanetLab datasets) and bandwidth prediction in content-distribution systems (IBM's downloadGrid data).
Keywords :
groupware; matrix decomposition; optimisation; sampling methods; sparse matrices; NLANR-AMP; P2PSim; PlanetLab datasets; active sampling; bandwidth prediction; collaborative prediction; compliance monitoring; content-distribution systems; distributed systems; downloadGrid data; landmark nodes; linear factor model; max-margin matrix factorization; network latency prediction; performance optimization; product recommendation systems; service-level agreements; sparse data; Collaboration; Collaborative work; Design methodology; Extraterrestrial measurements; Monitoring; Optimization; Performance evaluation; Sampling methods; Sparse matrices; Yield estimation;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Integrated Network Management, 2007. IM '07. 10th IFIP/IEEE International Symposium on
Conference_Location :
Munich
Print_ISBN :
1-4244-0798-2
Electronic_ISBN :
1-4244-0799-0
Type :
conf
DOI :
10.1109/INM.2007.374794
Filename :
4258546