Title :
Communication-efficient algorithms for statistical optimization
Author :
Yuchen Zhang ; Duchi, John C. ; Wainwright, Martin J.
Author_Institution :
Dept. of Electr. Eng. & Comput. Sci., UC Berkeley, Berkeley, CA, USA
Abstract :
We study two communication-efficient algorithms for distributed statistical optimization on large-scale data. The first algorithm is an averaging method that distributes the N data samples evenly to m machines, performs separate minimization on each subset, and then averages the estimates. We provide a sharp analysis of this average mixture algorithm, showing that under a reasonable set of conditions, the combined parameter achieves mean-squared error that decays as O(N-1 + (N/m)-2). Whenever m ≤ √N, this guarantee matches the best possible rate achievable by a centralized algorithm with access to all N samples. The second algorithm is a novel method, based on an appropriate form of bootstrap. Requiring only a single round of communication, it has mean-squared error that decays as O(N-1 + (N/m)-3), and so is more robust to the amount of parallelization.
Keywords :
large-scale systems; optimisation; statistical analysis; average mixture algorithm; averaging method; centralized algorithm; communication-efficient algorithms; control sharp analysis; data samples; distributed statistical optimization; mean-squared error; Algorithm design and analysis; Distributed databases; Estimation; Inference algorithms; Minimization; Optimization; Robustness;
Conference_Titel :
Decision and Control (CDC), 2012 IEEE 51st Annual Conference on
Conference_Location :
Maui, HI
Print_ISBN :
978-1-4673-2065-8
Electronic_ISBN :
0743-1546
DOI :
10.1109/CDC.2012.6426691