Poster

Adaptive Random Walk Gradient Descent for Decentralized Optimization

Tao Sun · Dongsheng Li · Bao Wang

Hall E #719

Keywords: [ OPT: Stochastic ] [ OPT: First-order ]

Thu 21 Jul 3 p.m. PDT — 5 p.m. PDT
 
Spotlight presentation: Optimization/Theory
Thu 21 Jul 12:30 p.m. PDT — 2 p.m. PDT

Abstract:

In this paper, we study adaptive step size random walk gradient descent with momentum for decentralized optimization, in which the training samples are drawn dependently rather than independently. We establish theoretical convergence rates for this method in both the convex and nonconvex settings. In particular, we prove that adaptive random walk algorithms perform as well as their non-adaptive counterparts on dependent data in general, and achieve acceleration when the stochastic gradients are "sparse". Moreover, we study the zeroth-order version of adaptive random walk gradient descent and provide the corresponding convergence results. All assumptions used in this paper are mild and general, making our results applicable to many machine learning problems.
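
For intuition, the following is a minimal sketch (not the authors' implementation) of an adaptive step size random walk gradient descent with momentum. It assumes each node i holds a local loss with stochastic gradient grad_f(i, x), a token performs a random walk over the communication graph given by neighbors, and the visited node applies an AdaGrad-style coordinate-wise step with heavy-ball momentum; the function and parameter names (adaptive_rw_sgdm, eta, beta, eps) are illustrative only.

import numpy as np

def adaptive_rw_sgdm(grad_f, neighbors, x0, n_nodes, T,
                     eta=0.1, beta=0.9, eps=1e-8, seed=0):
    rng = np.random.default_rng(seed)
    x = x0.copy()
    m = np.zeros_like(x)           # momentum buffer
    v = np.zeros_like(x)           # accumulated squared gradients (adaptive step size)
    i = rng.integers(n_nodes)      # start the random walk at a random node
    for t in range(T):
        g = grad_f(i, x)                       # stochastic gradient at the visited node
        v += g * g                             # AdaGrad-style accumulator
        m = beta * m + (1.0 - beta) * g        # momentum update
        x = x - eta * m / (np.sqrt(v) + eps)   # coordinate-wise adaptive step
        i = rng.choice(neighbors[i])           # token moves to a uniformly random neighbor
    return x

In a toy usage, grad_f(i, x) could return the gradient of node i's local least-squares loss and neighbors could describe a ring graph; because consecutive samples come from adjacent nodes of the walk, they are dependent, which is the setting analyzed in the paper.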
