ICML Poster Can Gaussian Sketching Converge Faster on a Preconditioned Landscape?

Poster

Can Gaussian Sketching Converge Faster on a Preconditioned Landscape?

Yilong Wang · Haishan Ye · Guang Dai · Ivor Tsang

Hall C 4-9 #2501

[ Abstract ] [ Paper PDF ]

[ Poster]

Thu 25 Jul 4:30 a.m. PDT — 6 a.m. PDT

Abstract:

This paper focuses on the large-scale optimization which is very popular in the big data era. The gradient sketching is an important technique in the large-scale optimization. Specifically, the random coordinate descent algorithm is a kind of gradient sketching method with the random sampling matrix as the sketching matrix. In this paper, we propose a novel gradient sketching called GSGD (Gaussian Sketched Gradient Descent). Compared with the classical gradient sketching methods such as the random coordinate descent and SEGA (Hanzely et al., 2018), our GSGD does not require the importance sampling but can achieve a fast convergence rate matching the ones of these methods with importance sampling. Furthermore, if the objective function has a non-smooth regularization term, our GSGD can also exploit the implicit structure information of the regularization term to achieve a fast convergence rate. Finally, our experimental results substantiate the effectiveness and efficiency of our algorithm.

Chat is not available.