ICML Poster Private Stochastic Convex Optimization: Optimal Rates in L1 Geometry

Hilal Asi · Vitaly Feldman · Tomer Koren · Kunal Talwar

Virtual

Keywords: [ Deep Learning ] [ Algorithms -> Multitask and Transfer Learning; Algorithms ] [ Online Learning ] [ Privacy, Anonymity, and Security ] [ Social Aspects of Machine Learning ]

Abstract: Stochastic convex optimization over an $\ell_1$-bounded domain is ubiquitous in machine learning applications such as LASSO but remains poorly understood when learning with differential privacy. We show that, up to logarithmic factors, the optimal excess population loss of any $(\epsilon,\delta)$-differentially private optimizer is $\sqrt{\log(d)/n} + \sqrt{d}/\epsilon n.$ The upper bound is based on a new algorithm that combines the iterative localization approach of Feldman et al. (2020) with a new analysis of private regularized mirror descent. It applies to $\ell_p$-bounded domains for $p\in [1,2]$ and queries at most $n^{3/2}$ gradients, improving over the best previously known algorithm for the $\ell_2$ case, which needs $n^2$ gradients. Further, we show that when the loss functions satisfy additional smoothness assumptions, the excess loss is upper bounded (up to logarithmic factors) by $\sqrt{\log(d)/n} + (\log(d)/\epsilon n)^{2/3}.$ This bound is achieved by a new variance-reduced version of the Frank-Wolfe algorithm that requires just a single pass over the data. We also show that the lower bound in this case is the minimum of the two rates mentioned above.
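To get a feel for when each of the two rates in the abstract is the smaller one, the sketch below evaluates them numerically. It is only an illustration under stated assumptions: all constants and logarithmic factors are dropped, $\log$ is taken as the natural logarithm, $\sqrt{d}/\epsilon n$ is read as $\sqrt{d}/(\epsilon n)$, and the function names are hypothetical (they do not come from the paper).

```python
import math


def rate_general(n: int, d: int, eps: float) -> float:
    """sqrt(log(d)/n) + sqrt(d)/(eps*n), with constants and log factors dropped."""
    return math.sqrt(math.log(d) / n) + math.sqrt(d) / (eps * n)


def rate_smooth(n: int, d: int, eps: float) -> float:
    """sqrt(log(d)/n) + (log(d)/(eps*n))^(2/3), with constants and log factors dropped."""
    return math.sqrt(math.log(d) / n) + (math.log(d) / (eps * n)) ** (2.0 / 3.0)


def smooth_case_lower_bound(n: int, d: int, eps: float) -> float:
    """Minimum of the two rates, as the abstract describes for the smooth case."""
    return min(rate_general(n, d, eps), rate_smooth(n, d, eps))


if __name__ == "__main__":
    n, eps = 10_000, 1.0  # illustrative values, not taken from the paper
    for d in (10, 10_000, 10_000_000):
        g = rate_general(n, d, eps)
        s = rate_smooth(n, d, eps)
        smaller = "general" if g <= s else "smooth"
        print(f"d={d:>10}: general={g:.4f} smooth={s:.4f} smaller={smaller}")
```

With these illustrative values, the $\sqrt{d}/(\epsilon n)$ term is the smaller privacy cost in low dimension, while the $(\log(d)/(\epsilon n))^{2/3}$ term takes over once $d$ is large relative to $\epsilon n$.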
