ICML 2021 On the Power of Localized Perceptron for Label-Optimal Learning of Halfspaces with Adversarial Noise Spotlight

Spotlight

On the Power of Localized Perceptron for Label-Optimal Learning of Halfspaces with Adversarial Noise

Jie Shen

[ Abstract ] [ Visit Semisupervised Learning 2 ] [ Paper ]

[ Paper ]

Abstract: We study {\em online} active learning of homogeneous halfspaces in

R^{d}

$\mathbb{R}^d$ with adversarial noise where the overall probability of a noisy label is constrained to be at most

ν

$\nu$ . Our main contribution is a Perceptron-like online active learning algorithm that runs in polynomial time, and under the conditions that the marginal distribution is isotropic log-concave and

ν = Ω (ϵ)

$\nu = \Omega(\epsilon)$ , where

ϵ \in (0, 1)

$\epsilon \in (0, 1)$ is the target error rate, our algorithm PAC learns the underlying halfspace with near-optimal label complexity of

\tilde{O} (d \cdot \polylog (\frac{1}{ϵ}))

$\tilde{O}\big(d \cdot \polylog(\frac{1}{\epsilon})\big)$ and sample complexity of

\tilde{O} (\frac{d}{ϵ})

$\tilde{O}\big(\frac{d}{\epsilon} \big)$ . Prior to this work, existing online algorithms designed for tolerating the adversarial noise are subject to either label complexity polynomial in

\frac{1}{ϵ}

$\frac{1}{\epsilon}$ , or suboptimal noise tolerance, or restrictive marginal distributions. With the additional prior knowledge that the underlying halfspace is

s

$s$ -sparse, we obtain attribute-efficient label complexity of

\tilde{O} (s \cdot \polylog (d, \frac{1}{ϵ}))

$\tilde{O}\big( s \cdot \polylog(d, \frac{1}{\epsilon}) \big)$ and sample complexity of

\tilde{O} (\frac{s}{ϵ} \cdot \polylog (d))

$\tilde{O}\big(\frac{s}{\epsilon} \cdot \polylog(d) \big)$ . As an immediate corollary, we show that under the agnostic model where no assumption is made on the noise rate

ν

$\nu$ , our active learner achieves an error rate of

O (O P T) + ϵ

$O(OPT) + \epsilon$ with the same running time and label and sample complexity, where

O P T

$OPT$ is the best possible error rate achievable by any homogeneous halfspace.

Chat is not available.