ICML Poster Near-optimal sample complexity bounds for learning Latent $k-$polytopes and applications to Ad-Mixtures

Poster

Near-optimal sample complexity bounds for learning Latent $k-$ polytopes and applications to Ad-Mixtures

Chiranjib Bhattacharyya · Ravindran Kannan

Keywords: [ Combinatorial Optimization ] [ Generative Models ] [ Learning Theory ] [ Statistical Learning Theory ] [ Unsupervised and Semi-supervised Learning ]

[ Abstract ]

Abstract: Deriving Optimal bounds on Sample Complexity of Latent Variable models is an active area of research. Recently such bounds were obtained for Mixture of Gaussians \cite{HSNCAY18}, no such results are known for Ad-mixtures, a generalization of Mixture distributions. In this paper we show that

O^{*} (d k / m)

$O^*(dk/m)$ samples are sufficient to learn each of

k -

$k-$ topic vectors of LDA, a popular Ad-mixture model, with vocabulary size

d

$d$ and

m \in Ω (1)

$m\in \Omega(1)$ words per document, to any constant error in

L_{1}

$L_1$ norm. The result is a corollary of the major contribution of this paper: the first sample complexity upper bound for the problem (introduced in \cite{BK20}) of learning the vertices of a Latent

k -

$k-$ Polytope in

\RR^{d}

$\RR^d$ , given perturbed points from it. The bound,

O^{*} (d k / β)

$O^*(dk/\beta)$ , is optimal and linear in number of parameters. It applies to many stochastic models including a broad class Ad-mixtures. To demonstrate the generality of the approach we specialize the setting to Mixed Membership Stochastic Block Models(MMSB) and show for the first time that if an MMSB has

k

$k$ blocks, the sample complexity is

O^{*} (k^{2})

$O^*(k^2)$ under usual assumptions.

Chat is not available.

Poster

Near-optimal sample complexity bounds for learning Latent k−k−k-polytopes and applications to Ad-Mixtures

Chiranjib Bhattacharyya · Ravindran Kannan

Near-optimal sample complexity bounds for learning Latent $k-$ polytopes and applications to Ad-Mixtures