Spotlight
Near-Optimal Algorithms for Explainable k-Medians and k-Means
Konstantin Makarychev · Liren Shan
Abstract:
We consider the problem of explainable -medians and -means introduced by Dasgupta, Frost, Moshkovitz, and Rashtchian~(ICML 2020). In this problem, our goal is to find a \emph{threshold decision tree} that partitions data into clusters and minimizes the -medians or -means objective. The obtained clustering is easy to interpret because every decision node of a threshold tree splits data based on a single feature into two groups. We propose a new algorithm for this problem which is competitive with -medians with norm and competitive with -means. This is an improvement over the previous guarantees of and by Dasgupta et al (2020). We also provide a new algorithm which is competitive for -medians with norm. Our first algorithm is near-optimal: Dasgupta et al (2020) showed a lower bound of for -medians; in this work, we prove a lower bound of for -means. We also provide a lower bound of for -medians with norm.
Chat is not available.