Timezone: »
In the area of data analysis and arguably even in machine learning as a whole, few approaches have been as impactful as the classical k-means clustering. Here, we study the complexity of k-means clustering in settings where most of the data is not known or simply irrelevant. To obtain a more fine-grained understanding of the tractability of this clustering problem, we apply the parameterized complexity paradigm and obtain three new algorithms for k-means clustering of incomplete data: one for the clustering of bounded-domain (i.e., integer) data, and two incomparable algorithms that target real-valued data. Our approach is based on exploiting structural properties of a graphical encoding of the missing entries, and we show that tractability can be achieved using significantly less restrictive parameterizations than in the complementary case of few missing entries.
Author Information
Robert Ganian (TU Wien)
Thekla Hamm (TU Wien)
Viktoriia Korchemna (TU Wien)
Karolina Okrasa (University of Warsaw)
Kirill Simonov (University of Bergen)
Related Events (a corresponding poster, oral, or spotlight)
-
2022 Spotlight: The Complexity of k-Means Clustering when Little is Known »
Thu. Jul 21st 06:45 -- 06:50 PM Room Ballroom 3 & 4
More from the Same Authors
-
2023 Poster: The Computational Complexity of Concise Hypersphere Classification »
Eduard Eiben · Robert Ganian · Iyad Kanj · Sebastian Ordyniak · Stefan Szeider -
2021 Poster: Fixed-Parameter and Approximation Algorithms for PCA with Outliers »
Yogesh Dahiya · Fedor Fomin · Fahad Panolan · Kirill Simonov -
2021 Spotlight: Fixed-Parameter and Approximation Algorithms for PCA with Outliers »
Yogesh Dahiya · Fedor Fomin · Fahad Panolan · Kirill Simonov -
2019 Poster: Refined Complexity of PCA with Outliers »
Kirill Simonov · Fedor Fomin · Petr Golovach · Fahad Panolan -
2019 Oral: Refined Complexity of PCA with Outliers »
Kirill Simonov · Fedor Fomin · Petr Golovach · Fahad Panolan -
2018 Poster: Parameterized Algorithms for the Matrix Completion Problem »
Robert Ganian · Iyad Kanj · Sebastian Ordyniak · Stefan Szeider -
2018 Oral: Parameterized Algorithms for the Matrix Completion Problem »
Robert Ganian · Iyad Kanj · Sebastian Ordyniak · Stefan Szeider