Poster
Clustering Items through Bandit Feedback: Finding the Right Feature out of Many
Maximilian Graf · Victor Thuot · Nicolas Verzelen
East Exhibition Hall A-B #E-1811
Imagine you're trying to group a collection of objects—like images of vehicles—into two categories, but you don’t know the properties that define those categories. Each object has many characteristics (such as the number of visible wheels), but checking a characteristic takes time and resources. On top of that, each observation is noisy—it can contain errors. To solve this task, we proceed sequentially: one step at a time, we choose an object and which characteristic to observe on that object, aiming to gradually uncover both which characteristic matter and how the objects are grouped. We ask the following question: can we correctly sort all the objects while observing as few characteristics as possible?The key idea is that some features are especially informative for distinguishing between the two groups. Our method learns to identify and focus on these useful features throughout the observation process. We evaluate the performance of our method by measuring the number of observations it requires, and we show that no other method can achieve the same goal with significantly fewer observations.
Live content is unavailable. Log in and register to view live content