Timezone: »

 
Poster
Preselection Bandits
Viktor Bengs · Eyke Hüllermeier

Thu Jul 16 01:00 PM -- 01:45 PM & Fri Jul 17 02:00 AM -- 02:45 AM (PDT) @ Virtual #None

In this paper, we introduce the Preselection Bandit problem, in which the learner preselects a subset of arms (choice alternatives) for a user, which then chooses the final arm from this subset. The learner is not aware of the user's preferences, but can learn them from observed choices. In our concrete setting, we allow these choices to be stochastic and model the user's actions by means of the Plackett-Luce model. The learner's main task is to preselect subsets that eventually lead to highly preferred choices. To formalize this goal, we introduce a reasonable notion of regret and derive lower bounds on the expected regret. Moreover, we propose algorithms for which the upper bound on expected regret matches the lower bound up to a logarithmic term of the time horizon.

Author Information

Viktor Bengs (University of Paderborn)
Eyke Hüllermeier (Paderborn University)

More from the Same Authors

  • 2021 : Aleatoric and epistemic uncertainty in machine learning: an introduction to concepts and methods (Spotlight #4) »
    Eyke Hüllermeier
  • 2019 : Poster Session 1 (all papers) »
    Matilde Gargiani · Yochai Zur · Chaim Baskin · Evgenii Zheltonozhskii · Liam Li · Ameet Talwalkar · Xuedong Shang · Harkirat Singh Behl · Atilim Gunes Baydin · Ivo Couckuyt · Tom Dhaene · Chieh Lin · Wei Wei · Min Sun · Orchid Majumder · Michele Donini · Yoshihiko Ozaki · Ryan P. Adams · Christian Geißler · Ping Luo · zhanglin peng · · Ruimao Zhang · John Langford · Rich Caruana · Debadeepta Dey · Charles Weill · Xavi Gonzalvo · Scott Yang · Scott Yak · Eugen Hotaj · Vladimir Macko · Mehryar Mohri · Corinna Cortes · Stefan Webb · Jonathan Chen · Martin Jankowiak · Noah Goodman · Aaron Klein · Frank Hutter · Mojan Javaheripi · Mohammad Samragh · Sungbin Lim · Taesup Kim · SUNGWOONG KIM · Michael Volpp · Iddo Drori · Yamuna Krishnamurthy · Kyunghyun Cho · Stanislaw Jastrzebski · Quentin de Laroussilhe · Mingxing Tan · Xiao Ma · Neil Houlsby · Andrea Gesmundo · Zalán Borsos · Krzysztof Maziarz · Felipe Petroski Such · Joel Lehman · Kenneth Stanley · Jeff Clune · Pieter Gijsbers · Joaquin Vanschoren · Felix Mohr · Eyke Hüllermeier · Zheng Xiong · Wenpeng Zhang · wenwu zhu · Weijia Shao · Aleksandra Faust · Michal Valko · Michael Y Li · Hugo Jair Escalante · Marcel Wever · Andrey Khorlin · Tara Javidi · Anthony Francis · Saurajit Mukherjee · Jungtaek Kim · Michael McCourt · Saehoon Kim · Tackgeun You · Seungjin Choi · Nicolas Knudde · Alexander Tornede · Ghassen Jerfel
  • 2018 Poster: Ranking Distributions based on Noisy Sorting »
    Adil El Mesaoudi-Paul · Eyke Hüllermeier · Robert Busa-Fekete
  • 2018 Oral: Ranking Distributions based on Noisy Sorting »
    Adil El Mesaoudi-Paul · Eyke Hüllermeier · Robert Busa-Fekete
  • 2017 Poster: Statistical Inference for Incomplete Ranking Data: The Case of Rank-Dependent Coarsening »
    Mohsen Ahmadi Fahandar · Eyke Hüllermeier · Ines Couso
  • 2017 Talk: Statistical Inference for Incomplete Ranking Data: The Case of Rank-Dependent Coarsening »
    Mohsen Ahmadi Fahandar · Eyke Hüllermeier · Ines Couso