Timezone: »
Spotlight
Rotting Infinitely Many-Armed Bandits
Jung-hun Kim · Milan Vojnovic · Se-Young Yun
We consider the infinitely many-armed bandit problem with rotting rewards, where the mean reward of an arm decreases at each pull of the arm according to an arbitrary trend with maximum rotting rate $\varrho=o(1)$. We show that this learning problem has an $\Omega(\max\{\varrho^{1/3}T, \sqrt{T}\})$ worst-case regret lower bound where $T$ is the time horizon. We show that a matching upper bound $\tilde{O}(\max\{\varrho^{1/3}T, \sqrt{T}\})$, up to a poly-logarithmic factor, can be achieved by an algorithm that uses a UCB index for each arm and a threshold value to decide whether to continue pulling an arm or remove the arm from further consideration, when the algorithm knows the value of the maximum rotting rate $\varrho$. We also show that an $\tilde{O}(\max\{\varrho^{1/3}T, T^{3/4}\})$ regret upper bound can be achieved by an algorithm that does not know the value of $\varrho$, by using an adaptive UCB index along with an adaptive threshold value.
Author Information
Jung-hun Kim (KAIST)
Milan Vojnovic (London School of Economics)
Se-Young Yun (KAIST)
Related Events (a corresponding poster, oral, or spotlight)
-
2022 Poster: Rotting Infinitely Many-Armed Bandits »
Tue. Jul 19th through Wed the 20th Room Hall E #1017
More from the Same Authors
-
2022 : ALASCA: Rethinking Label Smoothing for Deep Learning Under Label Noise »
Jongwoo Ko · Bongsoo Yi · Se-Young Yun -
2023 : An Optimal Clustering Algorithm for the Labeled Stochastic Block Model »
Kaito Ariu · Se-Young Yun · Alexandre Proutiere -
2023 Poster: Doubly Adversarial Federated Bandits »
Jialin Yi · Milan Vojnovic -
2022 : Revisiting Architecture-aware Knowledge Distillation: Smaller Models and Faster Search »
Taehyeon Kim · Heesoo Myeong · Se-Young Yun -
2022 : Supernet Training for Federated Image Classification »
Taehyeon Kim · Se-Young Yun -
2021 Poster: Improved Regret Bounds of Bilinear Bandits using Action Space Analysis »
Kyoungseok Jang · Kwang-Sung Jun · Se-Young Yun · Wanmo Kang -
2021 Poster: Pure Exploration and Regret Minimization in Matching Bandits »
Flore Sentenac · Jialin Yi · Clément Calauzènes · Vianney Perchet · Milan Vojnovic -
2021 Spotlight: Pure Exploration and Regret Minimization in Matching Bandits »
Flore Sentenac · Jialin Yi · Clément Calauzènes · Vianney Perchet · Milan Vojnovic -
2021 Spotlight: Improved Regret Bounds of Bilinear Bandits using Action Space Analysis »
Kyoungseok Jang · Kwang-Sung Jun · Se-Young Yun · Wanmo Kang -
2019 Poster: Spectral Approximate Inference »
Sejun Park · Eunho Yang · Se-Young Yun · Jinwoo Shin -
2019 Oral: Spectral Approximate Inference »
Sejun Park · Eunho Yang · Se-Young Yun · Jinwoo Shin