Skip to yearly menu bar Skip to main content


Poster

The Pareto-optimal Trade-off between Regret and Statistical Inference in Linear Stochastic Bandits under Safety Constraints

Yuming Shao ⋅ Zhixuan Fang

Abstract

Log in and register to view live content