Poster
Optimal and Practical Batched Linear Bandit Algorithm
Sanghoon Yu · Min-hwan Oh
West Exhibition Hall B2-B3 #W-914
Many real-world tasks involve repeatedly making decisions to achieve the best possible outcomes, such as recommending products online, selecting medical treatments in clinical trials, or dynamically setting prices for online shoppers. These problems often involve balancing the exploration of new options with the exploitation of already-known successful strategies. However, updating these decision-making systems too frequently can be impractical due to costs, ethical issues, or computational limitations.In this research, we tackle the challenge of making good decisions when updates can only happen occasionally—in batches rather than continuously. Existing methods that work well theoretically either fail in practice or require too much computation. We propose a new method, called BLAE, that carefully eliminates poor-performing options while selecting promising ones efficiently. Our method achieves optimal performance guarantees, meaning it performs nearly as well as theoretically possible under any conditions. Crucially, BLAE also performs strongly in real-world applications, requiring fewer updates and less computation.In short, our approach bridges the gap between theory and practice, providing an effective and practical solution for decision-making problems where frequent updates are not feasible.
Live content is unavailable. Log in and register to view live content