DAL: A Practical Prior-Free Black-Box Framework for Piecewise Stationary Bandits
Argyrios Gerogiannis ⋅ Yu-Han Huang ⋅ Subhonmesh Bose ⋅ Venugopal Veeravalli
Abstract
We introduce a practical, black-box framework termed Detection Augmented Learning (DAL) for the problem of piecewise stationary bandits without knowledge of the underlying non-stationarity. DAL accepts any stationary bandit algorithm with order-optimal regret as input and augments it with a change detector, enabling applicability to all common bandit variants. Extensive experimentation demonstrates that DAL consistently surpasses all state-of-the-art methods across diverse non-stationary scenarios, including synthetic benchmarks and real-world datasets, underscoring its versatility and scalability. We provide theoretical insights into DAL's strong empirical performance, complemented by thorough empirical validation.
Successful Page Load