Poster
Multi-Armed Bandits with Interference: Bridging Causal Inference and Adversarial Bandits
Su Jia · Peter Frazier · Nathan Kallus
West Exhibition Hall B2-B3 #W-908
Imagine we’re running an online food delivery platform and want to test multiple promotion campaigns (i.e., treatments) to maximize total revenue over a sales season. A key challenge is interference between locations (e.g., ZIP codes): the effectiveness of a promotion at one location can depend heavily on what promotions are assigned to nearby locations, since they may compete for shared resources like delivery drivers.A naive approach is to assign promotions independently to each location and average the resulting revenues. Another naive method is switchback: assigning the historically best-performing promotion to all locations in each period.We propose a more effective alternative based on clustered randomization: we first group locations into clusters and assign promotions at the cluster level, favoring arms with strong historical performance. We show that this approach outperforms the above baselines by achieving the best possible worst-case expected revenue while also being more robust — i.e., significantly less likely to result in very low revenue.
Live content is unavailable. Log in and register to view live content