Timezone: »
Iterative methods for approximating zero-sum Nash equilibria in extensive-form games have been a core component of recent advances in superhuman poker AIs. In this talk, I will first give an optimization-oriented description of how these methods work. Then, I will discuss and contrast two recent results: First, the development of a new entropy-based regularization method for the decision spaces associated with extensive-form games, which is simultaneously simpler to analyze and has better theoretical properties than the current state of the art. Second, I will discuss new algorithms based on optimistic variants of regret matching and CFR, which lead to very strong practical performance, in spite of inferior theoretical guarantees.
This talk is based on joint work with Gabriele Farina and Tuomas Sandholm.
Author Information
Christian Kroer (Columbia University)
More from the Same Authors
-
2023 Poster: Statistical Inference and A/B Testing for First-Price Pacing Equilibria »
Luofeng Liao · Christian Kroer -
2022 Poster: Kernelized Multiplicative Weights for 0/1-Polyhedral Games: Bridging the Gap Between Learning in Extensive-Form and Normal-Form Games »
Gabriele Farina · Chung-Wei Lee · Haipeng Luo · Christian Kroer -
2022 Poster: Online Learning with Knapsacks: the Best of Both Worlds »
Matteo Castiglioni · Andrea Celli · Christian Kroer -
2022 Spotlight: Online Learning with Knapsacks: the Best of Both Worlds »
Matteo Castiglioni · Andrea Celli · Christian Kroer -
2022 Spotlight: Kernelized Multiplicative Weights for 0/1-Polyhedral Games: Bridging the Gap Between Learning in Extensive-Form and Normal-Form Games »
Gabriele Farina · Chung-Wei Lee · Haipeng Luo · Christian Kroer -
2021 Poster: First-Order Methods for Wasserstein Distributionally Robust MDP »
Julien Grand-Clement · Christian Kroer -
2021 Spotlight: First-Order Methods for Wasserstein Distributionally Robust MDP »
Julien Grand-Clement · Christian Kroer -
2020 Poster: Stochastic Regret Minimization in Extensive-Form Games »
Gabriele Farina · Christian Kroer · Tuomas Sandholm -
2019 Poster: Stable-Predictive Optimistic Counterfactual Regret Minimization »
Gabriele Farina · Christian Kroer · Noam Brown · Tuomas Sandholm -
2019 Poster: Regret Circuits: Composability of Regret Minimizers »
Gabriele Farina · Christian Kroer · Tuomas Sandholm -
2019 Oral: Stable-Predictive Optimistic Counterfactual Regret Minimization »
Gabriele Farina · Christian Kroer · Noam Brown · Tuomas Sandholm -
2019 Oral: Regret Circuits: Composability of Regret Minimizers »
Gabriele Farina · Christian Kroer · Tuomas Sandholm