Timezone: »
As artificial agents proliferate, it is becoming increasingly important to ensure that their interactions with one another are well-behaved. In this paper, we formalize a common-sense notion of when algorithms are well-behaved: an algorithm is safe if it does no harm. Motivated by recent progress in deep learning, we focus on the specific case where agents update their actions according to gradient descent. The paper shows that that gradient descent converges to a Nash equilibrium in safe games. The main contribution is to define strongly-typed agents and show they are guaranteed to interact safely, thereby providing sufficient conditions to guarantee safe interactions. A series of examples show that strong-typing generalizes certain key features of convexity, is closely related to blind source separation, and introduces a new perspective on classical multilinear games based on tensor decomposition.
Author Information
David Balduzzi (Victoria University Wellington)
Related Events (a corresponding poster, oral, or spotlight)
-
2017 Poster: Strongly-Typed Agents are Guaranteed to Interact Safely »
Tue Aug 8th 08:30 AM -- 12:00 PM Room Gallery
More from the Same Authors
-
2019 Poster: Open-ended learning in symmetric zero-sum games »
David Balduzzi · Marta Garnelo · Yoram Bachrach · Wojciech Czarnecki · Julien Perolat · Max Jaderberg · Thore Graepel -
2019 Oral: Open-ended learning in symmetric zero-sum games »
David Balduzzi · Marta Garnelo · Yoram Bachrach · Wojciech Czarnecki · Julien Perolat · Max Jaderberg · Thore Graepel -
2018 Poster: The Mechanics of n-Player Differentiable Games »
David Balduzzi · Sebastien Racaniere · James Martens · Jakob Foerster · Karl Tuyls · Thore Graepel -
2018 Oral: The Mechanics of n-Player Differentiable Games »
David Balduzzi · Sebastien Racaniere · James Martens · Jakob Foerster · Karl Tuyls · Thore Graepel -
2017 Poster: Neural Taylor Approximations: Convergence and Exploration in Rectifier Networks »
David Balduzzi · Brian McWilliams · Tony Butler-Yeoman -
2017 Poster: The Shattered Gradients Problem: If resnets are the answer, then what is the question? »
David Balduzzi · Marcus Frean · Wan-Duo Ma · Brian McWilliams · Lennox Leary · John Lewis -
2017 Talk: The Shattered Gradients Problem: If resnets are the answer, then what is the question? »
David Balduzzi · Marcus Frean · Wan-Duo Ma · Brian McWilliams · Lennox Leary · John Lewis -
2017 Talk: Neural Taylor Approximations: Convergence and Exploration in Rectifier Networks »
David Balduzzi · Brian McWilliams · Tony Butler-Yeoman