We initiate the study of fairness in reinforcement learning, where the actions of a learning algorithm may affect its environment and future rewards. Our fairness constraint requires that an algorithm never prefers one action over another if the long-term (discounted) reward of choosing the latter action is higher. Our first result is negative: despite the fact that fairness is consistent with the optimal policy, any learning algorithm satisfying fairness must take time exponential in the number of states to achieve non-trivial approximation to the optimal policy. We then provide a provably fair polynomial time algorithm under an approximate notion of fairness, thus establishing an exponential gap between exact and approximate fairness.
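The fairness constraint described in the abstract can be illustrated for a single state with a minimal sketch. It assumes the long-term discounted reward of each action is already known and passed in as `q_values`, and that "preferring" an action means choosing it with higher probability in `action_probs`. The function name `is_fair`, its parameters, and the tolerance `tol` are hypothetical and are not taken from the paper.

```python
from itertools import permutations


def is_fair(action_probs, q_values, tol=1e-12):
    """Check the fairness condition in one state (hypothetical helper).

    Returns False if some action is chosen with higher probability than
    another action whose long-term discounted reward is strictly higher.
    """
    if len(action_probs) != len(q_values):
        raise ValueError("need exactly one probability per action")
    for a, b in permutations(range(len(q_values)), 2):
        # Assumption: action a is "preferred" over b when it is chosen
        # with strictly higher probability (up to a small tolerance).
        if q_values[b] > q_values[a] and action_probs[a] > action_probs[b] + tol:
            return False
    return True


q = [1.0, 2.0, 2.0]  # illustrative discounted rewards for three actions
print(is_fair([0.2, 0.4, 0.4], q))     # favors the better actions
print(is_fair([1 / 3, 1 / 3, 1 / 3], q))  # uniform play
print(is_fair([0.5, 0.25, 0.25], q))   # favors the worse action
```

Under these assumptions, uniform play passes the check regardless of `q_values`, while any distribution that puts more probability on a strictly worse action fails it.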
Author Information
Shahin Jabbari (University of Pennsylvania)
Matthew Joseph (University of Pennsylvania)
Michael Kearns (University of Pennsylvania)
Jamie Morgenstern (University of Pennsylvania)
Aaron Roth (University of Pennsylvania)
Related Events (a corresponding poster, oral, or spotlight)
-
2017 Poster: Fairness in Reinforcement Learning »
Mon. Aug 7th 08:30 AM -- 12:00 PM Room Gallery #20
More from the Same Authors
-
2021 : Adaptive Machine Unlearning »
Varun Gupta · Christopher Jung · Seth Neel · Aaron Roth · Saeed Sharifi-Malvajerdi · Chris Waites -
2022 : Individually Fair Learning with One-Sided Feedback »
Yahav Bechavod · Aaron Roth -
2023 Poster: Characterizing Multicalibration via Property Elicitation »
Georgy Noarov · Aaron Roth -
2023 Poster: Individually Fair Learning with One-Sided Feedback »
Yahav Bechavod · Aaron Roth -
2023 Poster: Multicalibration as Boosting for Regression »
Ira Globus-Harris · Declan Harrison · Michael Kearns · Aaron Roth · Jessica Sorrell -
2023 Oral: Multicalibration as Boosting for Regression »
Ira Globus-Harris · Declan Harrison · Michael Kearns · Aaron Roth · Jessica Sorrell -
2021 Poster: Differentially Private Query Release Through Adaptive Projection »
Sergul Aydore · William Brown · Michael Kearns · Krishnaram Kenthapadi · Luca Melis · Aaron Roth · Ankit Siva -
2021 Oral: Differentially Private Query Release Through Adaptive Projection »
Sergul Aydore · William Brown · Michael Kearns · Krishnaram Kenthapadi · Luca Melis · Aaron Roth · Ankit Siva -
2019 Poster: Differentially Private Fair Learning »
Matthew Jagielski · Michael Kearns · Jieming Mao · Alina Oprea · Aaron Roth · Saeed Sharifi-Malvajerdi · Jonathan Ullman -
2019 Oral: Differentially Private Fair Learning »
Matthew Jagielski · Michael Kearns · Jieming Mao · Alina Oprea · Aaron Roth · Saeed Sharifi-Malvajerdi · Jonathan Ullman -
2018 Poster: Preventing Fairness Gerrymandering: Auditing and Learning for Subgroup Fairness »
Michael Kearns · Seth Neel · Aaron Roth · Steven Wu -
2018 Oral: Preventing Fairness Gerrymandering: Auditing and Learning for Subgroup Fairness »
Michael Kearns · Seth Neel · Aaron Roth · Steven Wu -
2018 Poster: Mitigating Bias in Adaptive Data Gathering via Differential Privacy »
Seth Neel · Aaron Roth -
2018 Oral: Mitigating Bias in Adaptive Data Gathering via Differential Privacy »
Seth Neel · Aaron Roth -
2017 Poster: Meritocratic Fairness for Cross-Population Selection »
Michael Kearns · Aaron Roth · Steven Wu -
2017 Talk: Meritocratic Fairness for Cross-Population Selection »
Michael Kearns · Aaron Roth · Steven Wu