ICML Federated Ensemble-Directed Offline Reinforcement Learning

Poster
in
Workshop: Federated Learning and Analytics in Practice: Algorithms, Systems, Applications, and Opportunities

Federated Ensemble-Directed Offline Reinforcement Learning

Desik Rengarajan · Nitin Ragothaman · Dileep Kalathil · Srinivas Shakkottai

[ Abstract ] [ Project Page ]

[ OpenReview]

Abstract:

We consider the problem of federated offline reinforcement learning (RL), where clients must collaboratively learn a control policy only using data collected using unknown behavior policies. Naively combining a standard offline RL approach with a standard federated learning approach to solve this problem can lead to poorly performing policies. We develop Federated Ensemble-Directed Offline Reinforcement Learning Algorithm (FEDORA), which distills the collective wisdom of the clients using an ensemble learning approach. We show that FEDORA significantly outperforms other approaches, including offline RL over the combined data pool, in various complex continuous control and real-world environments.

Chat is not available.

Poster in Workshop: Federated Learning and Analytics in Practice: Algorithms, Systems, Applications, and Opportunities

Federated Ensemble-Directed Offline Reinforcement Learning

Desik Rengarajan · Nitin Ragothaman · Dileep Kalathil · Srinivas Shakkottai

Poster
in
Workshop: Federated Learning and Analytics in Practice: Algorithms, Systems, Applications, and Opportunities