In meta-learning, an agent extracts knowledge from observed tasks, aiming to facilitate learning of novel future tasks. Under the assumption that future tasks are 'related' to previous tasks, representations should be learned in such a way that they capture the common structure across learned tasks, while allowing the learner sufficient flexibility to adapt to novel aspects of a new task. We present a framework for meta-learning that is based on generalization error bounds, allowing us to extend various PAC-Bayes bounds to meta-learning. Learning takes place through the construction of a distribution over hypotheses based on the observed tasks, and its utilization for learning a new task. Thus, prior knowledge is incorporated through setting an experience-dependent prior for novel tasks. We develop a gradient-based algorithm that minimizes an objective function derived from the bounds, implement it for deep neural networks, and demonstrate its effectiveness numerically. In addition to establishing the improved performance available through meta-learning, we demonstrate the intuitive way in which prior information is manifested at different levels of the network.
Author Information
Ron Amit (Technion – Israel Institute of Technology)
Ron Meir (Technion – Israel Institute of Technology)
Related Events (a corresponding poster, oral, or spotlight)
- 2018 Oral: Meta-Learning by Adjusting Priors Based on Extended PAC-Bayes Theory
  Wed. Jul 11th 09:20 -- 09:40 AM, Room A3
More from the Same Authors
- 2021 Poster: Ensemble Bootstrapping for Q-Learning
  Oren Peer · Chen Tessler · Nadav Merlis · Ron Meir
- 2021 Spotlight: Ensemble Bootstrapping for Q-Learning
  Oren Peer · Chen Tessler · Nadav Merlis · Ron Meir
- 2020 Poster: Option Discovery in the Absence of Rewards with Manifold Analysis
  Amitay Bar · Ronen Talmon · Ron Meir
- 2020 Poster: Discount Factor as a Regularizer in Reinforcement Learning
  Ron Amit · Ron Meir · Kamil Ciosek
- 2019 Poster: Distributional Multivariate Policy Evaluation and Exploration with the Bellman GAN
  dror freirich · Tzahi Shimkin · Ron Meir · Aviv Tamar
- 2019 Oral: Distributional Multivariate Policy Evaluation and Exploration with the Bellman GAN
  dror freirich · Tzahi Shimkin · Ron Meir · Aviv Tamar