Timezone: »
Emphatic algorithms have shown great promise in stabilizing and improving reinforcement learning by selectively emphasizing the update rule. Although the emphasis fundamentally depends on an interest function which defines the intrinsic importance of each state, most approaches simply adopt a uniform interest over all states (except where a hand-designed interest is possible based on domain knowledge). In this paper, we investigate adaptive methods that allow the interest function to dynamically vary over states and iterations. In particular, we leverage meta-gradients to automatically discover online an interest function that would accelerate the agent’s learning process. Empirical evaluations on a wide range of environments show that adapting the interest is key to provide significant gains. Qualitative analysis indicates that the learned interest function emphasizes states of particular importance, such as bottlenecks, which can be especially useful in a transfer learning setting.
Author Information
Martin Klissarov (McGill University)
Rasool Fakoor (Amazon Web Services)
Jonas Mueller (Amazon Web Services)
Kavosh Asadi (Amazon)
Taesup Kim (Seoul National University)
Alex Smola (Amazon)
More from the Same Authors
-
2021 : Multimodal AutoML on Structured Tables with Text Fields »
Xingjian Shi · Jonas Mueller · Nick Erickson · Mu Li · Alex Smola -
2021 : Continuous Doubly Constrained Batch Reinforcement Learning »
Rasool Fakoor · Jonas Mueller · Kavosh Asadi · Pratik Chaudhari · Alex Smola -
2022 : Back to the Basics: Revisiting Out-of-Distribution Detection Baselines »
Johnson Kuan · Jonas Mueller -
2022 : Efficient Task Adaptation by Mixing Discovered Skills »
Eunseok Yang · JUNGSUB RHIM · Taesup Kim -
2023 : Uncertainty-Guided Online Test-Time Adaptation via Meta-Learning »
kyubyung chae · Taesup Kim -
2023 : Budgeting Counterfactual for Offline RL »
Yao Liu · Pratik Chaudhari · Rasool Fakoor -
2023 : How to Cope with Gradual Data Drift? »
Rasool Fakoor · Jonas Mueller · Zachary Lipton · Pratik Chaudhari · Alex Smola -
2023 : Detecting Dataset Drift and Non-IID Sampling via k-Nearest Neighbors »
Jesse Cummings · Jonas Mueller · Elías Snorrason -
2023 : Estimating label quality and errors in semantic segmentation data via any model »
Vedang Lad · Jonas Mueller -
2023 : Detecting Errors in Numerical Data via any Regression Model »
Hang Zhou · Jonas Mueller · Mayank Kumar · Jane-Ling Wang · Jing Lei -
2023 : ObjectLab: Automated Diagnosis of Mislabeled Images in Object Detection Data »
Ulyana Tkachenko · Aditya Thyagarajan · Jonas Mueller -
2023 : UOTA: Unsupervised Open-Set Task Adaptation Using a Vision-Language Foundation Model »
Youngjo Min · Kwangrok Ryoo · Bumsoo Kim · Taesup Kim -
2023 Poster: RLSbench: Domain Adaptation Under Relaxed Label Shift »
Saurabh Garg · Nick Erickson · University of California James Sharpnack · Alex Smola · Sivaraman Balakrishnan · Zachary Lipton -
2023 Poster: Deep Laplacian-based Options for Temporally-Extended Exploration »
Martin Klissarov · Marlos C. Machado -
2023 Poster: Flexible Model Aggregation for Quantile Regression »
Rasool Fakoor · Taesup Kim · Jonas Mueller · Alexander Smola · Ryan Tibshirani -
2022 : Discussion Panel »
Percy Liang · Léon Bottou · Jayashree Kalpathy-Cramer · Alex Smola -
2022 : Model-Agnostic Label Quality Scoring to Detect Real-World Label Errors »
Jonas Mueller -
2022 Poster: Partial and Asymmetric Contrastive Learning for Out-of-Distribution Detection in Long-Tailed Recognition »
Haotao Wang · Aston Zhang · Yi Zhu · Shuai Zheng · Mu Li · Alex Smola · Zhangyang “Atlas” Wang -
2022 Oral: Partial and Asymmetric Contrastive Learning for Out-of-Distribution Detection in Long-Tailed Recognition »
Haotao Wang · Aston Zhang · Yi Zhu · Shuai Zheng · Mu Li · Alex Smola · Zhangyang “Atlas” Wang -
2021 : Q&A Contributed Talk »
Jonas Mueller -
2021 : Contributed Talk: Multimodal AutoML on Structured Tables with Text Fields »
Jonas Mueller -
2021 Poster: Deep Learning for Functional Data Analysis with Adaptive Basis Layers »
Junwen Yao · Jonas Mueller · Jane-Ling Wang -
2021 Spotlight: Deep Learning for Functional Data Analysis with Adaptive Basis Layers »
Junwen Yao · Jonas Mueller · Jane-Ling Wang -
2020 : Panel Discussion »
Neil Lawrence · Mihaela van der Schaar · Alex Smola · Valerio Perrone · Jack Parker-Holder · Zhengying Liu -
2020 : "AutoGluon and Distillation" by Alex Smola »
Alex Smola -
2020 : 1.2 AutoGluon-Tabular: Robust and Accurate AutoML for Structured Data »
Jonas Mueller -
2020 Poster: Educating Text Autoencoders: Latent Representation Guidance via Denoising »
Tianxiao Shen · Jonas Mueller · Regina Barzilay · Tommi Jaakkola -
2019 : posters »
Zhengxing Chen · Juan Jose Garau Luis · Ignacio Albert Smet · Aditya Modi · Sabina Tomkins · Riley Simmons-Edler · Hongzi Mao · Alexander Irpan · Hao Lu · Rose Wang · Subhojyoti Mukherjee · Aniruddh Raghu · Syed Arbab Mohd Shihab · Byung Hoon Ahn · Rasool Fakoor · Pratik Chaudhari · Elena Smirnova · Min-hwan Oh · Xiaocheng Tang · Tony Qin · Qingyang Li · Marc Brittain · Ian Fox · Supratik Paul · Xiaofeng Gao · Yinlam Chow · Gabriel Dulac-Arnold · Ofir Nachum · Nikos Karampatziakis · Bharathan Balaji · Supratik Paul · Ali Davody · Djallel Bouneffouf · Himanshu Sahni · Soo Kim · Andrey Kolobov · Alexander Amini · Yao Liu · Xinshi Chen · · Craig Boutilier -
2019 Poster: Deep Factors for Forecasting »
Yuyang Wang · Alex Smola · Danielle Robinson · Jan Gasthaus · Dean Foster · Tim Januschowski -
2019 Oral: Deep Factors for Forecasting »
Yuyang Wang · Alex Smola · Danielle Robinson · Jan Gasthaus · Dean Foster · Tim Januschowski -
2019 Tutorial: A Tutorial on Attention in Deep Learning »
Alex Smola · Aston Zhang -
2018 Poster: Learning Steady-States of Iterative Algorithms over Graphs »
Hanjun Dai · Zornitsa Kozareva · Bo Dai · Alex Smola · Le Song -
2018 Oral: Learning Steady-States of Iterative Algorithms over Graphs »
Hanjun Dai · Zornitsa Kozareva · Bo Dai · Alex Smola · Le Song -
2017 Poster: Canopy --- Fast Sampling with Cover Trees »
Manzil Zaheer · Satwik Kottur · Amr Ahmed · Jose Moura · Alex Smola -
2017 Talk: Canopy --- Fast Sampling with Cover Trees »
Manzil Zaheer · Satwik Kottur · Amr Ahmed · Jose Moura · Alex Smola -
2017 Poster: Latent LSTM Allocation: Joint clustering and non-linear dynamic modeling of sequence data »
Manzil Zaheer · Amr Ahmed · Alex Smola -
2017 Talk: Latent LSTM Allocation: Joint clustering and non-linear dynamic modeling of sequence data »
Manzil Zaheer · Amr Ahmed · Alex Smola -
2017 Tutorial: Distributed Deep Learning with MxNet Gluon »
Alex Smola · Aran Khanna