Workshop
|
|
Scaling Automated Quantum Error Correction Discovery with Reinforcement Learning
Jan Olle · Remmy Zen · Matteo Puviani · Florian Marquardt
|
|
Poster
|
Thu 2:30
|
Offline-Boosted Actor-Critic: Adaptively Blending Optimal Historical Behaviors in Deep Off-Policy RL
Yu Luo · Tianying Ji · Fuchun Sun · Jianwei Zhang · Huazhe Xu · Xianyuan Zhan
|
|
Oral
|
Wed 8:15
|
Rate-Optimal Policy Optimization for Linear Markov Decision Processes
Uri Sherman · Alon Cohen · Tomer Koren · Yishay Mansour
|
|
Poster
|
Wed 4:30
|
Rate-Optimal Policy Optimization for Linear Markov Decision Processes
Uri Sherman · Alon Cohen · Tomer Koren · Yishay Mansour
|
|
Poster
|
Tue 2:30
|
ACPO: A Policy Optimization Algorithm for Average MDPs with Constraints
Akhil Agnihotri · Rahul Jain · Haipeng Luo
|
|
Poster
|
Thu 4:30
|
SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP
Subhojyoti Mukherjee · Josiah Hanna · Robert Nowak
|
|
Poster
|
Tue 2:30
|
Degeneration-free Policy Optimization: RL Fine-Tuning for Language Models without Degeneration
Youngsoo Jang · Geon-Hyeong Kim · Byoungjip Kim · Yu Jin Kim · Honglak Lee · Moontae Lee
|
|
Poster
|
Wed 2:30
|
Reflective Policy Optimization
Yaozhong Gan · yan renye · zhe wu · Junliang Xing
|
|
Poster
|
Tue 2:30
|
Risk-Sensitive Policy Optimization via Predictive CVaR Policy Gradient
Ju-Hyun Kim · Seungki Min
|
|
Poster
|
Tue 2:30
|
Iterative Regularized Policy Optimization with Imperfect Demonstrations
Xudong Gong · Feng Dawei · Kele Xu · Yuanzhao Zhai · Chengkang Yao · Weijia Wang · Bo Ding · Huaimin Wang
|
|
Poster
|
Thu 4:30
|
Adaptive-Gradient Policy Optimization: Enhancing Policy Learning in Non-Smooth Differentiable Simulations
Feng Gao · Liangzhi Shi · Shenao Zhang · Zhaoran Wang · Yi Wu
|
|
Workshop
|
|
A Policy Optimization Approach to the Solution of Unregularized Mean Field Games
Sihan Zeng · Sujay Bhatt · Alec Koppel · Sumitra Ganesh
|
|