Skip to yearly menu bar Skip to main content


Poster

QUATRO: Query-Adaptive Trust Region Policy Optimization for LLM Fine-tuning

Doyeon Lee ⋅ Eunyi Lyou ⋅ Hyunsoo Cho ⋅ Soo Kyung Kim ⋅ Joonseok Lee ⋅ Jaemoo Choi

Abstract

Log in and register to view live content