Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Tiny Titans: The next wave of On-Device Learning for Foundation Models (TTODLer-FM)
Fri, Jul 18, 2025 • 1:00 PM – 1:45 PM PDT

Zeroth-Order Optimization is Secretly Single-Step Policy Optimization

Junbin Qiu · Zhengpeng Xie · Xiangda Yan · Yongjie Yang · Yao Shu

Abstract

Chat is not available.