Skip to yearly menu bar Skip to main content


DePO: Elicit Chemical Reasoning Capability via Demonstration-Guided Policy Optimization

Xuan Li ⋅ Zhanke Zhou ⋅ Zongze Li ⋅ Jiangchao Yao ⋅ Yu Rong ⋅ Lu Zhang ⋅ Bo Han

Abstract

Video

Chat is not available.