Skip to yearly menu bar Skip to main content


P37: Structured, Flexible, and Robust: Benchmarking and Improving Large Language Models Towards More Human-like Behavior in Out-of-Distribution Reasoning Tasks

Jiahai Feng

Abstract

Chat is not available.