Toggle Poster Visibility
Oral
Tue Jul 23 05:30 PM -- 05:45 PM (KST) @ Hall C 1-3 None
Debating with More Persuasive LLMs Leads to More Truthful Answers
[
Slides]
Oral
Tue Jul 23 05:45 PM -- 06:00 PM (KST) @ Hall C 1-3 None
Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision
Oral
Tue Jul 23 06:00 PM -- 06:15 PM (KST) @ Hall C 1-3 None
A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity
Oral
Tue Jul 23 06:15 PM -- 06:30 PM (KST) @ Hall C 1-3 None
Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study
Successful Page Load