Toggle Poster Visibility
Oral
Thu Jul 09 04:00 PM -- 04:15 PM (KST) None
Expressivity-Efficiency Tradeoffs for Hybrid Sequence Models
In
Oral 6A
[ OpenReview]
Oral
Thu Jul 09 04:15 PM -- 04:30 PM (KST) None
How much can language models memorize?
In
Oral 6A
[ OpenReview]
Oral
Thu Jul 09 04:30 PM -- 04:45 PM (KST) None
Prescriptive Scaling Reveals the Evolution of Language Model Capabilities
In
Oral 6A
[ OpenReview]
Oral
Thu Jul 09 04:45 PM -- 05:00 PM (KST) None
Procedural Pretraining: Warming Up Language Models with Abstract Data
In
Oral 6A
[ OpenReview]
Successful Page Load