147 Results
Type | Time | Title | Authors
Workshop | | Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training | Hong Liu · Zhiyuan Li · David Hall · Percy Liang · Tengyu Ma
Poster | Tue 17:00 | Diffusion Models for Black-Box Optimization | Siddarth Krishnamoorthy · Satvik Mehul Mashkaria · Aditya Grover
Poster | Thu 13:30 | Discrete Continuous Optimization Framework for Simultaneous Clustering and Training in Mixture Models | Parth Sangani · Arjun Kashettiwar · Pritish Chakraborty · Bhuvan Gangula · Durga Sivasubramanian · Ganesh Ramakrishnan · Rishabh Iyer · Abir De
Poster | Wed 14:00 | Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL | Taku Yamagata · Ahmed Khalil · Raul Santos-Rodriguez
Workshop | | Learning to Optimize Non-Convex Sum-Rate Maximization Problems | Qingyu Song · Guochen Liu · Hong Xu
Poster | Wed 14:00 | Differentially Private Optimization on Large Model at Small Cost | Zhiqi Bu · Yu-Xiang Wang · Sheng Zha · George Karypis
Poster | Tue 17:00 | How to Trust Your Diffusion Model: A Convex Optimization Approach to Conformal Risk Control | Jacopo Teneggi · Matthew Tivnan · Web Stayman · Jeremias Sulam
Poster | Wed 14:00 | Beyond Reward: Offline Preference-guided Policy Optimization | Yachen Kang · Diyuan Shi · Jinxin Liu · Li He · Donglin Wang
Workshop | | Risk-Aware Image Generation by Estimating and Propagating Uncertainty | Alejandro Perez · Iaroslav Elistratov · Fynn Schmitt-Ulms · Ege Demir · Sadhana Lolla · Elaheh Ahmadi · Daniela Rus · Alexander Amini
Poster | Wed 17:00 | Automatically Auditing Large Language Models via Discrete Optimization | Erik Jones · Anca Dragan · Aditi Raghunathan · Jacob Steinhardt
Workshop | | Combining Thermodynamics-based Model of the Centrifugal Compressors and Active Machine Learning for Enhanced Industrial Design Optimization | Shadi Ghiasi · Guido Pazzi · Concettina Del Grosso · Giovanni De Magistris · Giacomo Veneri
Workshop | | Direct Preference Optimization: Your Language Model is Secretly a Reward Model | Rafael Rafailov · Archit Sharma · Eric Mitchell · Stefano Ermon · Christopher Manning · Chelsea Finn