Invited Keynote #2
Dan Alistarh
2025 Invited Talk
in
Workshop: Tiny Titans: The next wave of On-Device Learning for Foundation Models (TTODLer-FM)
in
Workshop: Tiny Titans: The next wave of On-Device Learning for Foundation Models (TTODLer-FM)
Abstract
The last few years have seen an explosion of interest in Ai efficiency. One of the holy grails of the area has been training and inferencing models in end-to-end low-precision, for instance by leveraging the quantized matrix multiplication support on modern GPUs. In this talk, I will present some of our lab’s recent work on this topic, investigating low-precision training of LLMs. Specifically, I will cover a new state-of-the-art algorithm for quantized training called QuEST, discuss the limits of current approaches characterized via scaling laws, and about fast kernel support for low-precision training.
Speaker
Dan Alistarh
Dan Alistarh is a Professor at IST Austria. His research focuses on high-performance algorithms for machine learning, and spans from purely theoretical results to practical implementations.
Before ISTA, ge was a researcher at ETH Zurich and Microsoft Research, and a Postdoctoral Associate at MIT CSAIL. He received my PhD from the EPFL.
Video
Chat is not available.
Successful Page Load