Search All 2024 Events
 

48 Results

Page 1 of 4
Poster
Tue 4:30 Calibration Bottleneck: Over-compressed Representations are Less Calibratable
Deng-Bao Wang · Min-Ling Zhang
Oral
Wed 7:45 ExCP: Extreme LLM Checkpoint Compression via Weight-Momentum Joint Shrinking
Wenshuo Li · Xinghao Chen · Han Shu · Yehui Tang · Yunhe Wang
Poster
Tue 4:30 LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging
Jinuk Kim · Marwa El Halabi · Mingi Ji · Hyun Oh Song
Poster
Thu 2:30 Get More with LESS: Synthesizing Recurrence with KV Cache Compression for Efficient LLM Inference
Harry Dong · Xinyu Yang · Zhenyu Zhang · Zhangyang “Atlas” Wang · Yuejie Chi · Beidi Chen
Poster
Tue 4:30 Compressing Large Language Models by Joint Sparsification and Quantization
Jinyang Guo · Jianyu Wu · Zining Wang · Jiaheng Liu · Ge Yang · Yifu Ding · Ruihao Gong · Haotong Qin · Xianglong Liu
Poster
Wed 4:30 Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression
Junyuan Hong · Jinhao Duan · Chenhui Zhang · Zhangheng Li · Chulin Xie · Kelsey Lieberman · James Diffenderfer · Brian Bartoldson · Ajay Jaiswal · Kaidi Xu · Bhavya Kailkhura · Dan Hendrycks · Dawn Song · Zhangyang “Atlas” Wang · Bo Li
Poster
Thu 4:30 Dynamic Memory Compression: Retrofitting LLMs for Accelerated Inference
Piotr Nawrot · Adrian Łańcucki · Marcin Chochowski · David Tarjan · Edoardo Ponti
Poster
Tue 2:30 Compression of Structured Data with Autoencoders: Provable Benefit of Nonlinearities and Depth
Kevin Kögler · Aleksandr Shevchenko · Hamed Hassani · Marco Mondelli
Poster
Wed 4:30 ExCP: Extreme LLM Checkpoint Compression via Weight-Momentum Joint Shrinking
Wenshuo Li · Xinghao Chen · Han Shu · Yehui Tang · Yunhe Wang
Poster
Tue 4:30 Debiased Distribution Compression
Lingxiao Li · Raaz Dwivedi · Lester Mackey
Poster
Thu 2:30 LoRAP: Transformer Sub-Layers Deserve Differentiated Structured Compression for Large Language Models
Guangyan Li · Yongqiang Tang · Wensheng Zhang
Oral
Thu 7:45 Data-free Neural Representation Compression with Riemannian Neural Dynamics
Zhengqi Pei · Anran Zhang · Shuhui Wang · Xiangyang Ji · Qingming Huang