Timezone: »
With the development of deep networks on various large-scale datasets, a large zoo of pretrained models are available. When transferring from a model zoo, applying classic single-model-based transfer learning methods to each source model suffers from high computational cost and cannot fully utilize the rich knowledge in the zoo. We propose \emph{Zoo-Tuning} to address these challenges, which learns to adaptively transfer the parameters of pretrained models to the target task. With the learnable channel alignment layer and adaptive aggregation layer, Zoo-Tuning \emph{adaptively aggregates channel aligned pretrained parameters to derive the target model}, which simultaneously promotes knowledge transfer and adapts source models to downstream tasks. The adaptive aggregation substantially reduces the computation cost at both training and inference. We further propose lite Zoo-Tuning with the temporal ensemble of batch average gating values to reduce the storage cost at the inference time. We evaluate our approach on a variety of tasks, including reinforcement learning, image classification, and facial landmark detection. Experiment results demonstrate that the proposed adaptive transfer learning approach can more effectively and efficiently transfer knowledge from a zoo of models.
Author Information
Yang Shu (Tsinghua University)
Zhi Kou (Tsinghua University)
Zhangjie Cao (Tsinghua University)
Jianmin Wang (Tsinghua University)
Mingsheng Long (Tsinghua University)
Related Events (a corresponding poster, oral, or spotlight)
-
2021 Spotlight: Zoo-Tuning: Adaptive Transfer from A Zoo of Models »
Tue. Jul 20th 12:40 -- 12:45 PM Room
More from the Same Authors
-
2023 Poster: CLIPood: Generalizing CLIP to Out-of-Distributions »
Yang Shu · Xingzhuo Guo · Jialong Wu · Ximei Wang · Jianmin Wang · Mingsheng Long -
2023 Poster: Solving High-Dimensional PDEs with Latent Spectral Models »
Haixu Wu · Tengge Hu · huakun luo · Jianmin Wang · Mingsheng Long -
2023 Poster: Estimating Heterogeneous Treatment Effects: Mutual Information Bounds and Learning Algorithms »
Xingzhuo Guo · Yuchen Zhang · Jianmin Wang · Mingsheng Long -
2022 Poster: Flowformer: Linearizing Transformers with Conservation Flows »
Haixu Wu · Jialong Wu · Jiehui Xu · Jianmin Wang · Mingsheng Long -
2022 Spotlight: Flowformer: Linearizing Transformers with Conservation Flows »
Haixu Wu · Jialong Wu · Jiehui Xu · Jianmin Wang · Mingsheng Long -
2021 Poster: LogME: Practical Assessment of Pre-trained Models for Transfer Learning »
Kaichao You · Yong Liu · Jianmin Wang · Mingsheng Long -
2021 Spotlight: LogME: Practical Assessment of Pre-trained Models for Transfer Learning »
Kaichao You · Yong Liu · Jianmin Wang · Mingsheng Long -
2021 Poster: Representation Subspace Distance for Domain Adaptation Regression »
Xinyang Chen · Sinan Wang · Jianmin Wang · Mingsheng Long -
2021 Spotlight: Representation Subspace Distance for Domain Adaptation Regression »
Xinyang Chen · Sinan Wang · Jianmin Wang · Mingsheng Long -
2021 Poster: Self-Tuning for Data-Efficient Deep Learning »
Ximei Wang · Jinghan Gao · Mingsheng Long · Jianmin Wang -
2021 Spotlight: Self-Tuning for Data-Efficient Deep Learning »
Ximei Wang · Jinghan Gao · Mingsheng Long · Jianmin Wang -
2020 Poster: Unsupervised Transfer Learning for Spatiotemporal Predictive Networks »
Zhiyu Yao · Yunbo Wang · Mingsheng Long · Jianmin Wang -
2019 Poster: Bridging Theory and Algorithm for Domain Adaptation »
Yuchen Zhang · Tianle Liu · Mingsheng Long · Michael Jordan -
2019 Oral: Bridging Theory and Algorithm for Domain Adaptation »
Yuchen Zhang · Tianle Liu · Mingsheng Long · Michael Jordan -
2019 Poster: Transferable Adversarial Training: A General Approach to Adapting Deep Classifiers »
Hong Liu · Mingsheng Long · Jianmin Wang · Michael Jordan -
2019 Poster: Towards Accurate Model Selection in Deep Unsupervised Domain Adaptation »
Kaichao You · Ximei Wang · Mingsheng Long · Michael Jordan -
2019 Poster: Transferability vs. Discriminability: Batch Spectral Penalization for Adversarial Domain Adaptation »
Xinyang Chen · Sinan Wang · Mingsheng Long · Jianmin Wang -
2019 Oral: Towards Accurate Model Selection in Deep Unsupervised Domain Adaptation »
Kaichao You · Ximei Wang · Mingsheng Long · Michael Jordan -
2019 Oral: Transferability vs. Discriminability: Batch Spectral Penalization for Adversarial Domain Adaptation »
Xinyang Chen · Sinan Wang · Mingsheng Long · Jianmin Wang -
2019 Oral: Transferable Adversarial Training: A General Approach to Adapting Deep Classifiers »
Hong Liu · Mingsheng Long · Jianmin Wang · Michael Jordan -
2018 Poster: PredRNN++: Towards A Resolution of the Deep-in-Time Dilemma in Spatiotemporal Predictive Learning »
Yunbo Wang · Zhifeng Gao · Mingsheng Long · Jianmin Wang · Philip Yu -
2018 Oral: PredRNN++: Towards A Resolution of the Deep-in-Time Dilemma in Spatiotemporal Predictive Learning »
Yunbo Wang · Zhifeng Gao · Mingsheng Long · Jianmin Wang · Philip Yu -
2017 Poster: Deep Transfer Learning with Joint Adaptation Networks »
Mingsheng Long · Han Zhu · Jianmin Wang · Michael Jordan -
2017 Talk: Deep Transfer Learning with Joint Adaptation Networks »
Mingsheng Long · Han Zhu · Jianmin Wang · Michael Jordan