
Accelerating Dataset Distillation via Model Augmentation

  • Lei Zhang
  • Jie Zhang
  • Bowen Lei
  • Subhabrata Mukherjee
  • Xiang Pan
  • Bo Zhao
  • Caiwen Ding
  • Yao Li
  • Dongkuan Xu

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Dataset Distillation (DD), a newly emerging field, aims at generating much smaller but efficient synthetic training datasets from large ones. Existing DD methods based on gradient matching achieve leading performance; however, they are extremely computationally intensive, as they require continuously optimizing a dataset among thousands of randomly initialized models. In this paper, we assume that training the synthetic data with diverse models leads to better generalization performance. Thus, we propose two model augmentation techniques, i.e., using early-stage models and parameter perturbation, to learn an informative synthetic set with significantly reduced training cost. Extensive experiments demonstrate that our method achieves up to 20× speedup with performance on par with state-of-the-art methods.
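The two model augmentation ideas named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the flat-list representation of model parameters, the norm-based noise scaling, and the values of `alpha` and `copies_per_model` are all assumptions chosen for illustration. The sketch takes a few hypothetical early-stage checkpoints and builds a larger, more diverse pool of models by adding scaled random perturbations to each one.

```python
import random


def perturb_parameters(params, alpha=0.1, rng=None):
    """Return a perturbed copy of `params` (a flat list of floats).

    Gaussian noise is rescaled so that the perturbation norm equals
    `alpha` times the parameter norm. This scaling rule is an
    illustrative assumption, not taken from the paper.
    """
    rng = rng or random.Random()
    noise = [rng.gauss(0.0, 1.0) for _ in params]
    param_norm = sum(w * w for w in params) ** 0.5
    noise_norm = sum(d * d for d in noise) ** 0.5
    scale = alpha * param_norm / noise_norm if noise_norm > 0 else 0.0
    return [w + scale * d for w, d in zip(params, noise)]


def augment_model_pool(early_stage_params, copies_per_model=3, alpha=0.1, seed=0):
    """Build a model pool from early-stage checkpoints plus perturbed copies."""
    rng = random.Random(seed)
    pool = []
    for params in early_stage_params:
        pool.append(list(params))  # keep the early-stage model itself
        for _ in range(copies_per_model):
            pool.append(perturb_parameters(params, alpha, rng))
    return pool


# Hypothetical early-stage checkpoints (e.g. models trained for a few epochs).
checkpoints = [[0.5, -1.0, 2.0], [1.5, 0.25, -0.75]]
pool = augment_model_pool(checkpoints, copies_per_model=3, alpha=0.1, seed=0)
print(len(pool))
```

In a gradient-matching loop, the synthetic set would then be optimized against models drawn from such a pool rather than against thousands of freshly initialized ones, which is where the abstract locates the reduction in training cost.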

Original language: English (US)
Title of host publication: Proceedings - 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023
Publisher: IEEE Computer Society
Pages: 11950-11959
Number of pages: 10
ISBN (Electronic): 9798350301298
ISBN (Print): 9798350301298
State: Published - 2023
Externally published: Yes
Event: 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023 - Vancouver, Canada
Duration: Jun 18 2023 → Jun 22 2023

Publication series

Name: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
Volume: 2023-June
ISSN (Print): 1063-6919

Conference

Conference: 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023
Country/Territory: Canada
City: Vancouver
Period: 6/18/23 → 6/22/23

Bibliographical note

Publisher Copyright:
© 2023 IEEE.

Keywords

  • Datasets and evaluation
