CrAM: A Compression-Aware Minimizer

Peste, Alexandra; Vladu, Adrian; Kurtic, Eldar; Lampert, Christoph H.; Alistarh, Dan

Computer Science > Machine Learning

arXiv:2207.14200 (cs)

[Submitted on 28 Jul 2022 (v1), last revised 4 May 2023 (this version, v4)]

Title:CrAM: A Compression-Aware Minimizer

Authors:Alexandra Peste, Adrian Vladu, Eldar Kurtic, Christoph H. Lampert, Dan Alistarh

View PDF

Abstract:Deep neural networks (DNNs) often have to be compressed, via pruning and/or quantization, before they can be deployed in practical settings. In this work we propose a new compression-aware minimizer dubbed CrAM that modifies the optimization step in a principled way, in order to produce models whose local loss behavior is stable under compression operations such as pruning. Thus, dense models trained via CrAM should be compressible post-training, in a single step, without significant accuracy loss. Experimental results on standard benchmarks, such as residual networks for ImageNet classification and BERT models for language modelling, show that CrAM produces dense models that can be more accurate than the standard SGD/Adam-based baselines, but which are stable under weight pruning: specifically, we can prune models in one-shot to 70-80% sparsity with almost no accuracy loss, and to 90% with reasonable ($\sim 1\%$) accuracy loss, which is competitive with gradual compression methods. Additionally, CrAM can produce sparse models which perform well for transfer learning, and it also works for semi-structured 2:4 pruning patterns supported by GPU hardware. The code for reproducing the results is available at this https URL .

Comments:	Accepted to ICLR 2023
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2207.14200 [cs.LG]
	(or arXiv:2207.14200v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2207.14200

Submission history

From: Alexandra Peste [view email]
[v1] Thu, 28 Jul 2022 16:13:28 UTC (53 KB)
[v2] Thu, 27 Oct 2022 16:39:30 UTC (76 KB)
[v3] Mon, 31 Oct 2022 10:39:29 UTC (76 KB)
[v4] Thu, 4 May 2023 13:55:21 UTC (96 KB)

Computer Science > Machine Learning

Title:CrAM: A Compression-Aware Minimizer

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:CrAM: A Compression-Aware Minimizer

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators