Reparameterization through Spatial Gradient Scaling

Detkov, Alexander; Salameh, Mohammad; Qharabagh, Muhammad Fetrat; Zhang, Jialin; Lui, Wei; Jui, Shangling; Niu, Di

Computer Science > Machine Learning

arXiv:2303.02733 (cs)

[Submitted on 5 Mar 2023 (v1), last revised 7 Mar 2023 (this version, v2)]

Title:Reparameterization through Spatial Gradient Scaling

Authors:Alexander Detkov, Mohammad Salameh, Muhammad Fetrat Qharabagh, Jialin Zhang, Wei Lui, Shangling Jui, Di Niu

View PDF

Abstract:Reparameterization aims to improve the generalization of deep neural networks by transforming convolutional layers into equivalent multi-branched structures during training. However, there exists a gap in understanding how reparameterization may change and benefit the learning process of neural networks. In this paper, we present a novel spatial gradient scaling method to redistribute learning focus among weights in convolutional networks. We prove that spatial gradient scaling achieves the same learning dynamics as a branched reparameterization yet without introducing structural changes into the network. We further propose an analytical approach that dynamically learns scalings for each convolutional layer based on the spatial characteristics of its input feature map gauged by mutual information. Experiments on CIFAR-10, CIFAR-100, and ImageNet show that without searching for reparameterized structures, our proposed scaling method outperforms the state-of-the-art reparameterization strategies at a lower computational cost.

Comments:	Published at ICLR 2023. Code available at this https URL
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2303.02733 [cs.LG]
	(or arXiv:2303.02733v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2303.02733

Submission history

From: Alexander Detkov [view email]
[v1] Sun, 5 Mar 2023 17:57:33 UTC (2,645 KB)
[v2] Tue, 7 Mar 2023 02:07:01 UTC (2,645 KB)

Computer Science > Machine Learning

Title:Reparameterization through Spatial Gradient Scaling

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Reparameterization through Spatial Gradient Scaling

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators