CM-GAN: Image Inpainting with Cascaded Modulation GAN and Object-Aware Training

Zheng, Haitian; Lin, Zhe; Lu, Jingwan; Cohen, Scott; Shechtman, Eli; Barnes, Connelly; Zhang, Jianming; Xu, Ning; Amirghodsi, Sohrab; Luo, Jiebo

arXiv:2203.11947 (cs)

[Submitted on 22 Mar 2022 (v1), last revised 21 Jul 2022 (this version, v3)]

Abstract: Recent image inpainting methods have made great progress but often struggle to generate plausible image structures when dealing with large holes in complex images. This is partially due to the lack of effective network structures that can capture both the long-range dependency and high-level semantics of an image. We propose cascaded modulation GAN (CM-GAN), a new network design consisting of an encoder with Fourier convolution blocks that extract multi-scale feature representations from the input image with holes and a dual-stream decoder with a novel cascaded global-spatial modulation block at each scale level. In each decoder block, global modulation is first applied to perform coarse and semantic-aware structure synthesis, followed by spatial modulation to further adjust the feature map in a spatially adaptive fashion. In addition, we design an object-aware training scheme to prevent the network from hallucinating new objects inside holes, fulfilling the needs of object removal tasks in real-world scenarios. Extensive experiments are conducted to show that our method significantly outperforms existing methods in both quantitative and qualitative evaluation. Please refer to the project page: this https URL.
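
The ordering described in the abstract (global modulation first, then spatial modulation) can be illustrated with a minimal Python sketch. This is not the authors' implementation: it omits the convolutions, the Fourier convolution encoder, and the dual-stream decoder, and every name in it (`global_modulation`, `spatial_modulation`, `cascaded_modulation_block`, `weight`, `scale_map`, `shift_map`) is hypothetical. It assumes that global modulation scales each channel uniformly using a global code, and that spatial modulation applies a per-pixel scale and shift.

```python
from typing import List

# A feature map stored as [channel][row][col]; a toy stand-in for a tensor.
Feature = List[List[List[float]]]


def global_modulation(x: Feature, global_code: List[float],
                      weight: List[List[float]]) -> Feature:
    """Scale each channel by 1 + <weight[c], global_code>.

    The same scale is used at every pixel of a channel, so the adjustment
    depends only on the global code (assumed form, for illustration).
    """
    out = []
    for c, channel in enumerate(x):
        scale = 1.0 + sum(w * g for w, g in zip(weight[c], global_code))
        out.append([[v * scale for v in row] for row in channel])
    return out


def spatial_modulation(x: Feature, scale_map: List[List[float]],
                       shift_map: List[List[float]]) -> Feature:
    """Apply a per-pixel scale and shift: x * (1 + scale) + shift.

    The maps are shared across channels here; this is an assumption made
    to keep the sketch small.
    """
    return [
        [[v * (1.0 + s) + b for v, s, b in zip(row, s_row, b_row)]
         for row, s_row, b_row in zip(channel, scale_map, shift_map)]
        for channel in x
    ]


def cascaded_modulation_block(x: Feature, global_code: List[float],
                              weight: List[List[float]],
                              scale_map: List[List[float]],
                              shift_map: List[List[float]]) -> Feature:
    """Cascade: coarse global modulation, then spatially adaptive adjustment."""
    coarse = global_modulation(x, global_code, weight)
    return spatial_modulation(coarse, scale_map, shift_map)


if __name__ == "__main__":
    # Two channels of a 2x2 feature map, all ones.
    x = [[[1.0, 1.0], [1.0, 1.0]], [[1.0, 1.0], [1.0, 1.0]]]
    y = cascaded_modulation_block(
        x,
        global_code=[1.0, 0.0],
        weight=[[1.0, 0.0], [0.0, 1.0]],
        scale_map=[[0.0, 1.0], [0.0, 0.0]],
        shift_map=[[0.0, 0.0], [0.5, 0.0]],
    )
    print(y)
```

In this toy setup the global step changes a whole channel at once, while the spatial step changes individual pixels, which mirrors the coarse-then-adaptive ordering the abstract describes.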

Comments:	32 pages, 19 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2203.11947 [cs.CV]
	(or arXiv:2203.11947v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2203.11947

Submission history

From: Haitian Zheng
[v1] Tue, 22 Mar 2022 16:13:27 UTC (31,337 KB)
[v2] Fri, 15 Apr 2022 18:24:40 UTC (43,130 KB)
[v3] Thu, 21 Jul 2022 00:59:33 UTC (35,357 KB)
