Inv-Adapter: ID Customization Generation via Image Inversion and Lightweight Adapter

Xing, Peng; Wang, Ning; Ouyang, Jianbo; Li, Zechao

Computer Science > Computer Vision and Pattern Recognition

arXiv:2406.02881 (cs)

[Submitted on 5 Jun 2024 (v1), last revised 6 Jun 2024 (this version, v2)]

Title:Inv-Adapter: ID Customization Generation via Image Inversion and Lightweight Adapter

Authors:Peng Xing, Ning Wang, Jianbo Ouyang, Zechao Li

View PDF HTML (experimental)

Abstract:The remarkable advancement in text-to-image generation models significantly boosts the research in ID customization generation. However, existing personalization methods cannot simultaneously satisfy high fidelity and high-efficiency requirements. Their main bottleneck lies in the prompt image encoder, which produces weak alignment signals with the text-to-image model and significantly increased model size. Towards this end, we propose a lightweight Inv-Adapter, which first extracts diffusion-domain representations of ID images utilizing a pre-trained text-to-image model via DDIM image inversion, without additional image encoder. Benefiting from the high alignment of the extracted ID prompt features and the intermediate features of the text-to-image model, we then embed them efficiently into the base text-to-image model by carefully designing a lightweight attention adapter. We conduct extensive experiments to assess ID fidelity, generation loyalty, speed, and training parameters, all of which show that the proposed Inv-Adapter is highly competitive in ID customization generation and model scale.

Comments:	technical report
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2406.02881 [cs.CV]
	(or arXiv:2406.02881v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2406.02881

Submission history

From: Peng Xing [view email]
[v1] Wed, 5 Jun 2024 02:59:08 UTC (37,369 KB)
[v2] Thu, 6 Jun 2024 06:59:46 UTC (29,907 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Inv-Adapter: ID Customization Generation via Image Inversion and Lightweight Adapter

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Inv-Adapter: ID Customization Generation via Image Inversion and Lightweight Adapter

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators