Improved Zero-Shot Image Editing via Null-Toon and Directed Delta Denoising Score

Osuva_Fahim_Boutellier_2024.pdf
Hyväksytty kirjoittajan käsikirjoitus - 22.15 MB

Kuvaus

©2024 Springer. This is a post-peer-review, pre-copyedit version of an article published in Pattern Recognition: 27th International Conference, ICPR 2024, Kolkata, India, December 1–5, 2024, Proceedings, Part VI. The final authenticated version is available online at: https://doi.org/10.1007/978-3-031-78172-8_20
Recently, there has been a rapid surge in the utilization of diffusion models for customized image generation and editing tasks, especially using zero-shot editing algorithms that can largely operate on given images regardless of their source domain. This work is based on two well-known zero-shot image editing algorithms: Null Text Inversion (NTI) and Delta Denoising Score (DDS). With respect to NTI, we mainly focus on image cartoonization, which has received less attention in the context of text-guided image editing. In a nutshell, we propose a customized reconstruction phase for NTI, which helps transforming the natural input image into cartoon images with desired customization by supporting parameters. We also improve the current DDS optimization baseline and propose the Directed Delta Denoising Score (DDDS). Our DDDS algorithm offers a better image editing experience by replacing the target text prompt with the proposed directed text prompt. Computing directed text prompt requires one subtraction operation and yields significant reconstruction improvement over DDS. To demonstrate the effectiveness of our contributions, the paper presents both quantitative and qualitative comparisons against the state-of-the-art, as well as several visual examples.

Emojulkaisu

Pattern Recognition : 27th International Conference, ICPR 2024, Kolkata, India, December 1–5, 2024, Proceedings, Part VI

ISBN

978-3-031-78172-8

ISSN

1611-3349
0302-9743

Aihealue

Sarja

Lecture Notes in Computer Science|15306

OKM-julkaisutyyppi

A4 Artikkeli konferenssijulkaisussa