Papers
Topics
Authors
Recent
Search
2000 character limit reached

Perceptual Similarity guidance and text guidance optimization for Editing Real Images using Guided Diffusion Models

Published 9 Dec 2023 in cs.CV | (2312.06680v1)

Abstract: When using a diffusion model for image editing, there are times when the modified image can differ greatly from the source. To address this, we apply a dual-guidance approach to maintain high fidelity to the original in areas that are not altered. First, we employ text-guided optimization, using text embeddings to direct latent space and classifier-free guidance. Second, we use perceptual similarity guidance, optimizing latent vectors with posterior sampling via Tweedie formula during the reverse process. This method ensures the realistic rendering of both the edited elements and the preservation of the unedited parts of the original image.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (16)
  1. Ilvr: Conditioning method for denoising diffusion probabilistic models, 2021.
  2. Diffedit: Diffusion-based semantic image editing with mask guidance, 2022.
  3. High-fidelity and arbitrary face editing, 2021.
  4. Prompt-to-prompt image editing with cross attention control, 2022.
  5. Clipscore: A reference-free evaluation metric for image captioning, 2022.
  6. Classifier-free diffusion guidance, 2022.
  7. Enhancing diffusion-based image synthesis with robust classifier guidance, 2023.
  8. Details or artifacts: A locally discriminative learning approach to realistic image super-resolution, 2022.
  9. Repaint: Inpainting using denoising diffusion probabilistic models, 2022.
  10. Sdedit: Guided image synthesis and editing with stochastic differential equations, 2022.
  11. Null-text inversion for editing real images using guided diffusion models, 2022.
  12. Lanit: Language-driven image-to-image translation for unlabeled data, 2023.
  13. High-resolution image synthesis with latent diffusion models, 2022.
  14. Knn-diffusion: Image generation via large-scale retrieval, 2022.
  15. Adaint: Learning adaptive intervals for 3d lookup tables on real-time image enhancement, 2022.
  16. The unreasonable effectiveness of deep features as a perceptual metric, 2018.

Summary

No one has generated a summary of this paper yet.

Paper to Video (Beta)

No one has generated a video about this paper yet.

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Authors (1)

Collections

Sign up for free to add this paper to one or more collections.