kandinsky-community
/

kandinsky-2-2-decoder-inpaint

KandinskyV22InpaintPipeline

Model card Files Files and versions Community

YiYiXu commited on Jul 26, 2023

Commit

1df06eb

·

1 Parent(s): 48ea15d

Update README.md

Files changed (1) hide show

README.md +21 -12

README.md CHANGED Viewed

@@ -26,34 +26,29 @@ pip install diffusers transformers accelerate
 ### Text Guided Inpainting Generation
 ```python
-from diffusers import KandinskyV22InpaintPipeline, KandinskyV22PriorPipeline
 from diffusers.utils import load_image
 import torch
 import numpy as np
-pipe_prior = KandinskyV22PriorPipeline.from_pretrained(
-    "kandinsky-community/kandinsky-2-2-prior", torch_dtype=torch.float16
-)
-pipe_prior.to("cuda")
 prompt = "a hat"
-prior_output = pipe_prior(prompt)
-pipe = KandinskyV22InpaintPipeline.from_pretrained("kandinsky-community/kandinsky-2-2-decoder-inpaint", torch_dtype=torch.float16)
-pipe.to("cuda")
 init_image = load_image(
     "https://huggingface.co/datasets/hf-internal-testing/diffusers-images/resolve/main" "/kandinsky/cat.png"
 )
-mask = np.ones((768, 768), dtype=np.float32)
 # Let's mask out an area above the cat's head
-mask[:250, 250:-250] = 0
 out = pipe(
     image=init_image,
     mask_image=mask,
-    **prior_output,
     height=768,
     width=768,
     num_inference_steps=150,
@@ -64,6 +59,20 @@ image.save("cat_with_hat.png")
 ```
 ![img](https://huggingface.co/datasets/hf-internal-testing/diffusers-images/resolve/main/kandinskyv22/cat_with_hat.png)
 ## Model Architecture

 ### Text Guided Inpainting Generation
 ```python
+from diffusers import AutoPipelineForInpainting
 from diffusers.utils import load_image
 import torch
 import numpy as np
+pipe = AutoPipelineForInpainting.from_pretrained("kandinsky-community/kandinsky-2-2-decoder-inpaint", torch_dtype=torch.float16)
+pipe.enable_model_cpu_offload()
 prompt = "a hat"
 init_image = load_image(
     "https://huggingface.co/datasets/hf-internal-testing/diffusers-images/resolve/main" "/kandinsky/cat.png"
 )
+mask = np.zeros((768, 768), dtype=np.float32)
 # Let's mask out an area above the cat's head
+mask[:250, 250:-250] = 1
 out = pipe(
+    prompt=prompt,
     image=init_image,
     mask_image=mask,
     height=768,
     width=768,
     num_inference_steps=150,
 ```
 ![img](https://huggingface.co/datasets/hf-internal-testing/diffusers-images/resolve/main/kandinskyv22/cat_with_hat.png)
+__<font color=red>Breaking change on the mask input:</font>__
+We introduced a breaking change for Kandinsky inpainting pipeline in the following pull request: https://github.com/huggingface/diffusers/pull/4207. Previously we accepted a mask format where black pixels represent the masked-out area. We have changed to use white pixels to represent masks instead in order to have a unified mask format across all our pipelines.
+Please upgrade your inpainting code to follow the above. If you are using Kandinsky Inpaint in production. You now need to change the mask to:
+```python
+# For PIL input
+import PIL.ImageOps
+mask = PIL.ImageOps.invert(mask)
+# For PyTorch and Numpy input
+mask = 1 - mask
+```
 ## Model Architecture