[StableDiffusionInpaintPipeline] accept tensors for init and mask image #439

patil-suraj · 2022-09-09T06:20:26Z

This PR updates StableDiffusionInpaintPipeline to accept both torch.FloatTensor and PIL.Image.Image for init_image and mask_image.

Fixes #370

HuggingFaceDocBuilderDev · 2022-09-09T06:24:08Z

The documentation is not available anymore as the PR was closed or merged.

Inkorak · 2022-09-09T07:04:43Z

@patil-suraj There are problem here, in my opinion. If we send a FloatTensor, then there will be no preprocessing and the mask variable will not be declared, and secondly, there will be no transfer to the device.

if not isinstance(mask_image, torch.FloatTensor):
   mask = preprocess_mask(mask_image).to(self.device)
mask = torch.cat([mask] * batch_size)

With this, there should be no problems:

if not isinstance(mask_image, torch.FloatTensor):
   mask_image = preprocess_mask(mask_image)
mask = torch.cat([mask_image.to(self.device)] * batch_size)

patil-suraj · 2022-09-09T07:55:15Z

good catch! Updating it now

src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion_inpaint.py

pcuenca

LGTM!

Question: if the user provides an image and a mask as tensors, should we verify that the number of channels match? The image should have R,G,B while the mask is only L. Is this something we should check or is it too much?

On second thought, we could just update the documentation and not the code:

mask_image (`torch.FloatTensor` or `PIL.Image.Image`):
    `Image`, or tensor representing an image batch, to mask `init_image`. White pixels in the mask will be
    replaced by noise and therefore repainted, while black pixels will be preserved. If `mask_image` is a
    PIL image, it will be converted to a single channel (luminance) before use. If it's a tensor, it should
    contain one color channel (L) instead of 3, so the expected shape would be `(B, H, W, 1)`.

patil-suraj · 2022-09-16T15:27:08Z

Good point @pcuenca ! The pipeline is experimental and will be soon updated so just updated the docs for now.

…ge (huggingface#439) * accept tensors * fix mask handling * make device placement cleaner * update doc for mask image

accept tensors

6ae085f

patil-suraj requested a review from anton-l September 9, 2022 06:20

fix mask handling

7cd259d

pcuenca reviewed Sep 9, 2022

View reviewed changes

src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion_inpaint.py Show resolved Hide resolved

make device placement cleaner

02b6fd2

patil-suraj requested a review from pcuenca September 9, 2022 15:55

pcuenca approved these changes Sep 9, 2022

View reviewed changes

patrickvonplaten approved these changes Sep 16, 2022

View reviewed changes

update doc for mask image

5d5e194

patil-suraj merged commit 06924c6 into main Sep 16, 2022

patil-suraj deleted the update-inpaint branch September 16, 2022 15:35

PhaneeshB added a commit to nod-ai/diffusers that referenced this pull request Mar 1, 2023

add support for choosing vulkan device (huggingface#439)

749a2c2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[StableDiffusionInpaintPipeline] accept tensors for init and mask image #439

[StableDiffusionInpaintPipeline] accept tensors for init and mask image #439

patil-suraj commented Sep 9, 2022 •

edited

Loading

HuggingFaceDocBuilderDev commented Sep 9, 2022 •

edited

Loading

Inkorak commented Sep 9, 2022 •

edited

Loading

patil-suraj commented Sep 9, 2022

pcuenca left a comment

patil-suraj commented Sep 16, 2022

[StableDiffusionInpaintPipeline] accept tensors for init and mask image #439

[StableDiffusionInpaintPipeline] accept tensors for init and mask image #439

Conversation

patil-suraj commented Sep 9, 2022 • edited Loading

HuggingFaceDocBuilderDev commented Sep 9, 2022 • edited Loading

Inkorak commented Sep 9, 2022 • edited Loading

patil-suraj commented Sep 9, 2022

pcuenca left a comment

Choose a reason for hiding this comment

patil-suraj commented Sep 16, 2022

patil-suraj commented Sep 9, 2022 •

edited

Loading

HuggingFaceDocBuilderDev commented Sep 9, 2022 •

edited

Loading

Inkorak commented Sep 9, 2022 •

edited

Loading