StableDiffusion.text_to_image() causes an exception in Colab #2467

KouichiMatsuda · 2024-07-02T15:30:34Z

Hi Keras Team,

Current Behavior:

The code based on https://keras.io/api/keras_cv/models/tasks/stable_diffusion/ causes an exception: ValueError: Exception encountered when calling DiffusionModelV2.call().

https://colab.research.google.com/drive/1OYet7JBOwgt7L5itxOOzVg-jclgPpdnT?usp=sharing

The same happens with the StableDiffusion class, too.

Am I missing something?

Steps To Reproduce:

https://colab.research.google.com/drive/1OYet7JBOwgt7L5itxOOzVg-jclgPpdnT?usp=sharing

Version:

Keras 3.4.1
TF 2.16.2
KerasCV 0.9.0

pjogi-testy · 2024-07-05T20:32:50Z

I get the same error when trying to reproduce any Stable Diffusion tutorial from the official Keras repo (e.g. https://keras.io/examples/generative/random_walks_with_stable_diffusion/), whether I run it locally or use the original notebook in Colab. Possibly something is broken between Keras 3 and the latest TF? It seems the encoder output (77 tokens, 768 values) is not being matched with the right input of the diffusion UNet (which expects a 64x64x4 latent). Any clues would be most welcome, or even confirmation of which exact versions of tf, keras, and keras_cv the official repo works with, because the Keras API is so inconsistent between versions that it is really hard to follow. Further details of the error:

ValueError: Exception encountered when calling DiffusionModel.call().

Invalid input shape for input Tensor("data_2:0", shape=(3, 77, 768), dtype=float32). Expected shape (None, 64, 64, 4), but input has incompatible shape (3, 77, 768)

Arguments received by DiffusionModel.call():
• inputs={'latent': 'tf.Tensor(shape=(3, 64, 64, 4), dtype=float32)', 'timestep_embedding': 'tf.Tensor(shape=(3, 320), dtype=float32)', 'context': 'tf.Tensor(shape=(3, 77, 768), dtype=float32)'}
• training=False
• mask={'latent': 'None', 'timestep_embedding': 'None', 'context': 'None'}
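The shapes quoted in the message can be checked by hand. Below is a minimal sketch of that comparison, assuming the usual convention that `None` in an expected shape means "any size"; `is_compatible` is a hypothetical helper written for illustration, not the check Keras itself runs. It shows that the `latent` tensor from the received arguments fits the expected spec, while the `context` tensor, which is the one named in the error, does not.

```python
def is_compatible(expected, actual):
    """Return True if `actual` fits `expected`, where None matches any size.

    Hypothetical helper for illustration only; not part of Keras.
    """
    # Shapes of different rank can never match.
    if len(expected) != len(actual):
        return False
    # Every fixed dimension must be equal; None dimensions are wildcards.
    return all(e is None or e == a for e, a in zip(expected, actual))


# Shapes copied from the error message above.
expected_latent = (None, 64, 64, 4)
latent = (3, 64, 64, 4)
context = (3, 77, 768)

print(is_compatible(expected_latent, latent))   # the latent tensor fits
print(is_compatible(expected_latent, context))  # the context tensor does not
```

In other words, the message reports the `context` tensor being validated against the spec of the `latent` input, even though a correctly shaped `latent` tensor was also passed.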

heydaari · 2024-07-10T13:39:16Z

I have the same error as in
#2467 (comment)

I tried different backends, but the issue persists.

OttoERM · 2024-07-19T04:59:39Z

I ran into the same problem by following this TensorFlow tutorial, with the same error as @pjogi-testy.

Long story short, I got it working on: keras 2.13.1, keras-core 0.1.7, keras-cv 0.9.0, tensorflow 2.13.1
And also on: keras 2.15.0, keras-core 0.1.7, keras-cv 0.9.0, tensorflow 2.15.1

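import time
import keras_cv
from tensorflow import keras
import matplotlib.pyplot as plt
from PIL import Image

# Build the Stable Diffusion pipeline for 512x512 output images.
model = keras_cv.models.StableDiffusion(img_width=512, img_height=512)

# Generate a single image from the prompt using 15 diffusion steps.
image = model.text_to_image(prompt="Flower", batch_size=1, num_steps=15)

# Save the first (and only) image of the batch.
Image.fromarray(image[0]).save("Flower.png")
print("Saved at Flower.png")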

I didn't try TensorFlow 2.16.*.
Anyway, I guess it is just a broken combination of Keras 3.4.1 and TensorFlow 2.17.0 (the latest release at this time).

The Keras repo README has a note: "Keras 3 will not function with TensorFlow 2.14 or earlier." I'm not sure how you are supposed to use Keras 3, because whenever I installed tensorflow, a specific version of keras was installed along with it, and when I upgraded keras with install --upgrade I got a version incompatibility error.
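To see quickly whether an environment matches one of the combinations reported as working above, something like the following sketch may help. It uses only the standard library's `importlib.metadata`. The `KNOWN_WORKING` list simply restates the versions from this comment and is not an official compatibility matrix, and the helper names are made up for illustration.

```python
from importlib import metadata

# Combinations reported as working in this thread (not an official list).
KNOWN_WORKING = [
    {"keras": "2.13.1", "keras-core": "0.1.7", "keras-cv": "0.9.0", "tensorflow": "2.13.1"},
    {"keras": "2.15.0", "keras-core": "0.1.7", "keras-cv": "0.9.0", "tensorflow": "2.15.1"},
]


def installed_versions(packages):
    """Map each distribution name to its installed version, or None if absent."""
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


def matches_known_working(versions, known=KNOWN_WORKING):
    """Return True if `versions` equals one of the known combinations exactly."""
    return any(
        all(versions.get(pkg) == ver for pkg, ver in combo.items())
        for combo in known
    )


if __name__ == "__main__":
    found = installed_versions(KNOWN_WORKING[0].keys())
    for pkg, ver in found.items():
        print(f"{pkg}: {ver or 'not installed'}")
    print("Matches a combination reported as working:", matches_known_working(found))
```

Pasting the printed versions into a comment also makes it easier for others to compare environments.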

github-actions bot assigned sachinprasadhs Jul 2, 2024
