Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Opensora 1.2 inference fail #356

Open
qian18long opened this issue Jul 26, 2024 · 6 comments
Open

Opensora 1.2 inference fail #356

qian18long opened this issue Jul 26, 2024 · 6 comments

Comments

@qian18long
Copy link

The inference video is only noise from the given checkpoint: https://huggingface.co/LanguageBind/Open-Sora-Plan-v1.2.0

@TianxiangMa
Copy link

Me too, why?

0.1.mp4

@ytm-01
Copy link

ytm-01 commented Jul 26, 2024

i meet the same question.

@TianxiangMa
Copy link

I solved this problem by adjusting the "num_frames" to 93. The trained 4-second model can only be used to inference 4-second videos.

@LinB203
Copy link
Member

LinB203 commented Jul 26, 2024

Dynamic length of time is still in the training is not released for the time being.

@LinB203
Copy link
Member

LinB203 commented Jul 26, 2024

Btw, our project name is Open-Sora Plan, not Open-Sora, which is an another repo.

@foreverpiano
Copy link

@LinB203 想问下现在inference时候,DiT 的架构3D是怎么理解的?以前是 spatial temporal 交替,然后把多余的 reshape 到 bs 的维度,现在我看代码是只有一个 transformer 了,现在还是 transformer2D 吗?求解释下架构

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants