sanity check ranges in freeze.py #1686

Merged
merged 2 commits into axolotl-ai-cloud:main from josh/sanity-check-ranges
Jul 5, 2024

Conversation

josharian
Contributor

Description

this will catch range-related problems earlier and more clearly.

Motivation and Context

in my case, it appears that DeepSpeed doesn't play well with ranges.
I am still investigating.

How has this been tested?

manually, in my broken setup.
(the new check fails there because the param length is 0.)

this will catch problems earlier and more clearly.

in my case, it appears that DeepSpeed ZeRO-3 sets layer tensor shapes
to [0], which doesn't play well with automatically inferred ranges.
through a bit of luck, inverting ranges still appears to work correctly.
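
For illustration only, here is a minimal sketch of the kind of check described above. It is not the code from this PR: the function name `check_ranges`, the half-open `(start, end)` range convention, and the error messages are all assumptions made for the example.

```python
from typing import List, Tuple

# Assumption: a range is a half-open (start, end) pair of indices
# along the first dimension of a parameter.
Range = Tuple[int, int]


def check_ranges(param_name: str, param_len: int, ranges: List[Range]) -> None:
    """Raise ValueError if any range does not fit a parameter of length param_len.

    Hypothetical helper; the name and messages are not taken from freeze.py.
    """
    # A zero-length parameter (for example, a placeholder tensor of shape [0])
    # cannot satisfy any range, so fail early with a clear message.
    if param_len == 0:
        raise ValueError(
            f"{param_name}: parameter length is 0; cannot apply ranges {ranges}"
        )
    for start, end in ranges:
        # Each range must be non-empty and lie within [0, param_len].
        if not (0 <= start < end <= param_len):
            raise ValueError(
                f"{param_name}: invalid range ({start}, {end}) for length {param_len}"
            )
```

With this sketch, `check_ranges("w", 10, [(0, 5)])` returns without error, while a parameter length of 0 raises a `ValueError` up front instead of producing a confusing failure later.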
@josharian
Contributor Author

Looks like that test failure is an unrelated flake. No sign of "unfreeze" in the logs for it.

@winglian merged commit b3f680d into axolotl-ai-cloud:main on Jul 5, 2024
7 of 8 checks passed
@josharian deleted the josh/sanity-check-ranges branch on July 5, 2024 15:11
2 participants