Skip to content

Issues: microsoft/DeepSpeed

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

[BUG] 1-bit LAMB not compatible with bf16 bug Something isn't working training
#5708 opened Jun 28, 2024 by catid
[BUG] Fine-tuned model outputs are empty. bug Something isn't working training
#5706 opened Jun 28, 2024 by IYIAscension
on Activation Checkpointing bug Something isn't working training
#5704 opened Jun 28, 2024 by ChaunceyWang
[BUG] inference ValueError bug Something isn't working inference
#5685 opened Jun 19, 2024 by zxrneu
[BUG] Using and Building DeepSpeedCPUAdam bug Something isn't working training
#5677 opened Jun 18, 2024 by oabuhamdan
does DeepSpeed support AMSP (a new DP shard strategy) enhancement New feature or request
#5661 opened Jun 14, 2024 by guoyejun
[BUG] Running llama2-7b step3 with tensor parallel and HE fails due to incompatible shapes bug Something isn't working deepspeed-chat Related to DeepSpeed-Chat
#5656 opened Jun 13, 2024 by ShellyNR
[BUG] oneapi/ccl.hpp: No such file or directory. bug Something isn't working training
#5653 opened Jun 12, 2024 by weiji14
RuntimeError: still have inflight params[BUG] bug Something isn't working training
#5648 opened Jun 12, 2024 by iszengxin
ProTip! Mix and match filters to narrow down what you’re looking for.