Bug/sg 764 wrong ckpt when resuming with external ckpt root dir #948

shaydeci (Collaborator) commented May 8, 2023

Until now we did not pass ckpt_root_dir to get_checkpoints_dir_path, so it fell back to the default pkg_resources location. As a result, when someone calls train with training_hyperparams.resume=True ckpt_root_dir=/some/path/to/ckpt_root/, the ckpt_root_dir is discarded and we end up trying to load a checkpoint that does not exist.
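The failure mode described above can be illustrated with a minimal, self-contained sketch. It does not use the library's actual code: `resolve_checkpoints_dir`, `resume_path_buggy`, `resume_path_fixed`, `DEFAULT_CKPT_ROOT`, and the checkpoint file name are all hypothetical stand-ins, assumed only for illustration of "root dir not forwarded" versus "root dir forwarded".

```python
import os
import tempfile
from typing import Optional

# Hypothetical stand-in for the package-level default checkpoints location.
DEFAULT_CKPT_ROOT = os.path.join(tempfile.gettempdir(), "pkg_default_checkpoints")

# Placeholder checkpoint file name, assumed for this sketch only.
CKPT_FILE_NAME = "ckpt_latest.pth"


def resolve_checkpoints_dir(experiment_name: str, ckpt_root_dir: Optional[str] = None) -> str:
    """Return the experiment's checkpoint directory, preferring an explicit root."""
    root = ckpt_root_dir if ckpt_root_dir else DEFAULT_CKPT_ROOT
    return os.path.join(root, experiment_name)


def resume_path_buggy(experiment_name: str, ckpt_root_dir: Optional[str] = None) -> str:
    # Bug pattern: ckpt_root_dir is accepted but never forwarded,
    # so the default root is always used.
    return os.path.join(resolve_checkpoints_dir(experiment_name), CKPT_FILE_NAME)


def resume_path_fixed(experiment_name: str, ckpt_root_dir: Optional[str] = None) -> str:
    # Fix pattern: forward ckpt_root_dir so the external root is honoured.
    return os.path.join(resolve_checkpoints_dir(experiment_name, ckpt_root_dir), CKPT_FILE_NAME)


if __name__ == "__main__":
    external_root = os.path.join(tempfile.gettempdir(), "external_ckpt_root")
    print("buggy:", resume_path_buggy("my_experiment", external_root))
    print("fixed:", resume_path_fixed("my_experiment", external_root))
```

With an external root supplied, the buggy variant still points under the default location, which is why resuming looks for a checkpoint that was never written there.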


@shaydeci shaydeci marked this pull request as ready for review May 8, 2023 08:47
Louis-Dupont (Contributor) left a comment


LGTM

@shaydeci shaydeci merged commit 2a8b136 into master May 8, 2023
@shaydeci shaydeci deleted the bug/SG-764_wrong_ckpt_when_resuming_with_external_ckpt_root_dir branch May 8, 2023 12:05