Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create saving-and-loading-Pytorch-checkpoints.rst #1364

Merged
merged 7 commits into from
Sep 6, 2022
Merged

Create saving-and-loading-Pytorch-checkpoints.rst #1364

merged 7 commits into from
Sep 6, 2022

Conversation

Linardos
Copy link
Contributor

@Linardos Linardos commented Aug 9, 2022

Reference Issues/PRs

What does this implement/fix? Explain your changes.

A minimal example for storing a PyTorch checkpoint. Several extra steps are added compared to the original saving progress example. Furthermore a way to load the latest checkpoint is added as well.

Any other comments?

This was developed with Daniel's help. We thought it was a resolved issue but apparently when loading the checkpoint, performance drops. It's as if training from scratch. We need to debug this further.

Copy link
Member

@danieljanes danieljanes left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the PR @Linardos ! Looks good overall, I've added a few small suggestions. Could you re-compile the docs and check if everything looks good?

Linardos and others added 4 commits August 31, 2022 19:30
Co-authored-by: Daniel J. Beutel <daniel@adap.com>
Co-authored-by: Daniel J. Beutel <daniel@adap.com>
Co-authored-by: Daniel J. Beutel <daniel@adap.com>
Co-authored-by: Daniel J. Beutel <daniel@adap.com>
@Linardos
Copy link
Contributor Author

Changes committed! I look forward to your thoughts after running the code. I hope we can resolve the issue.

@danieljanes
Copy link
Member

@Linardos, lightning fast! Did you have a chance to build the docs and check if the formatting looks good?

@Linardos
Copy link
Contributor Author

Yeap, I used a VS code plugin that allowed me to built them as I worked and your changes were small so it was easy to check pretty quickly.

@danieljanes danieljanes merged commit 2d4e434 into adap:main Sep 6, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants