-
Notifications
You must be signed in to change notification settings - Fork 26.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix _load_pretrained_model
#24200
Fix _load_pretrained_model
#24200
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Could you explain a bit more why we need to remove this line? Thanks!
This line what something I added in my previous PR thinking that we forgot to tie the weights. But I guess it was done intentionally as I see that one test failed and generated garbage value. Here the error that we get from the test:
|
The documentation is not available anymore as the PR was closed or merged. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The whole marian test suite now pass. Since you mentioned that line is added in the previous PR, the change here LGTM. Leave @sgugger to have a final judge though.
Thank you for the quick fix.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Seems very weird that this changes anything one way or the other but let.s go.
BTW, let's make the PR title a bit more precise, sth like |
Yeah, I will explore a little bit more why we have this weird behavior before merging. |
What does this PR do ?
Fixes the following test from my old PR Add check for tied parameters (#24029):
RUN_SLOW=1 python3 -m pytest -v tests/models/marian/test_modeling_marian.py::TestMarian_FI_EN_V2::test_batch_generation_en_fr