Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

New logic for passing past_key_value #177

Merged
merged 1 commit into from
Nov 10, 2023
Merged

Conversation

younesbelkada
Copy link
Collaborator

What does this PR do ?

On par with huggingface/transformers#27411 / an alternative to slicing input hidden states. Since this can be done from transformers level, I propose an alternative on autoawq side.

creates dummy past key values so that with transformers >= 4.35.0 the generation works smoothly. I did not tried it with earlier versions of transformers but it should work fine

cc @casper-hansen

@casper-hansen
Copy link
Owner

This should be fine as we have bumped the dependency on transformers to be >= 4.35.0 because of large breaking changes.

@casper-hansen casper-hansen merged commit 5db86ec into main Nov 10, 2023
@younesbelkada younesbelkada deleted the younesbelkada-patch-1 branch November 10, 2023 13:17
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants