Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add tests for the HuggingFace integrations #105

Merged
merged 4 commits into from
May 15, 2023
Merged

Add tests for the HuggingFace integrations #105

merged 4 commits into from
May 15, 2023

Conversation

rlouf
Copy link
Member

@rlouf rlouf commented May 15, 2023

This PR adds tests for the HuggingFace integrations, for multiple samples and controlled generation among other things. Closes #97.

rlouf added 2 commits May 15, 2023 13:18
We currently have non-digit tokens in the sequences output by
the models when `type` is "int" or "float".

Somehow we do not get the same character when iterating over the
vocabulary and when using `tokenizer.decode`; since the latter
is used to decode the final sentence we use this when creating
the mask.
@rlouf rlouf merged commit aec7639 into main May 15, 2023
@rlouf rlouf deleted the test-transformers branch May 15, 2023 12:17
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Write tests for the generative model integrations
1 participant