[Text Generation] Causal Mask Feature Branch #1126

Merged
bfineran merged 12 commits into main from feature/damian/causal_mask_fb
Jul 27, 2023

dbogunowicz (Contributor) commented Jul 19, 2023

This feature branch aggregates a series of PRs that share a common goal: enabling causal mask support in the text generation pipeline for all models that require it.

On its own, this PR introduces a function that computes a causal mask from the model inputs.
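
For readers unfamiliar with the idea, here is a minimal NumPy sketch of how a causal mask could be derived from `input_ids` and `attention_mask`. It is not the function added by this PR. The name `sketch_causal_mask`, the output shape `(batch, 1, num_new_tokens, sequence_length)`, and the assumption that the new tokens occupy the right-most slots of a left-padded `attention_mask` are all illustrative choices.

```python
import numpy as np


def sketch_causal_mask(input_ids: np.ndarray, attention_mask: np.ndarray) -> np.ndarray:
    """Hypothetical helper (not the PR's function).

    input_ids:      (batch, num_new_tokens) tokens processed in this pass
    attention_mask: (batch, sequence_length) 1 for real tokens, 0 for padding
    Assumes the new tokens sit in the last num_new_tokens slots of the sequence.
    """
    _, num_new_tokens = input_ids.shape
    _, sequence_length = attention_mask.shape

    # Absolute position of every key slot: shape (1, sequence_length)
    key_pos = np.arange(sequence_length)[None, :]
    # Absolute position of every query (new) token: shape (num_new_tokens, 1)
    query_pos = (sequence_length - num_new_tokens + np.arange(num_new_tokens))[:, None]

    # Causal rule: a query may only see keys at or before its own position
    causal = key_pos <= query_pos  # (num_new_tokens, sequence_length)

    # Also hide padded slots, broadcasting over the batch dimension
    mask = causal[None, :, :] & (attention_mask[:, None, :] != 0)

    # Add a singleton head dimension: (batch, 1, num_new_tokens, sequence_length)
    return mask[:, None, :, :].astype(np.int64)


# Three new tokens, one previously cached token, one padded slot on the left
example = sketch_causal_mask(
    np.array([[5, 6, 7]]),
    np.array([[0, 1, 1, 1, 1]]),
)
print(example[0, 0])
```

With a single new token the mask reduces to the attention mask itself, while a multi-token prefill yields the staircase pattern in which each row unmasks one more position than the row above it.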

Includes

Testing

Refer to the PRs above

dbogunowicz and others added 5 commits July 20, 2023 08:08
* initial commit

* clean up the PR

* working implementation

* Ben's review comments
* initial commit

* clean up the PR

* working implementation

* initial implementation, hacky, let's clean it up

* ready for review

* few tiny quality improvements

* simplify the logic for computing num of unmasked bits for creating attention_mask for the multitoken prefill
bfineran previously approved these changes Jul 25, 2023
bfineran previously approved these changes Jul 27, 2023
bfineran merged commit c988fe7 into main Jul 27, 2023
7 checks passed
bfineran deleted the feature/damian/causal_mask_fb branch July 27, 2023 15:33
2 participants