-
Notifications
You must be signed in to change notification settings - Fork 140
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[KV Cache Injection] Causal Mask implementation for OPT and CodeGen (#…
…1677) * initial commit * [KV Cache Injection] Causal Mask for CodeGen (#1676) * initial implementation; testing now * fix a small blunder * cleanup --------- Co-authored-by: bogunowicz@arrival.com <bogunowicz@arrival.com> * [KV Cache Injection] Causal Mask for OPT (#1688) * initial implementation; testing now * fix a small blunder * cleanup * initial implementation * on to testing with deepsparse --------- Co-authored-by: bogunowicz@arrival.com <bogunowicz@arrival.com> * replace boolean causal mask for int64 causal mask * better logging info * allow transformations to be also a list --------- Co-authored-by: bogunowicz@arrival.com <bogunowicz@arrival.com>
- Loading branch information
1 parent
ebc4ac6
commit b1d5ea2
Showing
9 changed files
with
550 additions
and
294 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
48 changes: 0 additions & 48 deletions
48
src/sparseml/exporters/transforms/kv_cache/positions_adjustment_base.py
This file was deleted.
Oops, something went wrong.
89 changes: 0 additions & 89 deletions
89
src/sparseml/exporters/transforms/kv_cache/positions_adjustment_codegen.py
This file was deleted.
Oops, something went wrong.
133 changes: 0 additions & 133 deletions
133
src/sparseml/exporters/transforms/kv_cache/positions_adjustment_opt.py
This file was deleted.
Oops, something went wrong.
Oops, something went wrong.