Releases: feifeibear/long-context-attention
Releases · feifeibear/long-context-attention
0.3.5
What's Changed
- revert version 0.3.2 by @feifeibear in #83
- version 0.3.5 by @feifeibear in #84
Full Changelog: 0.3.3...0.3.5
0.3.2 released
What's Changed
- remove amd installation to an individual doc by @feifeibear in #76
- auto publish python package when release on github by @feifeibear in #77
- version 0.3.2 by @feifeibear in #78
- remove useless workflow by @feifeibear in #79
- version to 0.3.2 by @feifeibear in #80
- polish publish workflow by @feifeibear in #81
Full Changelog: v0.3.1...0.3.2
v0.3.1 released at 2024.09.14
stripe_extract_local, basic_extract_local, zigzag_extract_local works for tensors dimension >=2.
v0.3 released on 27th August 2024!
upgrade flash_attn >= 2.6.0
v0.2 released on 24th June 2024!
- Ulysses supports T4 and V100.
- Updates some directory structures.
v0.1
Sequence parallel attention adopting a hybrid ulysses and ring attention approach.
Support GQA
Support QKV packed.