v1.8

DefTruth released this 05 Aug 02:33

· 34 commits to main since this release

What's Changed

🔥[flashinfer] FlashInfer: Kernel Library for LLM Serving(@flashinfer-ai) by @DefTruth in #24
🔥[Palu] Palu: Compressing KV-Cache with Low-Rank Projection(@nycu.edu… by @DefTruth in #25
🔥[SentenceVAE] SentenceVAE: Faster, Longer and More Accurate Inferenc… by @DefTruth in #26
Bump up to v1.8 by @DefTruth in #27

Full Changelog: v1.7...v1.8

Contributors

DefTruth and flashinfer-ai

Assets 2