Infini-AI-Lab

Popular repositories

  1. Sequoia Public

    scalable and robust tree-based speculative decoding algorithm

    Python 304 31

  2. TriForce Public

    [COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding

    Python 209 12

  3. MagicDec Public

    Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding

    JavaScript 59 4

  4. Sirius Public

    Sirius is an efficient correction mechanism that significantly boosts Contextual Sparsity models on reasoning tasks while maintaining their efficiency gain.

    Python 13 2

  5. MagicPiG Public

    10

  6. Sequoia-Page Public

    JavaScript

Repositories

Showing 9 of 9 repositories
  • MagicDec Public

    Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding

    JavaScript 59 Apache-2.0 4 3 0 Updated Sep 28, 2024
  • MagicDec-part1 Public

    Speculative decoding for high-throughput long-context inference

    JavaScript 0 Apache-2.0 0 0 0 Updated Sep 10, 2024
  • Sirius Public

    Sirius is an efficient correction mechanism that significantly boosts Contextual Sparsity models on reasoning tasks while maintaining their efficiency gain.

    Python 13 2 0 0 Updated Sep 10, 2024
  • MagicDec-part2 Public

    MagicDec: Breaking the Latency-Throughput Tradeoff for Long Contexts with Speculative Decoding

    JavaScript 0 Apache-2.0 0 0 0 Updated Sep 5, 2024
  • TriForce Public

    [COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding

  • Sequoia Public

    scalable and robust tree-based speculative decoding algorithm

    Python 304 31 7 3 Updated Aug 13, 2024
  • MagicPiG Public
    10 0 1 0 Updated Jul 26, 2024
  • lm-evaluation-harness Public Forked from EleutherAI/lm-evaluation-harness

    A framework for few-shot evaluation of language models.

    Python 0 MIT 1,751 0 0 Updated Jun 10, 2024
  • Sequoia-Page Public
    JavaScript 0 0 0 0 Updated May 21, 2024

People

This organization has no public members.
