Qubitium

Qubitium-ModelCloud Qubitium

Golang, Python, Kotlin, Swift. I prefer strongly typed languages and I do not worship PEP. @ModelCloudAi

34 followers · 49 following

ModelCloud.ai
Earth/Epoch 2.0
https://modelcloud.ai
@qubitium

Pinned

ModelCloud/GPTQModel (Public)

An easy-to-use LLM quantization and inference toolkit based on the GPTQ algorithm (weight-only quantization).

Python · 66 stars · 12 forks

sgl-project/sglang (Public)

SGLang is a fast serving framework for large language models and vision language models.

Python · 4.6k stars · 296 forks

vllm-project/vllm (Public)

A high-throughput and memory-efficient inference and serving engine for LLMs.

Python · 25.5k stars · 3.7k forks

AutoGPTQ/AutoGPTQ (Public)

An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.

Python · 4.3k stars · 453 forks

flashinfer-ai/flashinfer (Public)

FlashInfer: Kernel Library for LLM Serving

Cuda · 1k stars · 92 forks

Dao-AILab/flash-attention (Public)

Fast and memory-efficient exact attention.

Python · 13.1k stars · 1.2k forks