Repositories list
56 repositories
vllm (Public)
upstream-transformers (Public)
mteb (Public)
AutoFP8 (Public)
lm-evaluation-harness (Public)
OmniQuant (Public)
quant_kernel_benchmarks (Public)
flash-attention (Public)
temp-AutoGPTQ (Public)
upstream-llm-foundry (Public)
yolov5 (Public)
upstream-composer (Public)
MixEval (Public)
mamba (Public)
causal-conv1d (Public)
evalplus (Public)
sparseml (Public): Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
inference (Public)
examples (Public)
sparsezoo (Public): Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes
deepsparse (Public): Sparsity-aware deep learning inference runtime for CPUs
cutlass (Public)
llm-foundry (Public)
nm-vllm-utils (Public)