zhentaoyu

Follow

🎯

Focusing

zhentaoyu zhentaoyu

🎯

Focusing

Follow

3 followers · 9 following

intel
Shanghai
densecollections.top

Achievements

Achievements

Block or Report

Block or report zhentaoyu

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

intel/neural-speed intel/neural-speed Public

An innovative library for efficient LLM inference via low-bit quantization

C++ 326 34
intel/intel-extension-for-transformers intel/intel-extension-for-transformers Public

⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡

Python 2.1k 204
ggerganov/llama.cpp ggerganov/llama.cpp Public

LLM inference in C/C++

C++ 63.1k 9k
leejet/stable-diffusion.cpp leejet/stable-diffusion.cpp Public

Stable Diffusion in pure C/C++

C++ 3k 249
huggingface/optimum-habana huggingface/optimum-habana Public

Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)

Python 134 159
intel/neural-compressor intel/neural-compressor Public

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

Python 2.1k 248