Skip to content

Speedup model init on CPU (by 10x+ for llama-3-8B as one example) #2803

Speedup model init on CPU (by 10x+ for llama-3-8B as one example)

Speedup model init on CPU (by 10x+ for llama-3-8B as one example) #2803