Skip to content

Releases: ggerganov/llama.cpp

b3735

12 Sep 02:17
df4b794
Compare
Choose a tag to compare
cann: Fix error when running a non-exist op (#9424)

b3733

11 Sep 17:25
1b28061
Compare
Choose a tag to compare
llama : skip token bounds check when evaluating embeddings (#9437)

b3731

11 Sep 12:10
0996c55
Compare
Choose a tag to compare
llava : correct args for minicpmv-cli (#9429)

b3729

11 Sep 11:15
67155ab
Compare
Choose a tag to compare
feat: Implements retrying logic for downloading models using --model-…

…url flag (#9255)

* feat: Implements retrying logic for downloading models using --model-url flag

* Update common/common.cpp

Co-authored-by: Xuan Son Nguyen <thichthat@gmail.com>

* Update common/common.cpp

Co-authored-by: Xuan Son Nguyen <thichthat@gmail.com>

* apply comments

* implements a retry function to avoid duplication

* fix editorconfig

* change function name

---------

Co-authored-by: farbod <farbod.bjary82@gmail.com>
Co-authored-by: Xuan Son Nguyen <thichthat@gmail.com>
Co-authored-by: slaren <slarengh@gmail.com>
Co-authored-by: Xuan Son Nguyen <son@huggingface.co>

b3728

11 Sep 09:38
5af118e
Compare
Choose a tag to compare
CUDA: fix --split-mode row race condition (#9413)

b3727

11 Sep 08:53
d2b496b
Compare
Choose a tag to compare
batched-bench : remove unused code (#9305)

b3726

11 Sep 03:02
b34e023
Compare
Choose a tag to compare
musa: remove Clang builtins mapping (#9421)

Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>

b3725

11 Sep 02:17
51b6038
Compare
Choose a tag to compare
sycl : update support conditions  (#9394)

* sycl : update support condition to im2col

Signed-off-by: Alberto Cabrera <alberto.cabrera@codeplay.com>

* Added TODO to remind supporting FP32 im2col

---------

Signed-off-by: Alberto Cabrera <alberto.cabrera@codeplay.com>

b3723

10 Sep 22:13
6cd4e03
Compare
Choose a tag to compare
arg : bring back missing ifdef (#9411)

* arg : bring back missing ifdef

* replace with llama_supports_gpu_offload

b3722

10 Sep 22:06
8d300bd
Compare
Choose a tag to compare
enable --special arg for llama-server (#9419)

Co-authored-by: matteo serva <matteo.serva@gmail.com>