Releases · ZipXuan/llama.cpp

19 Nov 11:43

a88ad00

b4130 Latest

Latest

llama : add OLMo November 2024 support (#10394)

* Add OLMo November 2024 constants

* Add OLMo November 2024 converter

* Add loading of OLMo November 2024 tensors and hyper parameters

* Add building of OLMo November 2024 model

Assets 21

cudart-llama-bin-win-cu11.7.1-x64.zip

293 MB 2024-11-19T11:43:03Z
cudart-llama-bin-win-cu12.2.0-x64.zip

413 MB 2024-11-19T11:43:10Z
llama-b1-bin-win-hip-x64-gfx1030.zip

236 MB 2024-11-19T11:43:18Z
llama-b1-bin-win-hip-x64-gfx1100.zip

237 MB 2024-11-19T11:43:23Z
llama-b1-bin-win-hip-x64-gfx1101.zip

237 MB 2024-11-19T11:43:29Z
llama-b4130-bin-macos-arm64.zip

51 MB 2024-11-19T11:43:33Z
llama-b4130-bin-macos-x64.zip

52 MB 2024-11-19T11:43:35Z
llama-b4130-bin-ubuntu-x64.zip

56.1 MB 2024-11-19T11:43:37Z
llama-b4130-bin-win-avx-x64.zip

8.1 MB 2024-11-19T11:43:38Z
llama-b4130-bin-win-avx2-x64.zip

8.1 MB 2024-11-19T11:43:39Z
Source code (zip)

2024-11-19T09:04:08Z
Source code (tar.gz)

2024-11-19T09:04:08Z

19 Nov 05:11

github-actions

b4127

557924f

b4127

sycl: Revert MUL_MAT_OP support changes (#10385)

Assets 21

23 Sep 03:58

github-actions

b3804

c35e586

b3804

musa: enable building fat binaries, enable unified memory, and disabl…

Assets 22

22 Sep 13:09

github-actions

b3802

a5b57b0

b3802

CUDA: enable Gemma FA for HIP/Pascal (#9581)

Assets 22

22 Sep 04:08

github-actions

b3801

ecd5d6b

b3801

llama: remove redundant loop when constructing ubatch (#9574)

Assets 22

29 Jul 14:51

github-actions

b3487

439b3fc

b3487

cuda : organize vendor-specific headers into vendors directory (#8746)

Signed-off-by: Xiaodong Ye <[email protected]>

Assets 20

29 Jul 08:50

github-actions

b3486

0832de7

b3486

[SYCL] add conv support (#8688)

Assets 20

13 Jul 08:26

github-actions

b3384

4e24cff

b3384

server : handle content array in chat API (#8449)

* server : handle content array in chat API

* Update examples/server/utils.hpp

Co-authored-by: Xuan Son Nguyen <[email protected]>

---------

Co-authored-by: Xuan Son Nguyen <[email protected]>

Assets 20

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Releases: ZipXuan/llama.cpp

b4130

b4127

b3804

b3802

b3801

b3487

b3486

b3384