Releases: ZipXuan/llama.cpp
b4130
b4127
sycl: Revert MUL_MAT_OP support changes (#10385)
b3804
musa: enable building fat binaries, enable unified memory, and disabl…
b3802
CUDA: enable Gemma FA for HIP/Pascal (#9581)
b3801
llama: remove redundant loop when constructing ubatch (#9574)
b3487
cuda : organize vendor-specific headers into vendors directory (#8746)
Signed-off-by: Xiaodong Ye
b3486
[SYCL] add conv support (#8688)
b3384
server : handle content array in chat API (#8449)
* server : handle content array in chat API
* Update examples/server/utils.hpp
Co-authored-by: Xuan Son Nguyen