sync : llama.cpp + fix file structure #998

ggerganov · 2024-10-26T06:45:32Z

* CUDA: fix MMQ for non-contiguous src0, add tests * revise test code

* metal : support permuted matrix multiplicaions ggml-ci * cont : use nb01 directly for row steps ggml-ci * cont : add comments [no ci] * metal : minor refactor * metal : minor

ggml-ci

ggerganov and others added 5 commits October 26, 2024 09:39

scripts : fix sync scripts (amx)

a63b5e5

CUDA: fix MMQ for non-contiguous src0, add tests (llama/10021)

4a5d966

* CUDA: fix MMQ for non-contiguous src0, add tests * revise test code

CUDA: fix insufficient buffer clearing for MMQ (llama/10032)

e7d0fb3

metal : support permuted matrix multiplicaions (llama/10033)

52ae1ed

* metal : support permuted matrix multiplicaions ggml-ci * cont : use nb01 directly for row steps ggml-ci * cont : add comments [no ci] * metal : minor refactor * metal : minor

sync : llama.cpp

cf50bf9

ggerganov force-pushed the sync branch from 1611ad0 to f318dc4 Compare October 26, 2024 06:46

mingfeima and others added 2 commits October 26, 2024 09:49

ggml : add AMX backend (llama/8998)

fb1d2b9

ggml : remove sync artifacts

e15162b

ggml-ci

ggerganov force-pushed the sync branch from f318dc4 to e15162b Compare October 26, 2024 06:49

ggerganov merged commit 162e232 into master Oct 26, 2024
10 checks passed

ggerganov deleted the sync branch October 26, 2024 07:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

sync : llama.cpp + fix file structure #998

sync : llama.cpp + fix file structure #998

ggerganov commented Oct 26, 2024

sync : llama.cpp + fix file structure #998

sync : llama.cpp + fix file structure #998

Conversation

ggerganov commented Oct 26, 2024