Skip to content

Pull requests: ggerganov/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

vulkan: copy iq4_nl LUT into shared memory
#10409 opened Nov 19, 2024 by jeffbolznv Loading…
2 of 4 tasks
sycl : permuted mul_mat through oneMKL SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#10408 opened Nov 19, 2024 by Alcpz Loading…
2 of 4 tasks
llama : handle KV shift for recurrent models
#10402 opened Nov 19, 2024 by ggerganov Loading…
Update recommended release version to 4040 documentation Improvements or additions to documentation SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#10395 opened Nov 19, 2024 by NeoZhangJianyu Loading…
2 of 4 tasks
vulkan: further optimize mul_mat_vec using larger loads
#10387 opened Nov 18, 2024 by jeffbolznv Loading…
2 of 4 tasks
speculative : refactor and add a simpler example demo Demonstrate some concept or idea, not intended to be merged examples
#10362 opened Nov 17, 2024 by ggerganov Draft
Add support for Qwen2VL build Compilation issues examples ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs python python script changes
#10361 opened Nov 17, 2024 by HimariO Draft
3 of 6 tasks
common: compile shared lib, and export some c functions
#10353 opened Nov 17, 2024 by KenForever1 Loading…
2 of 4 tasks
Refactor/tinyblas build Compilation issues demo Demonstrate some concept or idea, not intended to be merged documentation Improvements or additions to documentation ggml changes relating to the ggml tensor library for machine learning
#10343 opened Nov 16, 2024 by Djip007 Draft
2 of 4 tasks
chore : Fix the error when compiling rocm build on windows using cmake documentation Improvements or additions to documentation
#10310 opened Nov 15, 2024 by cocochick Loading…
2 of 4 tasks
Add .clang-format file
#10308 opened Nov 15, 2024 by ericcurtin Loading…
2 of 4 tasks
Introduce llama-run examples
#10291 opened Nov 14, 2024 by ericcurtin Loading…
2 of 4 tasks
speculative : experiments with Qwen2.5-Coder demo Demonstrate some concept or idea, not intended to be merged examples
#10290 opened Nov 14, 2024 by ggerganov Draft
Add try/except to test-tokenizer-random.py python python script changes testing Everything test related
#10276 opened Nov 13, 2024 by rmusser01 Loading…
2 of 4 tasks
Test tokenizer-0.py rewrite python python script changes testing Everything test related
#10275 opened Nov 13, 2024 by rmusser01 Loading…
2 of 4 tasks
readme : add option, update default value, fix formatting documentation Improvements or additions to documentation examples server
#10271 opened Nov 12, 2024 by pothitos Loading…
2 of 4 tasks
support for llguidance grammars
#10224 opened Nov 9, 2024 by mmoskal Draft
CANN: Add Ascend CANN build ci Ascend NPU issues specific to Ascend NPUs devops improvements to build systems and github actions
#10217 opened Nov 8, 2024 by xuedinge233 Loading…
CANN Support Ascend310P to accelerate F32 and F16 LLM Model Ascend NPU issues specific to Ascend NPUs enhancement New feature or request
#10216 opened Nov 8, 2024 by leo-pony Draft
2 of 4 tasks
docs: add doxygen documentation build Compilation issues
#10209 opened Nov 8, 2024 by sparkleholic Loading…
2 of 4 tasks
Draft: vulkan: Add VK_NV_cooperative_matrix2 support for mul_mat and FlashAttention2 ggml changes relating to the ggml tensor library for machine learning testing Everything test related Vulkan Issues specific to the Vulkan backend
#10206 opened Nov 7, 2024 by jeffbolznv Loading…
2 of 4 tasks
Introduce IQ4_NL_4_4 format and its neon implementation examples ggml changes relating to the ggml tensor library for machine learning
#10196 opened Nov 6, 2024 by FanShupei Loading…
2 of 4 tasks
ProTip! Exclude everything labeled bug with -label:bug.