Skip to content

Actions: PygmalionAI/aphrodite-engine

Deploy VitePress site to Pages

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
402 workflow runs
402 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

api: support LoRA lineage and base model metadata management (#1072)
Deploy VitePress site to Pages #402: Commit 6212072 pushed by AlpinDale
January 3, 2025 04:25 46s main
January 3, 2025 04:25 46s
rocm: enable multi-step scheduling for rocm (#1071)
Deploy VitePress site to Pages #401: Commit d9d287a pushed by AlpinDale
January 3, 2025 02:46 45s main
January 3, 2025 02:46 45s
fix: Phi3.5 Mini and MoE LoRA inference (#1070)
Deploy VitePress site to Pages #400: Commit ec17b6c pushed by AlpinDale
January 3, 2025 02:39 47s main
January 3, 2025 02:39 47s
vlm: add support for molmo vision model (#1069)
Deploy VitePress site to Pages #399: Commit acc0c72 pushed by AlpinDale
January 3, 2025 02:34 50s main
January 3, 2025 02:34 50s
build: guard against changes in cuda library name (#1068)
Deploy VitePress site to Pages #398: Commit 7dd001e pushed by AlpinDale
January 3, 2025 02:20 55s main
January 3, 2025 02:20 55s
sampler: simplify logits resort in _apply_top_k_top_p (#1067)
Deploy VitePress site to Pages #397: Commit ca7028d pushed by AlpinDale
January 3, 2025 02:15 45s main
January 3, 2025 02:15 45s
rocm: add support for FP8 KV cache in the custom paged attention kker…
Deploy VitePress site to Pages #396: Commit 61aed09 pushed by AlpinDale
January 3, 2025 02:12 55s main
January 3, 2025 02:12 55s
api: enable MQAphroditeEngine for embedding models (#1065)
Deploy VitePress site to Pages #395: Commit 12b0059 pushed by AlpinDale
January 3, 2025 02:00 47s main
January 3, 2025 02:00 47s
fix: encoder-decoder models for beam search (#1064)
Deploy VitePress site to Pages #394: Commit 314fa7f pushed by AlpinDale
January 2, 2025 09:54 52s main
January 2, 2025 09:54 52s
api: non-zero exit code if MQ engine startup fails (#1063)
Deploy VitePress site to Pages #393: Commit 34cf9b7 pushed by AlpinDale
January 1, 2025 16:08 52s main
January 1, 2025 16:08 52s
rocm: add more quants, fix _scaled_mm call (#1062)
Deploy VitePress site to Pages #392: Commit 92cee43 pushed by AlpinDale
January 1, 2025 16:00 57s main
January 1, 2025 16:00 57s
readme: add attribution to Ruliad
Deploy VitePress site to Pages #391: Commit b12d5c0 pushed by AlpinDale
January 1, 2025 15:11 55s main
January 1, 2025 15:11 55s
distributed: bind only to 127.0.0.1 for local-only usage (#1061)
Deploy VitePress site to Pages #390: Commit f81e7d7 pushed by AlpinDale
January 1, 2025 08:32 51s main
January 1, 2025 08:32 51s
core: support prompt logprobs in multi-step (#1060)
Deploy VitePress site to Pages #389: Commit 58aff37 pushed by AlpinDale
January 1, 2025 08:25 49s main
January 1, 2025 08:25 49s
feat: introduce MQAphroditeEngine (#1056)
Deploy VitePress site to Pages #388: Commit 9a7d551 pushed by AlpinDale
December 31, 2024 00:01 49s main
December 31, 2024 00:01 49s
fix: add missing logit index increment in sampling metadata prep (#1059)
Deploy VitePress site to Pages #387: Commit 0b5588d pushed by AlpinDale
December 30, 2024 14:20 51s main
December 30, 2024 14:20 51s
build: fix compilation for causal_conv1d_fwd kernel signature (#1057)
Deploy VitePress site to Pages #386: Commit 525edc1 pushed by AlpinDale
December 27, 2024 22:03 46s main
December 27, 2024 22:03 46s
mamba: enable continuous batching for mamba kernels (#1055)
Deploy VitePress site to Pages #385: Commit 9bdf8d5 pushed by AlpinDale
December 27, 2024 17:21 47s main
December 27, 2024 17:21 47s
fix: granite logit scale in logit computation (#1054)
Deploy VitePress site to Pages #384: Commit 11f49b5 pushed by AlpinDale
December 27, 2024 16:20 43s main
December 27, 2024 16:20 43s
api: add mistral function calling format to all models loaded with "m…
Deploy VitePress site to Pages #383: Commit 1264e0b pushed by AlpinDale
December 27, 2024 16:15 43s main
December 27, 2024 16:15 43s
quant: add tensor parallel support for bitsandbytes (#1052)
Deploy VitePress site to Pages #382: Commit b3f9ab3 pushed by AlpinDale
December 27, 2024 16:09 49s main
December 27, 2024 16:09 49s
core: add cuda graph support for encoder-decoder models (#1051)
Deploy VitePress site to Pages #381: Commit a985143 pushed by AlpinDale
December 27, 2024 15:53 45s main
December 27, 2024 15:53 45s
torch.compile: register all-reduce operations as custom ops (#1050)
Deploy VitePress site to Pages #380: Commit 239a8ca pushed by AlpinDale
December 27, 2024 06:05 48s main
December 27, 2024 06:05 48s
chore: remove dead code from triton sampling kernels (#1049)
Deploy VitePress site to Pages #379: Commit 4593a3b pushed by AlpinDale
December 27, 2024 05:54 43s main
December 27, 2024 05:54 43s
kernel: asymmetric AQ AZP quantization kernels (#1048)
Deploy VitePress site to Pages #378: Commit 8976805 pushed by AlpinDale
December 27, 2024 05:51 44s main
December 27, 2024 05:51 44s