Actions: PygmalionAI/aphrodite-engine

Deploy VitePress site to Pages

406 workflow runs

fix: crash when cancelling a request with multi-step (#977)
Deploy VitePress site to Pages #306: Commit d4e78a4 pushed by AlpinDale
December 24, 2024 04:35 44s main
fix: modelscope for VLMs (#976)
Deploy VitePress site to Pages #305: Commit 3e4e766 pushed by AlpinDale
December 24, 2024 04:18 46s main
models: add support for Phi3 MoE
Deploy VitePress site to Pages #304: Commit 201db10 pushed by AlpinDale
December 24, 2024 04:16 44s main
tpu: fix TPU type api (#975)
Deploy VitePress site to Pages #303: Commit 032974a pushed by AlpinDale
December 24, 2024 03:34 54s main
core: fix chunked prefill not being enabled by default for long conte…
Deploy VitePress site to Pages #302: Commit 510ae5b pushed by AlpinDale
December 24, 2024 03:32 55s main
vlm: increase the default max_num_batched_tokens for multimodal mod…
Deploy VitePress site to Pages #301: Commit b3f6eeb pushed by AlpinDale
December 24, 2024 03:25 50s main
tests: update internvl test for #971 (#972)
Deploy VitePress site to Pages #300: Commit 7eeee77 pushed by AlpinDale
December 24, 2024 03:17 51s main
vlm: add tensor parallel support for vision transformer models (#971)
Deploy VitePress site to Pages #299: Commit b4a1e2f pushed by AlpinDale
December 24, 2024 01:54 57s main
tpu: support single and multi-host TPUs on GKE and RayServe (#970)
Deploy VitePress site to Pages #298: Commit 61103b9 pushed by AlpinDale
December 23, 2024 14:38 47s main
fix: prometheus.yaml path in monitoring example (#969)
Deploy VitePress site to Pages #297: Commit b26a014 pushed by AlpinDale
December 23, 2024 14:32 48s main
tpu: add support for async postprocessing (#968)
Deploy VitePress site to Pages #296: Commit 5bec8fb pushed by AlpinDale
December 23, 2024 14:10 49s main
distributed: support pipeline parallelism for internvl and internlm2 …
Deploy VitePress site to Pages #295: Commit a8bdd48 pushed by AlpinDale
December 23, 2024 09:00 50s main
ci: bump to 0.6.5 (#964)
Deploy VitePress site to Pages #294: Commit cbd51a2 pushed by AlpinDale
December 22, 2024 06:42 47s main
core: support logprobs with multi-step scheduling (#963)
Deploy VitePress site to Pages #293: Commit 0dfa6b6 pushed by AlpinDale
December 22, 2024 06:33 44s main
vlm: do not allow max_model_len overflow (#962)
Deploy VitePress site to Pages #292: Commit 34e8606 pushed by AlpinDale
December 22, 2024 02:05 46s main
quant: support pre-quanted bitsandbytes checkpoints (#961)
Deploy VitePress site to Pages #291: Commit 6bdff60 pushed by AlpinDale
December 22, 2024 02:01 53s main
neuron: support for context length and token bucketing (#960)
Deploy VitePress site to Pages #290: Commit ba6d798 pushed by AlpinDale
December 22, 2024 01:26 45s main
quant: update tpu_int8 to use AphroditeParameters (#959)
Deploy VitePress site to Pages #289: Commit f4b62bf pushed by AlpinDale
December 22, 2024 01:20 46s main
fix: gguf vocab embddings in TP (#958)
Deploy VitePress site to Pages #288: Commit 9ff3239 pushed by AlpinDale
December 22, 2024 01:17 52s main
misc: extend cuda graph capture size for H200 (#957)
Deploy VitePress site to Pages #287: Commit 22b8096 pushed by AlpinDale
December 22, 2024 00:35 47s main
Revert "fix: issues with flashinfer fp8 kv (#950)" (#956)
Deploy VitePress site to Pages #286: Commit d6cbbba pushed by AlpinDale
December 22, 2024 00:24 45s main
core: support multi-step scheduling w/ async post-processor (#955)
Deploy VitePress site to Pages #285: Commit 5be6225 pushed by AlpinDale
December 22, 2024 00:20 56s main
spec decode: match the original rank computation impl for spec decodi…
Deploy VitePress site to Pages #284: Commit 564d197 pushed by AlpinDale
December 21, 2024 22:57 53s main
vlm: fix errors on ragged NestedTensors (#953)
Deploy VitePress site to Pages #283: Commit 2aabf8f pushed by AlpinDale
December 21, 2024 22:49 46s main
tpu: remove torch._dynamo.reset() (#952)
Deploy VitePress site to Pages #282: Commit ea59784 pushed by AlpinDale
December 21, 2024 22:39 46s main