Actions: EricLBuehler/candle-vllm

Showing runs from all workflows
292 workflow runs

Optimize quantized matmul in batch processing & update Q4K results
Continuous integration #248: Pull request #78 synchronize by guoqingbao
August 13, 2024 08:12 1m 24s develop
Support in-situ quantization (#77)
Continuous integration #247: Commit 8805b40 pushed by guoqingbao
August 13, 2024 06:25 3m 8s master
Support in-situ quantization
Continuous integration #246: Pull request #77 synchronize by guoqingbao
August 13, 2024 06:14 2m 12s develop
Support in-situ quantization
Continuous integration #245: Pull request #77 synchronize by guoqingbao
August 13, 2024 06:10 1m 28s develop
Support in-situ quantization
Continuous integration #244: Pull request #77 opened by guoqingbao
August 13, 2024 06:04 2m 34s develop
Continuous integration
Continuous integration #243: Scheduled
August 12, 2024 00:58 1m 41s master
Continuous integration
Continuous integration #242: Scheduled
August 5, 2024 00:57 1m 39s master
Parallel token sampling process & reset decoder after each generation
Continuous integration #241: Commit 1b2a1d9 pushed by guoqingbao
August 2, 2024 10:19 1m 4s master
Parallel token sampling process & reset decoder after each generation
Continuous integration #240: Pull request #74 opened by guoqingbao
August 2, 2024 10:18 1m 51s develop
Tweak sampling parameters & update batched generation results
Continuous integration #239: Commit 231fa82 pushed by guoqingbao
August 2, 2024 01:56 4m 47s master
Tweak sampling parameters & update batched generation results
Continuous integration #238: Pull request #73 opened by guoqingbao
August 2, 2024 01:55 1m 31s develop
Fix bug for space token decoding & remove redundant code
Continuous integration #237: Commit 405407e pushed by guoqingbao
August 1, 2024 01:39 2m 39s master
Fix bug for space token decoding & remove redundant code
Continuous integration #236: Pull request #72 opened by guoqingbao
July 31, 2024 09:14 1m 55s develop
Fix bug for token decoding & remove token padding
Continuous integration #235: Commit b2494e5 pushed by guoqingbao
July 31, 2024 05:11 1m 51s master
Fix bug for token decoding & remove token padding
Continuous integration #234: Pull request #71 opened by guoqingbao
July 31, 2024 05:09 2m 20s develop
Support streaming batched chat completion requests
Continuous integration #232: Commit e55e4b4 pushed by guoqingbao
July 30, 2024 07:46 1m 18s master
Support streaming batched chat completion requests
Continuous integration #231: Pull request #69 synchronize by guoqingbao
July 30, 2024 06:27 1m 28s develop
Support streaming batched chat completion requests
Continuous integration #230: Pull request #69 synchronize by guoqingbao
July 30, 2024 06:24 1m 51s develop
Support streaming batched chat completion requests
Continuous integration #229: Pull request #69 opened by guoqingbao
July 30, 2024 06:06 2m 5s develop
Continuous integration
Continuous integration #228: Scheduled
July 29, 2024 00:57 1m 25s master
Merge pull request #68 from EricLBuehler/develop
Continuous integration #227: Commit eb41272 pushed by guoqingbao
July 26, 2024 04:26 1m 49s master
Update demo video
Continuous integration #226: Pull request #68 opened by guoqingbao
July 26, 2024 04:26 2m 5s develop
Merge pull request #67 from EricLBuehler/develop
Continuous integration #225: Commit e922750 pushed by guoqingbao
July 26, 2024 03:46 1m 40s master
LLaMa3.1 chat completion
Continuous integration #224: Pull request #67 opened by guoqingbao
July 26, 2024 03:46 1m 52s develop
Merge pull request #66 from EricLBuehler/develop
Continuous integration #223: Commit 8476f17 pushed by guoqingbao
July 24, 2024 09:27 1m 26s master