Skip to content

vllm-0.0.3

Compare
Choose a tag to compare
@github-actions github-actions released this 04 Dec 14:28

vLLM is a high-performance, low-latency, and memory-efficient library designed for serving large language models (LLMs) at scale.