Skip to content

v1.5

Compare
Choose a tag to compare
@andrewsykim andrewsykim released this 09 Sep 18:05
· 52 commits to main since this release
c1633fa

Ray

Ray on GKE Terraform now uses the GKE Ray Add-on when creating GKE clusters (#781)

GKE image builder

Add mirror.gcr.io in containerd configuration to reduce docker rate limiting (#764)

Benchmarks

Add latency profile generator (#775)
Decrease scrape interval of metrics from TGI and DCGM to 15s (#772)
Enable Pod monitoring for vLLM (#796)

Testing

Add e2e tests for Hugging Face TGI tutorial (#780)