v1.5
Ray
Ray on GKE Terraform now uses the GKE Ray Add-on when creating GKE clusters (#781)
GKE image builder
Add mirror.gcr.io to the containerd configuration to reduce Docker rate limiting (#764)
Benchmarks
Add latency profile generator (#775)
Decrease the metrics scrape interval for TGI and DCGM to 15s (#772)
Enable Pod monitoring for vLLM (#796)
Testing
Add e2e tests for Hugging Face TGI tutorial (#780)