v1.1.2
Highlights
- RAG, Ray & Jupyter terraform solutions now support GKE Autopilot as the default cluster type #635
- The RAG solution has improved test coverage to (1) validate the notebook that generates vector embeddings as part of the E2E tests #524 (2) validate prompt responses from the LLM with context #511
What's Changed
- Cherrypick AP cloud build stockout mitigation onto release-1.1 by @artemvmin in #580
- Jupyter notebook cherry pick by @chiayi in #600
- quick fix or rag prompt test output by @chiayi in #612
- Fetch the cached weights for Mistral-7B-Instruct-v0.1 from GCS bucket… by @gongmax in #621
- Cherry-pick #599 and #618 to release-1.1 by @roberthbailey in #627
- Cherry-pick #631 to release-1.1 branch by @roberthbailey in #632
- Cherry-pick #635 to release-1.1 branch by @roberthbailey in #637
Full Changelog: v1.1.0...v1.1.2