Releases: LlamaEdge/rag-api-server

LlamaEdge-RAG 0.7.1

01 Jul 13:12

Major changes:

  • Update dependencies
    • endpoints v0.9.1
    • chat-prompts v0.8.2
    • llama-core v0.11.4

LlamaEdge-RAG 0.7.0

27 Jun 03:08

Major change:

  • (BREAKING) Update the response data of the /v1/info endpoint

LlamaEdge-RAG 0.6.6

15 Jun 01:50

Major change:

  • Work around DNS failures in some containers by changing the default value of the qdrant-url CLI option from http://localhost:6333 to http://127.0.0.1:6333
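
The reasoning behind this change can be illustrated with a small sketch: a URL whose host is an IP literal such as 127.0.0.1 needs no name resolution, while a host name such as localhost has to be resolved first, which is the step that can fail in a misconfigured container. The helper name `needs_dns_lookup` is hypothetical and is not part of rag-api-server; the snippet only uses the Python standard library.

```python
import ipaddress
from urllib.parse import urlsplit


def needs_dns_lookup(url: str) -> bool:
    """Return True if the URL's host is a name that must be resolved.

    Hypothetical helper for illustration only; not part of rag-api-server.
    """
    host = urlsplit(url).hostname
    if host is None:
        raise ValueError(f"no host in URL: {url!r}")
    try:
        # Succeeds only for IPv4/IPv6 literals, which need no resolver.
        ipaddress.ip_address(host)
    except ValueError:
        # Not an IP literal, so the name has to be resolved before connecting.
        return True
    return False


if __name__ == "__main__":
    for url in ("http://localhost:6333", "http://127.0.0.1:6333"):
        print(url, "needs name resolution:", needs_dns_lookup(url))
```

Deployments that pass their own qdrant-url value with a host name still depend on name resolution working inside the container.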

LlamaEdge-RAG 0.6.5

13 Jun 03:17
Pre-release

Major change:

  • Log the buffer of the RAG query request body when a bad request error is triggered

LlamaEdge-RAG 0.6.4

07 Jun 07:42
5a7ffc3
Pre-release

Major changes:

  • Improve RAG context retrieval
  • Upgrade the endpoints dependency to v0.8.1

LlamaEdge-RAG 0.6.3

06 Jun 05:03
Pre-release

Major change:

  • Add logging

LlamaEdge-RAG 0.6.2

25 May 04:00

Major change:

  • Update to llama-core v0.11.1

LlamaEdge-RAG 0.6.1

24 May 09:56
Pre-release

Major change:

  • Support an embedding prompt template for the embedding model

LlamaEdge-RAG 0.6.0

18 May 09:40

Major change:

  • Improve the --batch-size CLI option: set the batch size for the chat and embedding models, respectively

LlamaEdge-RAG 0.5.3

14 May 15:35

Major changes:

  • Add a user header to chat completion responses
  • Support the PLUGIN_DEBUG wasm environment variable for debugging the low-level ggml plugin