Releases: LlamaEdge/rag-api-server

LlamaEdge-RAG 0.5.2

13 May 03:07

Major changes:

  • Llama-api-server
    • Improve error responses
    • Add the Content-Type: application/json header to responses
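
For client code, the practical effect of the header change is that a response can be recognized as JSON before parsing it. The following is a minimal sketch of such a check using only the Python standard library. The helper names is_json_content_type and decode_body are hypothetical and are not part of LlamaEdge; the shape of the server's error bodies is not described in these notes, so the sketch only decodes whatever JSON it is given.

```python
import json
from email.message import Message


def is_json_content_type(header_value: str) -> bool:
    """Return True if a Content-Type header value denotes application/json.

    Parameters such as "; charset=utf-8" are ignored, and the comparison
    is case-insensitive.
    """
    msg = Message()
    msg["Content-Type"] = header_value
    return msg.get_content_type() == "application/json"


def decode_body(header_value: str, body: bytes):
    """Parse the body as JSON when the header says so, else return text.

    Hypothetical helper: it makes no assumption about which fields the
    server puts in a success or error body.
    """
    if is_json_content_type(header_value):
        return json.loads(body)
    return body.decode("utf-8", errors="replace")
```

A client could call decode_body with the Content-Type value and raw bytes of any response, and fall back to plain text when talking to a release older than 0.5.2 that may not set the header.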

LlamaEdge-RAG 0.5.1

09 May 15:38

Major change:

  • Update /v1/embeddings endpoint to be compatible with OpenAI /v1/embeddings API
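
As an illustration of what OpenAI compatibility implies for a client, here is a minimal sketch that builds an OpenAI-style embeddings request and reads vectors out of an OpenAI-style response. The base URL, port, and model name are assumptions chosen for the example and do not come from these release notes. The request and response shapes follow the public OpenAI /v1/embeddings format. Whether the server accepts every optional OpenAI field is not stated here.

```python
import json
import urllib.request

# Assumptions for illustration only: adjust to your own deployment.
BASE_URL = "http://localhost:8080"   # hypothetical host and port
MODEL = "example-embedding-model"    # hypothetical model name


def build_embeddings_request(texts, model=MODEL, base_url=BASE_URL):
    """Build (but do not send) a POST request in the OpenAI embeddings shape."""
    payload = {"model": model, "input": texts}
    return urllib.request.Request(
        url=f"{base_url}/v1/embeddings",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def extract_vectors(response_body):
    """Return the embedding vectors from an OpenAI-style response,
    ordered by each item's "index" field."""
    items = sorted(response_body["data"], key=lambda item: item["index"])
    return [item["embedding"] for item in items]
```

With a server running, the request could be sent with urllib.request.urlopen(build_embeddings_request(["some text"])) and the decoded JSON body passed to extract_vectors.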

LlamaEdge-RAG 0.5.0

09 May 11:44

Major change:

  • Update deps: llama-core v0.9.0, endpoints v0.8.0, and chat-prompts v0.7.1

LlamaEdge-RAG 0.4.0

30 Apr 11:16

Major changes:

  • New /v1/retrieve endpoint
  • New --rag-policy CLI option

LlamaEdge-RAG 0.3.8

28 Apr 07:42

Major change:

  • Update the chat-prompts dep to 0.6.2

LlamaEdge-RAG 0.3.7

26 Apr 12:58

Major change:

  • Post-process the output generated by the phi-3-chat model in non-stream mode.

LlamaEdge-RAG 0.3.6

24 Apr 10:20

Major change:

  • Post-process the output generated by the llama-2-chat and llama-3-chat models in non-stream mode.

LlamaEdge-RAG 0.3.5

24 Apr 10:18

LlamaEdge-RAG 0.3.4

22 Apr 17:13

Major change:

  • Support models without a system prompt in RAG

LlamaEdge-RAG 0.3.3

19 Apr 05:00