[FEAT] Define model for Embedding #276

netandreus · 2023-11-14T08:15:50Z

Problem

I am using LocalAI with Zep.

    llm:
      service: "openai"
      model: "gpt-3.5-turbo-1106"
      openai_endpoint: "http://host.docker.internal:8080/v1"

I can define the model for the LLM itself, but I also need to define the model for embeddings, because the embedding model currently seems to be hardcoded to text-embedding-ada-002.

Possible solution

Add a model key to the embeddings options and use it, like this:

    embeddings:
      enabled: true
      chunk_size: 200
      dimensions: 384
      service: "openai"
      model: "some-custom-model"
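
To illustrate how the proposed key could behave, here is a minimal sketch in Python (not Zep's actual code). `EmbeddingsConfig`, `load_embeddings_config`, and `build_embedding_payload` are hypothetical names made up for this example. The only assumptions taken from outside the issue are that an OpenAI-style embeddings request carries the model name in a `model` field next to `input`, and that falling back to text-embedding-ada-002 when the key is absent would preserve today's behavior.

```python
from dataclasses import dataclass

# Fallback mirrors the model name this issue reports as hardcoded.
DEFAULT_EMBEDDING_MODEL = "text-embedding-ada-002"


@dataclass
class EmbeddingsConfig:
    """Hypothetical mirror of the proposed `embeddings` config section."""
    enabled: bool = True
    chunk_size: int = 200
    dimensions: int = 384
    service: str = "openai"
    model: str = DEFAULT_EMBEDDING_MODEL  # the new, optional key


def load_embeddings_config(raw: dict) -> EmbeddingsConfig:
    """Build the config from a parsed mapping, ignoring unknown keys."""
    fields = EmbeddingsConfig.__dataclass_fields__
    return EmbeddingsConfig(**{k: v for k, v in raw.items() if k in fields})


def build_embedding_payload(cfg: EmbeddingsConfig, texts: list[str]) -> dict:
    """Return the JSON body of an OpenAI-style embeddings request."""
    return {"model": cfg.model, "input": texts}
```

With the config above, `build_embedding_payload` would send "some-custom-model"; with no `model` key it would keep sending the current default, so existing deployments would not change.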

danielchalef · 2023-12-27T01:04:56Z

We're refactoring how LLMs work and separating generation/completion from embeddings, which will address the above. We'll be releasing this in the new year.

danielchalef · 2024-10-03T21:22:45Z

Other inference providers are now supported via a proxy such as LiteLLM.

t41372 · 2024-10-17T06:13:36Z

Would it be possible to allow users to define the model name they'd like to use, rather than being limited to "gpt-4o-mini" and OpenAI's embeddings, without the need for a proxy server like LiteLLM? Many inference backends, such as Ollama, are compatible with OpenAI's chat and embeddings endpoints. However, the inability to modify the model name prevents us from utilizing this compatibility.

While, as you mentioned, we could use LiteLLM as a proxy server to reroute requests and overwrite the model name, this approach adds significant complexity. I'm currently exploring long-term memory integration for my project, Open-LLM-VTuber, and setting up LiteLLM with rerouting for LLM and embeddings can present a serious challenge for many of my users.

Given how beneficial the ability to change the model name would be, I strongly recommend allowing users to set a custom model name for both the LLM and embeddings.
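
As a rough sketch of why the model name is the only missing piece, the snippet below builds (but does not send) an embeddings request for any OpenAI-compatible backend using only the Python standard library. `make_embeddings_request` is a hypothetical helper. The base URL and model name in the commented usage are assumptions about a local Ollama setup, not values taken from Zep.

```python
import json
import urllib.request


def make_embeddings_request(base_url: str, model: str, texts: list[str],
                            api_key: str = "not-needed") -> urllib.request.Request:
    """Build a POST request for an OpenAI-compatible /embeddings endpoint."""
    url = base_url.rstrip("/") + "/embeddings"
    body = json.dumps({"model": model, "input": texts}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={
            "Content-Type": "application/json",
            # Many local backends ignore the key, but the header is harmless.
            "Authorization": f"Bearer {api_key}",
        },
    )


# Usage (assumed local endpoint and model name; requires a running server):
# req = make_embeddings_request("http://localhost:11434/v1", "nomic-embed-text", ["hello"])
# with urllib.request.urlopen(req) as resp:
#     vectors = [item["embedding"] for item in json.load(resp)["data"]]
```

Nothing else in the request differs between backends, which is why a configurable model name alone would remove the need for a rewriting proxy in this scenario.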

jkirk-denaliai · 2024-11-20T19:03:21Z

There are data privacy, regulatory, and sovereignty issues associated with using OpenAI embeddings.
Ideally, we could implement our own embedding function and not be limited to OpenAI. I would need to rewrite this to use local embeddings before I could use it in my applications.
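
One way a pluggable embedding function might look, sketched in Python under stated assumptions: `Embedder`, `HashingEmbedder`, and `index_documents` are hypothetical names, and the hashing embedder is only a toy stand-in for a real local model (it captures no semantics). The point is the interface: any object with an `embed` method could be swapped in, so no text has to leave the machine.

```python
import hashlib
import math
from typing import Protocol


class Embedder(Protocol):
    """Hypothetical interface a user-supplied embedding function would satisfy."""

    def embed(self, texts: list[str]) -> list[list[float]]: ...


class HashingEmbedder:
    """Toy local embedder: hashed bag-of-words, L2-normalised."""

    def __init__(self, dimensions: int = 384) -> None:
        self.dimensions = dimensions

    def embed(self, texts: list[str]) -> list[list[float]]:
        vectors = []
        for text in texts:
            vec = [0.0] * self.dimensions
            for token in text.lower().split():
                digest = hashlib.sha256(token.encode("utf-8")).digest()
                # Map each token to a stable bucket and count it.
                vec[int.from_bytes(digest[:4], "big") % self.dimensions] += 1.0
            norm = math.sqrt(sum(x * x for x in vec)) or 1.0  # avoid dividing by zero
            vectors.append([x / norm for x in vec])
        return vectors


def index_documents(embedder: Embedder, docs: list[str]) -> dict[str, list[float]]:
    """Embed documents with whichever embedder the caller supplies."""
    return dict(zip(docs, embedder.embed(docs)))
```

Replacing `HashingEmbedder` with a wrapper around a locally hosted model would leave `index_documents` unchanged, which is the property the request above is asking for.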

netandreus mentioned this issue on Nov 14, 2023, in [BUG] No metadata for session #275 (Closed)
