This RFC proposes supporting Ollama serving on Intel Xeon CPUs. Ollama will be integrated as an additional LLM serving backend alongside the existing vLLM, TGI, and OpenAI services. Running inference locally enhances data privacy, reduces operational costs by leveraging on-premise hardware, and gives users flexibility and control over their AI deployments. The workflow combines embedding, retrieval, and reranking microservices so that user queries and data preparation are handled efficiently and securely.
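As a rough illustration of how a microservice in this workflow might call a locally hosted Ollama instance, here is a minimal Python sketch. It uses Ollama's standard REST endpoint (default port 11434); the model name `llama3` and the helper function are illustrative assumptions, not part of this RFC's implementation.

```python
import requests

# Default Ollama REST endpoint on the local host (Ollama listens on
# port 11434 out of the box). The model name below is a placeholder.
OLLAMA_URL = "http://localhost:11434/api/generate"


def generate(prompt: str, model: str = "llama3") -> str:
    """Send a single non-streaming generation request to Ollama."""
    resp = requests.post(
        OLLAMA_URL,
        json={"model": model, "prompt": prompt, "stream": False},
        timeout=120,
    )
    resp.raise_for_status()
    # With stream=False, Ollama returns one JSON object whose
    # "response" field holds the full generated text.
    return resp.json()["response"]


if __name__ == "__main__":
    print(generate("What is retrieval-augmented generation?"))
```

Because the request never leaves the host, this pattern keeps prompts and generated text on-premise, which is the privacy benefit the RFC highlights.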
Changes -