Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix the format issue when build online doc #206

Merged
merged 1 commit into from
Oct 15, 2024
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
72 changes: 42 additions & 30 deletions community/rfcs/24-07-11-OPEA-Agent.md
Original file line number Diff line number Diff line change
@@ -1,3 +1,11 @@
# 24-07-11-OPEA-Agent

Agent

## Author

[xuechendi](https://github.com/xuechendi)

## Status

v0.1 team sharing completed(07/10/24)
Expand All @@ -16,7 +24,7 @@ Single Agent Example:

![image](https://github.com/xuechendi/docs/assets/4355494/02232f5b-8034-44f9-a10c-545a13ec5e40)


* ‘Multi Agent' system: Multi Agents refer to a design that leveraging a Hierarchical Agent Teams to complete sub-tasks through individual agent working groups. Benefits of multi-agents’ design: (1) Grouping tools/responsibilities can give better results. An agent is more likely to succeed on a focused task than if it must select from dozens of tools. (2) Each agent will have their own assets including prompt, llm model, planning strategy and toolsets. (3) User can easily use yaml files or few lines of python to build a 'Hierarchical Multi Agent' megaservice by cherry-picking ready-to-use individual agents. (4) For small tasks which can be perfectly performed by single Agent, user can directly use 'Agent' microservice with simple/easy resource management.

Multi Agent example:
Expand All @@ -33,17 +41,19 @@ This RFC aims to provide low-code / no-code agents as new microservice / megaser

## Persona

We use the listed terms to define different persona mentioned in this document.
We use the listed terms to define different persona mentioned in this document.

* OPEA developer: OPEA developers describe who will follow current OPEA API SPEC or expand OPEA API SPEC to add new solutions. OPEA developers are expected to use this RFC to understand how this microservice communicates with other microservices and chained in megaflow. OPEA developer develops OPEA agent codes and add new Agent Implementation by extending current Agent library with advanced agent strategies.

* Enterprise User (Devops): Devops describe who will follow OPEA yaml configuration format to update settings according to their real need, or tune some of the configuration to get better performance, who will also use their updated configuration to launch all microservices and get functional endpoint and API calling. Devops are expected to use this RFC to understand the keywords, how these keywords works and rules of using this microservice. Devops are expected to follow customer tool template to provide their own tools and register to Agent microservice.

* End user: End user describe who writes application which will use OPEA exposed endpoints and API to fulfill task goals. End users are expected to use this RFC to understand API keywords and rules.
* End user: End user describe who writes application which will use OPEA exposed endpoints and API to fulfill task goals. End users are expected to use this RFC to understand API keywords and rules.


## Design Proposal
### Execution Plan

### Execution Plan

v0.8 (PR ready or merge to opea - agent branch)
* Agent component v0.1
* Support chat-completion API
Expand All @@ -65,14 +75,15 @@ V1.0
* Scaling
* Concurrency

### Part 1. API SPEC
### Part 1. API SPEC

Provide two types of API for different client application.
1. openAI chat completion API.
1. openAI chat completion API.
> Reference: https://platform.openai.com/docs/api-reference/chat/create

Advantage and limitation:
Advantage and limitation:
* Most common API, should be working with any existing client uses openAI.
* will not be able to memorize user historical session, human_in_loop agent will not work using this API.
* will not be able to memorize user historical session, human_in_loop agent will not work using this API.

```
"/v1/chat/completions": {
Expand All @@ -85,7 +96,7 @@ V1.0
2. openAI assistant API
> Reference: https://platform.openai.com/docs/api-reference/assistants

Advantage and limitation:
Advantage and limitation:
* User can create a session thread memorizing previous conversation as long-term memory. And Human-In-Loop agent will only works use this API.
* User client application may need codes change to work with this new API.
* openAI assistant API is tagged with ‘beta’, not stable
Expand All @@ -97,32 +108,32 @@ V1.0
"name": str,
"tools": list
}

# threads API is to used maintain conversation session with one user. It can be resumed from previous, can tracking long term memories.
- "/v1/threads/ ": { # empty is allowed }


# threads messages API is to add a task content to thread_1 (the thread created by threads API)
- "/v1/threads/thread_1/messages": {
"role": str,
"content": str
}

# threads run API is to start to execute agent thread using run api

- "/v1/threads/thread_1/runs": {
'assistant_id': str,
'instructions': str,
}
```

### Part 2. 'Agent' genAI Component definition
### Part 2. 'Agent' genAI Component definition

'Agent' genAI Component is regarded as the resource management unit in “Agent” design. It will be launched as one microservice and can be instantiated as ‘Agent’, ‘Planner’ or ‘Executor’ according to configuration. Tools will be registered to 'Agent' microservice during launch or runetime.

![image](https://github.com/user-attachments/assets/38e83fa4-57d8-4146-9061-e5153472b5f4)

#### SPEC for any agent Role - agent, planner, executor
#### SPEC for any agent Role - agent, planner, executor
```
"/v1/chat/completions": {
"model": str,
Expand All @@ -145,8 +156,8 @@ V1.0
}
```

#### Agent Role microservice definition - 'Agent':
A complete implementation of Agent, which contains LLM endpoint as planner, strategy algorithm for plan execution, Tools, and database handler to keep track of historical state and conversation.
#### Agent Role microservice definition - 'Agent':
A complete implementation of Agent, which contains LLM endpoint as planner, strategy algorithm for plan execution, Tools, and database handler to keep track of historical state and conversation.

configuration:
```
Expand All @@ -157,7 +168,7 @@ V1.0
llm_model_id: str
recursion_limit: int
tools: file_path or dict

# Tools definition
[tool_name]:
description: str
Expand All @@ -171,8 +182,8 @@ V1.0
return_output: str
```

#### Agent Role microservice definition - 'Planner':
Agent without tools. Planner only contains LLM endpoints as planner, certain strategies to complete an optimized plan.
#### Agent Role microservice definition - 'Planner':
Agent without tools. Planner only contains LLM endpoints as planner, certain strategies to complete an optimized plan.

configuration:
```
Expand All @@ -185,7 +196,7 @@ V1.0
require_human_feedback: bool
```

#### Agent Role microservice definition - 'Executor':
#### Agent Role microservice definition - 'Executor':
Tools executors. Executor is used to process input with registered tools.

Configuration:
Expand All @@ -203,7 +214,7 @@ V1.0
```

> Any microservcice follow this spec can be registered as role in Part3-graph-based

### Part3. 'Multi Agent' system overview

We planned to provide multi-agent system in two phases.
Expand All @@ -217,15 +228,16 @@ We planned to provide multi-agent system in two phases.


* Phase II: Graph-Based Multi Agent
1. In this design, we provide user a new SDK to compose a graph-based multi agents system with conditional edge to define all strategic rules.
2. Enterprise user will be able to use python code to wrap either ‘agent’, ‘planner’ or tools as ‘Role’ and add conditional edges between them for complex task agent design.
1. In this design, we provide user a new SDK to compose a graph-based multi agents system with conditional edge to define all strategic rules.
2. Enterprise user will be able to use python code to wrap either ‘agent’, ‘planner’ or tools as ‘Role’ and add conditional edges between them for complex task agent design.
3. This design provides user enough flexibility to handle very complex tasks and also provide flexibility to handle resource management when certain tools are running way slower than others.
> Detailed configuration please refer to Part3.2
![image](https://github.com/user-attachments/assets/35b36f64-eaa1-4f05-b25e-b8bea013680d)

#### Part3.1 Hierarchical Multi Agents

__Example 1__: ‘Single Agent megaservice’
Only 1 agent is presented in this configuration.
Only 1 agent is presented in this configuration.
![image](https://github.com/user-attachments/assets/2e716dd4-2923-4ebd-97bf-fe7a44161280)

3 tools are registered to this agent through custom_tools.yaml
Expand All @@ -235,7 +247,7 @@ Only 1 agent is presented in this configuration.
![image](https://github.com/user-attachments/assets/ec89e35b-8ccc-474b-9fb7-3ed7210acc10)

__Example 2__: ‘Hierarchical Multi Agents’
3 agents are presented in this configuration, 1st layer supervisor agent is the gateway to interact with user, and 1st layer agent will manage 2nd layer worker agents.
3 agents are presented in this configuration, 1st layer supervisor agent is the gateway to interact with user, and 1st layer agent will manage 2nd layer worker agents.

![image](https://github.com/user-attachments/assets/a83b51e6-ee08-473f-b389-51df48f1054f)

Expand All @@ -254,7 +266,7 @@ User can also chain agent into a multi-step mega service. audioAgent_megaservice
![image](https://github.com/user-attachments/assets/5fb18d75-9c08-4d7b-97f7-25d7227147dd)

#### Part3.2 Graph-Based Multi Agent
In Phase II, we propose to provide a graph-based multi agents system, which enterprise user will be able to define edges and conditional edges between agent nodes, planner nodes and tools for complex task agent design.
In Phase II, we propose to provide a graph-based multi agents system, which enterprise user will be able to define edges and conditional edges between agent nodes, planner nodes and tools for complex task agent design.

![image](https://github.com/user-attachments/assets/7c07e651-43ed-4056-b20a-cd39f3f883ee)

Expand All @@ -264,7 +276,7 @@ The user can build and launch the graph-based message group by the combination o
The yaml file contains the basic config information for each single “Role” in the agent architecture. The user can build a MessageGroup to define the link connection information and the data flow via “edges” and “conditional_edges”. The “edges” mean the output of the head_node is the input of the tail_node. The “conditional_edges” means there is a decision-making among the candidate tail_nodes based on the output of the head_node. The logic of this selection part is defined by the state component “Should_Continue”.
![image](https://github.com/user-attachments/assets/55ecb718-b134-4546-9496-40ac3a427a7b)

Appending agents/roles in MessageGroup.
Appending agents/roles in MessageGroup.
Define the role class define the action of the role  add edges  recompile the messagegroup
![image](https://github.com/user-attachments/assets/65a3fc1d-89f3-4bb3-a078-75db91400c58)

Expand Down
22 changes: 10 additions & 12 deletions community/rfcs/24-08-20-OPEA-001-AI_Gateway_API.md
Original file line number Diff line number Diff line change
@@ -1,27 +1,25 @@
## RFC Title
# 24-08-20-OPEA-001-AI Gateway API

AI Gateway API

## RFC Content

### Author
## Author

[daixiang0](https://github.com/daixiang0), [zhixie](https://github.com/zhxie), [gyohuangxin](https://github.com/gyohuangxin), [Forrest-zhao](https://github.com/Forrest-zhao), [ruijin-intel](https://github.com/ruijin-intel)

### Status
## Status

Under Review

### Objective
## Objective

Design the API for AI Gateway.

### Motivation
## Motivation

- Introduce gateway to do mTLS, traffic control, observability and so on
- Introduce AI Gateway API to use existing gateway sloutions rather than implement our own one.

### Design Proposal
## Design Proposal

The AI gateway is at the front of all microservices:

Expand All @@ -34,7 +32,7 @@ graph TD;
A-->B(Any microservice);
```

#### API overall
### API overall

To make the most of current resources, we choose to follow [Kubernetes Gateway API](https://gateway-api.sigs.k8s.io/) since it is the gateway API standard that all gateways support.

Expand All @@ -43,7 +41,7 @@ Since AI specific features of Kubernetes Gateway API are still [under discussion
- **Kubernetes Gateway API** for features it already supports
- **Extension API for** all other features

#### API workflow
### API workflow

```mermaid
graph LR;
Expand All @@ -52,7 +50,7 @@ graph LR;

AI Gateway is not a brand-new gateway implementation, only does one thing: Convert.

#### Extension API
### Extension API

```yaml
apiVersion: extension.gateway.opea.dev/v1
Expand All @@ -74,7 +72,7 @@ spec:
- name: the name of extension feature, support multiple extensions
- config: the content of extension config, following specified gateway API

#### Extension API example
### Extension API example

```yaml

Expand Down
Loading