Add RouteLLM Prompt Driver #987

Closed · wants to merge 1 commit

Conversation

@collindutter (Member) commented Jul 16, 2024

Describe your changes

Adds `RouteLLMPromptDriver` for using RouteLLM to route between a strong Prompt Driver and a weak Prompt Driver.
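
For illustration, here is a minimal usage sketch. The `strong_prompt_driver`/`weak_prompt_driver` field names and the threshold value are assumptions for this example, not necessarily the driver's exact API:

```python
from griptape.drivers import OpenAiChatPromptDriver, RouteLLMPromptDriver

# RouteLLMPromptDriver is the driver added on this PR's branch; the field
# names below are assumed for illustration.
prompt_driver = RouteLLMPromptDriver(
    router="mf",        # RouteLLM router; "mf" is this PR's default
    threshold=0.11593,  # illustrative routing threshold, not a recommendation
    strong_prompt_driver=OpenAiChatPromptDriver(model="gpt-4o"),
    weak_prompt_driver=OpenAiChatPromptDriver(model="gpt-3.5-turbo"),
)

# Pass `prompt_driver` anywhere a Prompt Driver is accepted; queries RouteLLM
# scores above the threshold go to the strong driver, the rest to the weak one.
```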

Issue ticket number and link

NA


📚 Documentation preview 📚: https://griptape--987.org.readthedocs.build//987/

@collindutter (Member Author):

Should we consider other routing solutions before putting this in place?

Comment on lines +100 to +101
self.model = prompt_driver.model
self.tokenizer = prompt_driver.tokenizer
@collindutter (Member Author):

This feels a little weird, but I'm not sure what the alternative is

@dylanholmes (Contributor) commented Jul 16, 2024:

I agree that this kind of mutation inside of a _get_prompt_driver method is weird.

I think that model should be fixed to route-llm. Otherwise you'll get duplicate StartPromptEvent and FinishPromptEvent for the underlying model. (Like if you route to gpt-4o, then RouteLlmPromptDriver will emit StartPromptEvent(model="gpt-4o") and the underlying OpenAiChatPromptDriver will also do this.) Alternatively, you could disable the events for this prompt driver via a method override, but that feels wrong.

I think tokenizer should be left as None. Is there any reason why it can't be? I couldn't find a reason after searching for places where the field is referenced.
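
A rough sketch of that suggestion, assuming the driver keeps the attrs-style fields shown elsewhere in this PR (this is not the PR's actual code):

```python
from __future__ import annotations

from typing import Optional

from attrs import define, field

from griptape.drivers import BasePromptDriver
from griptape.tokenizers import BaseTokenizer


@define
class RouteLLMPromptDriver(BasePromptDriver):
    # Pin the model name so StartPromptEvent/FinishPromptEvent report
    # "route-llm" instead of duplicating the routed driver's own events.
    model: str = field(default="route-llm", kw_only=True, metadata={"serializable": True})

    # Leave the tokenizer unset; the routed (strong or weak) driver owns its own.
    tokenizer: Optional[BaseTokenizer] = field(default=None, kw_only=True)
```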

@collindutter force-pushed the feature/route-prompt-driver branch from 58cd7c1 to f609a5a on July 16, 2024 00:17

codecov bot commented Jul 16, 2024

Codecov Report

Attention: Patch coverage is 78.37838% with 8 lines in your changes missing coverage. Please review.

| Files with missing lines | Patch % | Lines |
| --- | --- | --- |
| griptape/drivers/prompt/routellm_prompt_driver.py | 77.77% | 5 Missing and 3 partials ⚠️ |

@collindutter force-pushed the feature/route-prompt-driver branch from f609a5a to aefd735 on July 16, 2024 00:56
@dylanholmes (Contributor) left a comment:

Really cool! 🍦

@@ -9,6 +9,7 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
- Native function calling support to `OpenAiChatPromptDriver`, `AzureOpenAiChatPromptDriver`, `AnthropicPromptDriver`, `AmazonBedrockPromptDriver`, `GooglePromptDriver`, and `CoherePromptDriver`.
- `OllamaEmbeddingDriver` for generating embeddings with Ollama.
- `GriptapeCloudKnowledgeBaseVectorStoreDriver` to query Griptape Cloud Knowledge Bases.
- `RouteLLMPromptDriver` for using RouteLLM to route between a strong Prompt Driver and weak Prompt Driver.
@dylanholmes (Contributor):

In response to:

> Should we consider other routing solutions before putting this in place?

If you were referencing the choice of making a "composite/decorator" Prompt Driver: I really like the decorator pattern for this scenario. It seems ideal to let users keep a familiar, existing API (Prompt Driver), especially since LLM routing would primarily be a cost-reduction mechanism introduced after the initial business logic is in place. First I'd implement my application and get it working, then decide I want to reduce my LLM costs; at that point all I'd need to do is swap out the Prompt Driver. Sounds like a good deal!
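
For instance, reusing the assumed field names from the sketch in the PR description, the cost-reduction step could be just a driver swap:

```python
from griptape.drivers import OpenAiChatPromptDriver, RouteLLMPromptDriver

# Before: one model for everything.
prompt_driver = OpenAiChatPromptDriver(model="gpt-4o")

# After: route between a strong and a weak model; the rest of the application
# still sees an ordinary Prompt Driver. Field names are assumptions.
prompt_driver = RouteLLMPromptDriver(
    threshold=0.11593,  # illustrative value
    strong_prompt_driver=OpenAiChatPromptDriver(model="gpt-4o"),
    weak_prompt_driver=OpenAiChatPromptDriver(model="gpt-3.5-turbo"),
)
```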

If you were referencing the choice of RouteLLM vs something else, you did put RouteLLM in the name, so it seems like nothing is preventing us from adding another Prompt Driver that does LLM routing differently.

)
router: str = field(kw_only=True, default="mf", metadata={"serializable": True})
threshold: float = field(kw_only=True, metadata={"serializable": True})
client: Controller = field(
@dylanholmes (Contributor):

I think you should name this field controller instead of client. When I read client I thought this thing was actually calling a service, but it's not; it's all local.
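
A small sketch of the rename, with the field shape assumed from the snippet above:

```python
from attrs import define, field
from routellm.controller import Controller  # RouteLLM's local routing controller


@define
class RouteLLMPromptDriverSketch:
    # `controller` rather than `client`: routing decisions are computed locally
    # by RouteLLM; no remote service is called.
    controller: Controller = field(kw_only=True)
```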

@collindutter (Member Author):

Closing for now based on offline discussion.

@collindutter deleted the feature/route-prompt-driver branch on August 21, 2024 17:31