Replace Image Query Drivers with Prompt Drivers #1340

collindutter · 2024-11-12T22:02:59Z

I have read and agree to the contributing guidelines for submitting new pull requests.

Describe your changes

Image Query Drivers were added early on, before Prompt Drivers supported image inputs. They now provide no value beyond some syntactic niceties. This PR removes them and improves some syntax for Prompt Drivers.

Note that Image Query Tool has been kept since we don't have a Prompt Tool (though maybe we should; that's a separate discussion).

Added

PromptStack.from_artifact factory method for creating a Prompt Stack with a user message from an Artifact.

Changed

BREAKING: Removed all ImageQueryDrivers, use PromptDrivers instead.
BREAKING: Removed ImageQueryTask, use PromptTask instead.
BREAKING: Updated ImageQueryTool.image_query_driver to ImageQueryTool.prompt_driver.
BasePromptDriver.run can now accept an Artifact in addition to a Prompt Stack.
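
To illustrate the two non-breaking changes above, here is a minimal, self-contained sketch of the pattern: a factory that wraps an artifact in a single user message, and a `run` method that accepts either form. The classes below (`Artifact`, `Message`, `PromptStack`, `EchoPromptDriver`) are hypothetical stand-ins written for this example. They are not Griptape's implementation, and the real signatures may differ.

```python
from __future__ import annotations

from dataclasses import dataclass, field
from typing import Union


@dataclass
class Artifact:
    """Hypothetical stand-in for a text or image artifact."""

    value: str


@dataclass
class Message:
    """Hypothetical stand-in for a single prompt stack message."""

    role: str
    content: Artifact


@dataclass
class PromptStack:
    """Hypothetical stand-in for a prompt stack (an ordered list of messages)."""

    messages: list[Message] = field(default_factory=list)

    @classmethod
    def from_artifact(cls, artifact: Artifact) -> PromptStack:
        # Wrap the artifact in a single user message, as the changelog describes.
        return cls(messages=[Message(role="user", content=artifact)])


class EchoPromptDriver:
    """Toy driver that echoes the last message instead of calling a model."""

    def run(self, prompt_input: Union[PromptStack, Artifact]) -> Artifact:
        # Normalize a bare artifact into a prompt stack before processing.
        if isinstance(prompt_input, Artifact):
            prompt_input = PromptStack.from_artifact(prompt_input)
        last = prompt_input.messages[-1]
        return Artifact(value=f"{last.role}: {last.content.value}")


driver = EchoPromptDriver()
question = Artifact("Describe this image")

# Both call styles go through the same code path in this sketch.
result_from_artifact = driver.run(question)
result_from_stack = driver.run(PromptStack.from_artifact(question))
```

Normalizing the input at the top of `run` keeps the rest of the driver working against a single type, which is one plausible reason to add the factory method alongside the new argument type.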

Issue ticket number and link

NA

📚 Documentation preview 📚: https://griptape--1340.org.readthedocs.build//1340/

codecov · 2024-11-12T22:06:07Z

Codecov Report

All modified and coverable lines are covered by tests ✅

emjay07

I think it'd be good to copy the examples from the migration.md to the PromptDriver or PromptTask docs or a recipe.

emjay07 · 2024-11-13T17:36:29Z

docs/griptape-tools/official-tools/src/image_query_tool_1.py

 from griptape.structures import Agent
 from griptape.tools import ImageQueryTool

-# Create an Image Query Driver.
-driver = OpenAiImageQueryDriver(model="gpt-4o")
+driver = OpenAiChatPromptDriver(model="gpt-4o")

 # Create an Image Query Tool configured to use the engine.
 tool = ImageQueryTool(


I'm curious about the use case where an ImageQueryTool is still useful. If the prompt is "describe this image", the model should be able to do that now without using a tool.

collindutter · 2024-11-13

There is still value in having a Tool that can query images from the file system or task memory. Long term, this functionality should maybe be baked into the File Manager Tool, but that would require refactors outside the scope of this PR.

emjay07 · 2024-11-13

Got it. In that case, it may be good to add ImageQueryTool to this list.

emjay07 · 2024-11-13T17:41:55Z

README.md

@@ -40,7 +40,6 @@ Drivers facilitate interactions with external resources and services:
 - 🔢 **Embedding Drivers** generate vector embeddings from textual inputs.
 - 💾 **Vector Store Drivers** manage the storage and retrieval of embeddings.
 - 🎨 **Image Generation Drivers** create images from text descriptions.
- 🔎 **Image Query Drivers** query images from text queries.


Up to you on this, but it may be good to add a line to the Prompt Drivers entry saying that they can now handle multi-modal queries, so it doesn't seem like we don't support it at all.

dylanholmes

lgtm

dylanholmes · 2024-11-13T21:39:09Z

README.md

@@ -40,7 +40,6 @@ Drivers facilitate interactions with external resources and services:
 - 🔢 **Embedding Drivers** generate vector embeddings from textual inputs.
 - 💾 **Vector Store Drivers** manage the storage and retrieval of embeddings.
 - 🎨 **Image Generation Drivers** create images from text descriptions.
- 🔎 **Image Query Drivers** query images from text queries.


Maybe this list could focus on functionality (like verbs), then mention which driver gives it to you 🤷 .

collindutter force-pushed the refactor/image-query branch 2 times, most recently from 39eba58 to ab48bb6 on November 12, 2024 23:17

collindutter requested review from dylanholmes, vachillo and emjay07 November 12, 2024 23:20

collindutter marked this pull request as ready for review November 12, 2024 23:20

collindutter force-pushed the refactor/image-query branch from ab48bb6 to 030aef0 on November 13, 2024 00:55

emjay07 reviewed Nov 13, 2024

collindutter added 3 commits November 13, 2024 10:24

Replace Image Query Drivers with Prompt Drivers

00df011

Remove ImageQueryTask

8a113d3

Update readme

8925ac9

collindutter force-pushed the refactor/image-query branch 2 times, most recently from c8cc248 to 2767e54 on November 13, 2024 18:31

collindutter requested a review from emjay07 November 13, 2024 18:32

Add image example

d1741e4

collindutter force-pushed the refactor/image-query branch from 2767e54 to d1741e4 on November 13, 2024 19:40

dylanholmes approved these changes Nov 13, 2024

collindutter merged commit ba3a140 into dev Nov 13, 2024
15 checks passed

collindutter deleted the refactor/image-query branch November 13, 2024 21:48
