STORY: As a Monarch handler developer, I want to receive text-only preprocessor responses to followup queries on an entire graphic, so that we can provide responses to queries made directly from the Monarch.
#918 · Open · jeffbl opened this issue on Nov 21, 2024 · 0 comments
The goal of this story is to create a preprocessor that can serve the simplest possible followup query response. The user scenario is:
the user has a graphic loaded in the IMAGE app on the Monarch
the user verbalizes a question about the graphic as a whole
STT is used to create a followup query, which is sent to the server
the followup preprocessor sends the original graphic to the LMM, along with the user query
the text response is returned to the orchestrator, for consumption by the handler
the completed rendering is sent to the Monarch, which uses TTS to read the response
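The preprocessor step in the scenario above could be sketched roughly as follows. This is only an illustration under stated assumptions: the request fields (`graphic`, `followup.query`, `request_uuid`), the response shape, and the `ask_lmm` callable are all hypothetical placeholders, not the actual IMAGE request format or LMM client.

```python
from typing import Any, Callable, Dict

# Hypothetical constant; the issue only fixes the short name "text-followup".
PREPROCESSOR_NAME = "text-followup"


def handle_followup(
    request: Dict[str, Any],
    ask_lmm: Callable[[str, str], str],
) -> Dict[str, Any]:
    """Send the original graphic plus the user's query to an LMM, return text only.

    `ask_lmm(graphic, query)` stands in for whatever LMM client is used;
    it is injected so this sketch makes no claim about a real API.
    """
    graphic = request["graphic"]                  # assumed field name
    query = request["followup"]["query"].strip()  # assumed field name
    if not query:
        raise ValueError("followup query is empty")

    answer = ask_lmm(graphic, query)

    # Assumed response shape: a single text field for the handler to consume.
    return {
        "request_uuid": request.get("request_uuid"),
        "name": PREPROCESSOR_NAME,
        "data": {"response": answer.strip()},
    }


def fake_lmm(graphic: str, query: str) -> str:
    """Stand-in LMM used only to exercise the sketch."""
    return f" Answer to: {query} "
```

Calling `handle_followup(request, fake_lmm)` with a request containing a graphic and a followup query returns a dictionary whose `data.response` is the trimmed text, which the handler could then turn into a rendering for TTS.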
Technical plan:
create new preprocessor text-followup
create new preprocessor schema text-followup.schema.json
The text-followup preprocessor will be nearly identical to the graphic-caption preprocessor at first, but will diverge as we add features such as paying attention to previous queries. @JRegimbal @VenissaCarolQuadros
QUESTION: It could be combined with graphic-caption, so that it works as it does currently if there is no followup content in the request, but pays attention to that content if it exists. Would that be better long-term?
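To make the combined option concrete, here is a minimal sketch of how a single preprocessor might branch on the presence of followup content. The field names and the default caption prompt are assumptions for illustration only; they are not taken from the existing graphic-caption code.

```python
from typing import Any, Dict

# Hypothetical default prompt; the real graphic-caption prompt is not shown in this issue.
DEFAULT_CAPTION_PROMPT = "Describe this graphic."


def build_prompt(request: Dict[str, Any]) -> str:
    """Return the user's followup query if present, else the caption prompt."""
    followup = request.get("followup") or {}   # assumed field name
    query = (followup.get("query") or "").strip()
    return query if query else DEFAULT_CAPTION_PROMPT
```

With this shape, requests without followup content behave as captioning requests do today, while requests carrying a query are answered directly. The trade-off is that later followup-specific features, such as handling previous queries, would accumulate in the same code path.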
QUESTION: Are there downsides to a generic text.schema.json for preprocessors? Otherwise, use text-followup.schema.json? ANSWER FROM @JRegimbal: text-followup (modified above)
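As a starting point for discussion, text-followup.schema.json might look something like the sketch below, written as a Python dictionary alongside a hand-rolled check that mirrors it. The single `response` property is an assumption; the issue does not specify the schema's fields.

```python
import json
from typing import Any

# Hypothetical contents for text-followup.schema.json (field names are assumptions).
TEXT_FOLLOWUP_SCHEMA = {
    "$schema": "http://json-schema.org/draft-07/schema#",
    "title": "text-followup",
    "type": "object",
    "properties": {"response": {"type": "string", "minLength": 1}},
    "required": ["response"],
    "additionalProperties": False,
}


def schema_as_json() -> str:
    """Serialize the sketch schema as it might be written to the .json file."""
    return json.dumps(TEXT_FOLLOWUP_SCHEMA, indent=2)


def is_valid_text_followup(data: Any) -> bool:
    """Hand-rolled check mirroring the sketch schema (no third-party validator)."""
    if not isinstance(data, dict):
        return False
    if set(data.keys()) != {"response"}:   # required key, no extra properties
        return False
    response = data["response"]
    return isinstance(response, str) and len(response) >= 1
```

For example, `is_valid_text_followup({"response": "The chart shows three bars."})` passes, while an empty string or an extra property fails.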
Future work items will cover: