feat(generation-transformer): add generation transformer #2820

atierian · 2024-08-30T15:24:12Z

Description of changes

Amplify GraphQL Generation Transformer

The Amplify GraphQL Generation Transformer is a tool that enables the quick and easy creation of AI-powered Generation routes within your AWS AppSync API. This transformer can be leveraged by using the @generation directive to configure AI models and system prompts for generating content.

Directive Definition

The @generation directive is defined as follows:

directive @generation(
    aiModel: String!,
    systemPrompt: String!,
    inferenceConfiguration: GenerationInferenceConfiguration
) on FIELD_DEFINITION

Features

AI Model Integration: Specify the AI model to be used for generation.
System Prompt Configuration: Define a system prompt to guide the AI's output.
Inference Configuration: Fine-tune generation parameters like max tokens, temperature, and top-p.
Integrates with @auth Directive: Supports existing auth modes like IAM, API key, and Amazon Cognito User Pools.
Resolver Creation: Generates resolvers with tool definitions based on the Query field's return type to interact with the specified AI model.
Bedrock HTTP Data Source Creation: Creates a AppSync HTTP Data Source for Bedrock to interact with the specified AI model.

Examples

Basic Usage

Scalar Type Generation

type Query {
  generateStory(topic: String!): String @generation(
    aiModel: "anthropic.claude-3-haiku-20240307-v1:0",
    systemPrompt: "You are a creative storyteller. Generate a short story based on the given topic."
  )
}

Complex Type Generation

type Recipe {
  name: String!
  ingredients: [String!]!
  instructions: [String!]!
  prepTime: Int!
  cookTime: Int!
  servings: Int!
  difficulty: String!
}

type Query {
  generateRecipe(cuisine: String!, dietaryRestrictions: [String]): Recipe
  @generation(
    aiModel: "anthropic.claude-3-haiku-20240307-v1:0",
    systemPrompt: "You are a professional chef specializing in creating recipes. Generate a detailed recipe based on the given cuisine and dietary restrictions."
  )
}

Advanced Configuration

type Query {
  generateCode(description: String!): String @generation(
    aiModel: "anthropic.claude-3-haiku-20240307-v1:0",
    systemPrompt: "You are an expert programmer. Generate code based on the given description.",
    inferenceConfiguration: {
      maxTokens: 500,
      temperature: 0.7,
      topP: 0.9
    }
  )
}

Limitations

The @generation directive can only be used on Query fields.
The AI model specified must:
- be supported by Amazon Bedrock's /converse API
- support tool usage
Only the following GraphQL / AppSync scalar types are supported as required properties
- Boolean
- Int
- Float
- String
- ID
- AWSJSON

CDK / CloudFormation Parameters Changed

N/A

Issue #, if available

N/A

Description of how you validated changes

Snapshot tests
E2E tests
Manual testing

Checklist

PR description included
yarn test passes
Tests are changed or added
~~Relevant documentation is changed or added (and PR referenced)~~ Docs PRs are WIP
New AWS SDK calls or CloudFormation actions have been added to relevant test and service IAM policies
Any CDK or CloudFormation parameter changes are called out explicitly

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

atierian · 2024-08-30T15:26:14Z

packages/amplify-graphql-api-construct-tests/src/__tests__/generations/generation.test.ts

+      };
+      const solveEquationResult = await doAppSyncGraphqlQuery({ ...args, query: solveEquation, variables });
+      const solution = solveEquationResult.body.data.solveEquation;
+      expect(solution).toBeDefined();


We're asserting that we get an answer here, not the correct answer because LLMs are not good (being generous) at math.

atierian · 2024-08-30T15:26:26Z

packages/amplify-graphql-api-construct-tests/src/__tests__/generations/generation.test.ts

+      // TODO: This currently doesn't work because LLMs are not great at following regex pattern requirements, they'll sometimes return "<UNKNOWN>"
+      // which fails GraphQL type validation for implicitly generated required model values like id, createdAt, updatedAt.
+      xtest('should generate a model', async () => {


The test is currently disabled because we throw in the transformer when the return type contains required fields typed as certain AppSync scalars.

We're exploring options, including prompt improvements, better regex pattern in JSON Schema tool definitions, and special case handling for models (omitting createdAt, updatedAt, and id in tool definition, and populating them in the resolver).

For now, this is an accepted current limitation.

atierian · 2024-08-30T15:26:48Z

packages/amplify-graphql-generation-transformer/src/grapqhl-generation-transformer.ts

+      resolverResourceId,
+      invokeBedrockFunction.req,
+      invokeBedrockFunction.res,
+      ['auth'],


This allows existing VTL @auth generated resolver functions to be inserted into the pipeline resolver.

atierian · 2024-08-30T15:27:11Z

packages/amplify-graphql-generation-transformer/src/grapqhl-generation-transformer.ts

+      ['auth'],
+      [],
+      dataSource as any,
+      { name: 'APPSYNC_JS', runtimeVersion: '1.0.0' },


Probably should eventually be a constant defined in amplify-graphql-transformer-core, but it's only being used here for now. Will add one in a follow up if / when it becomes necessary.

atierian · 2024-08-30T15:27:32Z

packages/amplify-graphql-generation-transformer/src/resolvers/invoke-bedrock.ts

+ * @returns {MappingTemplateProvider} A MappingTemplateProvider for the response function.
+ */
+const createInvokeBedrockResponseFunction = (): MappingTemplateProvider => {
+  // TODO: add stopReason: max_tokens error handling


Planned followup to improve the error message.

packages/amplify-graphql-transformer-interfaces/src/transform-host-provider.ts

atierian · 2024-08-30T15:31:00Z

----- MARK REVIEW -----

...lify-graphql-api-construct-tests/src/__tests__/generations/graphql/schema-generation.graphql

packages/amplify-graphql-generation-transformer/package.json

...-graphql-generation-transformer/src/__tests__/amplify-graphql-generation-transformer.test.ts

packages/amplify-graphql-api-construct-tests/src/__tests__/generations/generation.test.ts

...-graphql-generation-transformer/src/__tests__/amplify-graphql-generation-transformer.test.ts

packages/amplify-graphql-generation-transformer/src/grapqhl-generation-transformer.ts

packages/amplify-graphql-generation-transformer/src/utils/tools.ts

packages/amplify-graphql-generation-transformer/src/validation.ts

packages/amplify-graphql-transformer-interfaces/src/transform-host-provider.ts

...s/amplify-graphql-generation-transformer/src/utils/graphql-scalar-json-schema-definitions.ts

phani-srikar · 2024-08-30T21:18:05Z

packages/amplify-graphql-generation-transformer/src/grapqhl-generation-transformer.ts

+
+export type GenerationDirectiveConfiguration = {
+  parent: ObjectTypeDefinitionNode;
+  directive: DirectiveNode;


why do we need to store the directive definition when the actual directive arguments are already listed out separately below?

Good point, we don't! I'll remove this in a follow up. Thanks!

phani-srikar · 2024-08-30T21:32:07Z

packages/amplify-graphql-generation-transformer/src/grapqhl-generation-transformer.ts

+      };
+
+      const stackName = `Generation${this.capitalizeFirstLetter(fieldName)}BedrockDataSourceStack`;
+      const stack = this.createStack(ctx, stackName);


I think Tim has a good point here and might be worth discussing the trade offs with team before we release. I couldn't find any AppSync imposed limit on number of datasources, but feel like re-use will save us from trouble in the future.

phani-srikar · 2024-08-30T21:40:11Z

packages/amplify-graphql-generation-transformer/src/utils/tools.ts

+  toolSpec: ToolSpec;
+};
+
+export type Tools = {


Do you use this type elsewhere?

We don't, and your comment made me realize that we only actually need to export ToolConfig. The extra exports and Tools type definition are relics from previous structure. Will remove in a follow up. Thanks for the callout!

phani-srikar · 2024-08-30T21:46:58Z

...-transformer/src/__tests__/__snapshots__/amplify-graphql-generation-transformer.test.ts.snap

+exports[`generation route all scalar types 2`] = `
+"export function request(ctx) {
+  const toolConfig = {\\"tools\\":[{\\"toolSpec\\":{\\"name\\":\\"responseType\\",\\"description\\":\\"Generate a response type for the given field\\",\\"inputSchema\\":{\\"json\\":{\\"type\\":\\"object\\",\\"properties\\":{\\"value\\":{\\"type\\":\\"object\\",\\"properties\\":{\\"int\\":{\\"type\\":\\"number\\",\\"description\\":\\"A signed 32-bit integer value.\\"},\\"float\\":{\\"type\\":\\"number\\",\\"description\\":\\"An IEEE 754 floating point value.\\"},\\"string\\":{\\"type\\":\\"string\\",\\"description\\":\\"A UTF-8 character sequence.\\"},\\"id\\":{\\"type\\":\\"string\\",\\"description\\":\\"A unique identifier for an object. This scalar is serialized like a String but isn't meant to be human-readable.\\"},\\"boolean\\":{\\"type\\":\\"boolean\\",\\"description\\":\\"A boolean value.\\"},\\"awsjson\\":{\\"type\\":\\"string\\",\\"description\\":\\"A JSON string. Any valid JSON construct is automatically parsed and loaded in the resolver code as maps, lists, or scalar values rather than as the literal input strings. Unquoted strings or otherwise invalid JSON result in a GraphQL validation error.\\"},\\"awsemail\\":{\\"type\\":\\"string\\",\\"description\\":\\"An email address in the format local-part@domain-part as defined by RFC 822.\\",\\"pattern\\":\\"^[a-zA-Z0-9._%+-]+@[a-zA-Z0-9.-]+\\\\\\\\.[a-zA-Z]{2,}$\\"},\\"awsdate\\":{\\"type\\":\\"string\\",\\"description\\":\\"An extended ISO 8601 date string in the format YYYY-MM-DD.\\",\\"pattern\\":\\"^\\\\\\\\d{4}-d{2}-d{2}$\\"},\\"awstime\\":{\\"type\\":\\"string\\",\\"description\\":\\"An extended ISO 8601 time string in the format hh:mm:ss.sss.\\",\\"pattern\\":\\"^\\\\\\\\d{2}:\\\\\\\\d{2}:\\\\\\\\d{2}\\\\\\\\.\\\\\\\\d{3}$\\"},\\"awsdatetime\\":{\\"type\\":\\"string\\",\\"description\\":\\"An extended ISO 8601 date and time string in the format YYYY-MM-DDThh:mm:ss.sssZ.\\",\\"pattern\\":\\"^\\\\\\\\d{4}-\\\\\\\\d{2}-\\\\\\\\d{2}T\\\\\\\\d{2}:\\\\\\\\d{2}:\\\\\\\\d{2}\\\\\\\\.\\\\\\\\d{3}Z$\\"},\\"awstimestamp\\":{\\"type\\":\\"string\\",\\"description\\":\\"An integer value representing the number of seconds before or after 1970-01-01-T00:00Z.\\",\\"pattern\\":\\"^\\\\\\\\d+$\\"},\\"awsphone\\":{\\"type\\":\\"string\\",\\"description\\":\\"A phone number. This value is stored as a string. Phone numbers can contain either spaces or hyphens to separate digit groups. Phone numbers without a country code are assumed to be US/North American numbers adhering to the North American Numbering Plan (NANP).\\",\\"pattern\\":\\"^\\\\\\\\d{3}-d{3}-d{4}$\\"},\\"awsurl\\":{\\"type\\":\\"string\\",\\"description\\":\\"A URL as defined by RFC 1738. For example, https://www.amazon.com/dp/B000NZW3KC/ or mailto:[email protected]. URLs must contain a schema (http, mailto) and can't contain two forward slashes (//) in the path part.\\",\\"pattern\\":\\"^(https?|mailto)://[^s/$.?#].[^s]*$\\"},\\"awsipaddress\\":{\\"type\\":\\"string\\",\\"description\\":\\"A valid IPv4 or IPv6 address. IPv4 addresses are expected in quad-dotted notation (123.12.34.56). IPv6 addresses are expected in non-bracketed, colon-separated format (1a2b:3c4b::1234:4567). You can include an optional CIDR suffix (123.45.67.89/16) to indicate subnet mask.\\"}},\\"required\\":[]}},\\"required\\":[\\"value\\"]}}}}],\\"toolChoice\\":{\\"tool\\":{\\"name\\":\\"responseType\\"}}};
+  const prompt = \\"\\";


We give so much information and it still returns a response with wrong format? :)

Yea 😞
We haven't found a generic solution yet, but we've seen some promising results with various techniques, and are cautiously optimistic that we can get there.

atierian added 20 commits August 30, 2024 10:26

add graphqlUrl field to GraphQLAPIProvider

a3c265e

add generation-transformer

65bf4b8

add generation transformer to transformer chain

df59807

add generation e2e tests

26165a9

add generation transformer as dep for api and data construct

b73ae3e

update data and api construct jsii

4255f75

some cleanup and inline docs

666dd56

update readme

8d1524a

update number of transformers for test assertion

b540c60

update api extract for generation transformer

3e3b867

update api extract in transformer-interfaces

d4e5ab5

lint readme

a8281de

cleanup, inline docs, cascading types

c8052fd

split out req / res resolverfn and add some more inline docs

296c497

update inline comment for disabled model generation e2e test

ed314e9

alphabetize package dep order

9b7acb7

fix typo in test comment

7839a2c

remove unnecessary noise from snapshots

75a0972

remove graphqlurl change that snuck in again

118f4d2

move query type validation to validate function

8ab7bec

atierian requested review from a team as code owners August 30, 2024 15:24

atierian mentioned this pull request Aug 30, 2024

feat(generation-transformer): add generation transformer #2813

Closed

atierian commented Aug 30, 2024

View reviewed changes

packages/amplify-graphql-transformer-interfaces/src/transform-host-provider.ts Show resolved Hide resolved

atierian added 4 commits August 30, 2024 11:42

update generation API.md

d85ea0c

update split-e2e to force us-west-2 for generation e2es

27c2f37

use parent account for generation e2e

51fe60a

split e2e

61ff76a

palpatim reviewed Aug 30, 2024

View reviewed changes

atierian added 10 commits August 30, 2024 14:43

fix dep versions in generation transformer

54db06d

fix typo in e2e schema definition

a3775a7

fix threshold numbers and add a few more test cases

97018fa

toUpper

0704d88

add comment clarifying disallowed fields

a941491

stringify system prompt

48ff9f0

remove ueber complex ip address pattern prop

0aaacc5

magic-stringectomy

4de6c72

add over / under validation tests

7919c94

add doc comment clarifying intent of scalar types for json schema

3c4fc18

palpatim reviewed Aug 30, 2024

View reviewed changes

...s/amplify-graphql-generation-transformer/src/utils/graphql-scalar-json-schema-definitions.ts Show resolved Hide resolved

palpatim previously approved these changes Aug 30, 2024

View reviewed changes

atierian added 2 commits August 30, 2024 17:34

lint

c0c662b

fix api extract

174470b

phani-srikar previously approved these changes Aug 30, 2024

View reviewed changes

atierian dismissed stale reviews from phani-srikar and palpatim via 174470b August 30, 2024 21:47

phani-srikar approved these changes Aug 30, 2024

View reviewed changes

atierian enabled auto-merge (squash) August 30, 2024 21:58

palpatim approved these changes Aug 30, 2024

View reviewed changes

atierian merged commit a86db4e into feature/raven Aug 30, 2024
5 of 6 checks passed

atierian deleted the ai-generation branch August 30, 2024 22:18

This was referenced Sep 3, 2024

chore: move json schema generation for tool use into transformer-core #2824

Merged

feat(ai): add conversation and generation transformers #2830

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(generation-transformer): add generation transformer #2820

feat(generation-transformer): add generation transformer #2820

atierian commented Aug 30, 2024

atierian Aug 30, 2024

atierian Aug 30, 2024

atierian Aug 30, 2024 •

edited

Loading

atierian Aug 30, 2024

atierian Aug 30, 2024

atierian commented Aug 30, 2024

phani-srikar Aug 30, 2024

atierian Aug 30, 2024

phani-srikar Aug 30, 2024

phani-srikar Aug 30, 2024

atierian Aug 30, 2024

phani-srikar Aug 30, 2024

atierian Aug 30, 2024

feat(generation-transformer): add generation transformer #2820

feat(generation-transformer): add generation transformer #2820

Conversation

atierian commented Aug 30, 2024

Description of changes

Amplify GraphQL Generation Transformer

Directive Definition

Features

Examples

Basic Usage

Scalar Type Generation

Complex Type Generation

Advanced Configuration

Limitations

CDK / CloudFormation Parameters Changed

Issue #, if available

Description of how you validated changes

Checklist

Choose a reason for hiding this comment

Choose a reason for hiding this comment

atierian Aug 30, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

atierian commented Aug 30, 2024

----- MARK REVIEW -----

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

atierian Aug 30, 2024 •

edited

Loading