OmitType and TypeName to support sharing types #2409

t0yv0 · 2024-09-10T18:52:01Z

Introduce OmitType and TypeName flags to enforce type sharing.

Some upstream providers generate very large concrete schemata. TF is not materially affected beyond high RAM demands for in-memory processing. Pulumi is affected significantly: the default projection generates a named type for every instance of the shared schema, which leads to SDK bloat and "filename too long" issues. The example is inspired by QuickSight types in AWS.

With this change the provider maintainer can opt into explicit sharing of types and ensure that the names of the shared types have shorter, meaningful prefixes.

At definition time the user can specify the type name to generate, which can be very short, and replace the automatically implied ReallyLongPrefixedTypeName like this:

"visuals": {
	Elem: &info.Schema{
		TypeName: tfbridge.Ref("Visual"),
	},
},

At reference time in another resource, the user can reuse an already generated type by token. This already worked before this change but had the downside of still generating unused helper types and causing SDK bloat.

"visuals": {
	Elem: &info.Schema{
		Type: "testprov:index/Visual:Visual",
	},
},

With this change it is possible to instruct the bridge to stop generating the unused helper types:

"visuals": {
	Elem: &info.Schema{
		Type:     "testprov:index/Visual:Visual",
		OmitType: true,
	},
},

codecov · 2024-09-10T19:00:13Z

Codecov Report

Attention: Patch coverage is 64.51613% with 11 lines in your changes missing coverage. Please review.

Project coverage is 57.66%. Comparing base (fbc2abd) to head (1370dde).
Report is 1 commits behind head on master.

Files with missing lines	Patch %	Lines
pkg/tfgen/generate.go	50.00%	9 Missing and 1 partial ⚠️
pkg/tfgen/generate_schema.go	88.88%	1 Missing ⚠️

Additional details and impacted files

@@           Coverage Diff           @@
##           master    #2409   +/-   ##
=======================================
  Coverage   57.66%   57.66%           
=======================================
  Files         369      369           
  Lines       50121    50148   +27     
=======================================
+ Hits        28902    28920   +18     
- Misses      19641    19650    +9     
  Partials     1578     1578

t0yv0 · 2024-09-10T19:59:53Z

This does not check that the underlying type is structurally equal to the new replacement type. We could add this check.
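A minimal sketch of what such a check could look like, assuming a simplified, hypothetical `typeShape` description of an object type. Neither `typeShape` nor `checkReplacement` is part of the bridge; a real check would have to compare the bridge's own schema representation.

```go
package main

import (
	"fmt"
	"reflect"
)

// typeShape is a hypothetical, simplified description of an object type.
// It is not the bridge's real schema representation.
type typeShape struct {
	Properties map[string]string // property name -> type name
	Required   []string
}

// checkReplacement returns an error when the type being replaced does not
// have the same shape as the shared type that replaces it.
func checkReplacement(token string, original, shared typeShape) error {
	if !reflect.DeepEqual(original, shared) {
		return fmt.Errorf("type %q is not structurally equal to the type it replaces", token)
	}
	return nil
}

func main() {
	visual := typeShape{Properties: map[string]string{"id": "string"}, Required: []string{"id"}}
	other := typeShape{Properties: map[string]string{"id": "number"}, Required: []string{"id"}}

	// Identical shapes pass the check.
	fmt.Println(checkReplacement("testprov:index/Visual:Visual", visual, visual))
	// A differing property type is reported as an error.
	fmt.Println(checkReplacement("testprov:index/Visual:Visual", other, visual))
}
```

Note that `reflect.DeepEqual` treats a nil slice and an empty slice as different, so a production check would likely normalize the shapes first.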

iwahbe

This is a very surgical change, which is great. I would mark the new fields on info.Schema as experimental in their respective doc comments, just to hedge against future iteration on the design.

pkg/tfbridge/info.go

iwahbe · 2024-09-11T08:07:28Z

pkg/tfgen/generate.go

+	switch {
+	case t.typePrefixOverride != nil && other.typePrefixOverride == nil:
+		return false
+	case t.typePrefixOverride == nil && other.typePrefixOverride != nil:
+		return false
+	case t.typePrefixOverride != nil && other.typePrefixOverride != nil &&
+		*t.typePrefixOverride != *other.typePrefixOverride:
+		return false
+	}


Nit:

I think this is clearer as an if statement. It looks odd to me to have 3 branches that all lead to the same body.

if (t.typePrefixOverride != nil && other.typePrefixOverride == nil) ||
	(t.typePrefixOverride == nil && other.typePrefixOverride != nil) ||
	(t.typePrefixOverride != nil && other.typePrefixOverride != nil &&
		*t.typePrefixOverride != *other.typePrefixOverride) {
	return false
}

That said, this is just an inequality check made painful by nil values. Nitpicking, I would express this as the inverse of an equality check:

Suggested change

-	switch {
-	case t.typePrefixOverride != nil && other.typePrefixOverride == nil:
-		return false
-	case t.typePrefixOverride == nil && other.typePrefixOverride != nil:
-		return false
-	case t.typePrefixOverride != nil && other.typePrefixOverride != nil &&
-		*t.typePrefixOverride != *other.typePrefixOverride:
-		return false
-	}
+	if eq := t.typePrefixOverride == other.typePrefixOverride ||
+		(t.typePrefixOverride != nil && other.typePrefixOverride != nil &&
+			*t.typePrefixOverride == *other.typePrefixOverride); !eq {
+		return false
+	}

This is an F# habit that may be out of place but I find individual cases more tractable than a large bool expr.
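For comparison, a third way to express the same nil-aware check is a small generic helper. The sketch below is illustrative only: `ptrEq` is a hypothetical name and is not something this PR adds.

```go
package main

import "fmt"

// ptrEq reports whether two optional values are equal: both nil, or both
// non-nil and pointing at equal values. The name is hypothetical.
func ptrEq[T comparable](a, b *T) bool {
	if a == nil || b == nil {
		// Equal only when both sides are nil.
		return a == b
	}
	return *a == *b
}

func main() {
	x, y, z := "Visual", "Visual", "Sheet"
	fmt.Println(ptrEq(&x, &y))           // distinct pointers, equal values
	fmt.Println(ptrEq(&x, &z))           // different values
	fmt.Println(ptrEq[string](nil, nil)) // both unset
	fmt.Println(ptrEq(&x, nil))          // only one side set
}
```

With such a helper the comparison in the diff would reduce to a single `if !ptrEq(t.typePrefixOverride, other.typePrefixOverride) { return false }`, at the cost of one more function to maintain.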

pkg/tfbridge/info/info.go

pkg/tfgen/generate.go

pkg/tfgen/generate_schema_test.go

VenelinMartinov

Very nice! You mentioned tools to help provider maintainers spot these repeated types in the schemas. Is the plan to add these to upstream upgrades or tfgen or something else?

t0yv0 · 2024-09-11T13:10:41Z

I spent a lot of time last night rolling out QuickSight type sharing in AWS against this functionality and found that it can be made to work, but it is quite awkward.

pulumi/pulumi-aws#4449

This PR reduces QuickSight from 30,000 to 7,000 types, though there is still some work left to specify sharing.

A couple of learnings there:

- The user might want to control the full name of the auto-generated type and not just the prefix, since it is otherwise hard to predict the sharing token. Perhaps the better interface would be:

{ ShareTypeAs: "<token>" }

In AWS I built a stateful facade that emulates this by emitting {Prefix:""} on the first invocation and {Type: "<token>", OmitType: true} subsequently. It has some issues guessing the right token.

- I've been relying heavily on a shared type detector from a different PR that operates on TypeSpec equality. It sounds like we could really use shim.Schema equality instead.
- Once the provider builder confirms that certain types are shared, it can be extremely daunting to specify the sharing, chasing down all the copies in complicated cases like QuickSight.
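The stateful facade mentioned above could be sketched roughly as follows. This is an illustration under assumptions, not the code used in pulumi-aws: `schemaOverride`, `typeSharer`, and `shareAs` are hypothetical names, the struct only mirrors the three fields discussed in this PR, and the sketch uses a full type name rather than a prefix on the first invocation.

```go
package main

import "fmt"

// schemaOverride is a hypothetical stand-in for the info.Schema fields
// discussed in this PR; it is not the bridge's real type.
type schemaOverride struct {
	TypeName *string // name for the type generated at the definition site
	Type     string  // token of an already generated type to reuse
	OmitType bool    // skip generating the now-unused helper type
}

// typeSharer remembers which shared names have already been defined.
type typeSharer struct {
	pkg  string
	seen map[string]bool
}

func newTypeSharer(pkg string) *typeSharer {
	return &typeSharer{pkg: pkg, seen: map[string]bool{}}
}

// shareAs returns a defining override on the first use of name and a
// referencing override on every later use.
func (s *typeSharer) shareAs(name string) schemaOverride {
	if !s.seen[name] {
		s.seen[name] = true
		return schemaOverride{TypeName: &name}
	}
	// Assumption: the token follows the "<pkg>:index/<Name>:<Name>" shape of
	// the example in the PR description; real tokens may differ.
	token := fmt.Sprintf("%s:index/%s:%s", s.pkg, name, name)
	return schemaOverride{Type: token, OmitType: true}
}

func main() {
	s := newTypeSharer("testprov")
	first := s.shareAs("Visual")
	second := s.shareAs("Visual")
	fmt.Println(*first.TypeName, first.OmitType)
	fmt.Println(second.Type, second.OmitType)
}
```

The token guess in `shareAs` is exactly the fragile part described above: the facade has to predict the token the bridge will assign, which is why an interface that takes the token directly may be easier to use.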

Perhaps we can automate the following flow.

1. Run go generate to compute a type sharing specification and check it in as a YAML file. The specification records where the shared types are found and what their suggested tokens are.
2. The provider maintainer audits this file to rename tokens as needed and to "accept" that the discovered sharing is desirable, e.g. to confirm that the identical types were generated from a shared func() invocation upstream.
3. At resources.go time the sharing specification is loaded from disk and executed against ProviderInfo to find all copies of shared root types and set the corresponding info.Schema overrides; tfgen then proceeds to generate a schema with sharing.
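The detection half of the first step can be sketched as grouping schema paths by structural shape. Everything here is an assumption for illustration: `objectShape` and `findShared` are hypothetical, the shapes are flat property-to-type maps, and the real detector mentioned above works on TypeSpec equality rather than on this simplified model.

```go
package main

import (
	"encoding/json"
	"fmt"
	"sort"
)

// objectShape is a hypothetical, simplified type description: property name
// to primitive type name. Real schemas nest and carry more metadata.
type objectShape map[string]string

// findShared groups schema paths by structural shape and returns only the
// groups with more than one member, sorted for stable output.
func findShared(shapes map[string]objectShape) [][]string {
	groups := map[string][]string{}
	for path, shape := range shapes {
		// json.Marshal sorts map keys, so equal shapes serialize identically.
		key, err := json.Marshal(shape)
		if err != nil {
			continue // not expected for map[string]string; kept for safety
		}
		groups[string(key)] = append(groups[string(key)], path)
	}
	var shared [][]string
	for _, paths := range groups {
		if len(paths) > 1 {
			sort.Strings(paths)
			shared = append(shared, paths)
		}
	}
	sort.Slice(shared, func(i, j int) bool { return shared[i][0] < shared[j][0] })
	return shared
}

func main() {
	shapes := map[string]objectShape{
		"analysis.visuals":  {"title": "string", "id": "string"},
		"dashboard.visuals": {"id": "string", "title": "string"},
		"template.filters":  {"expr": "string"},
	}
	for _, group := range findShared(shapes) {
		fmt.Println(group)
	}
}
```

Each returned group is a candidate for one shared type; choosing its token and confirming that the sharing is intended would remain the maintainer's audit step.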

A question is what to do with dynamically bridged providers. It might be desirable there, since they are subject to type explosion as well. I think once we are certain of this functionality, we should default to detecting sharing in dynamically bridged providers, picking arbitrary names for the shared types.

t0yv0 · 2024-09-11T19:35:23Z

Tweaked the API to make it a lot more usable for the QuickSight use case. PTAL.

iwahbe

LGTM

pkg/tfbridge/info/info.go

pulumi-bot · 2024-09-20T09:50:16Z

This PR has been shipped in release v3.91.0.

t0yv0 requested review from flostadler, corymhall, iwahbe and VenelinMartinov September 10, 2024 18:52

iwahbe reviewed Sep 11, 2024

VenelinMartinov approved these changes Sep 11, 2024

t0yv0 added 5 commits September 11, 2024 14:17

Implement type sharing support

3dd9c26

Fix chattiness of the test

1a24b64

Lint

87c08e2

Skip new test on Windows

0730846

Control the exact type name and not prefix

11d4115

t0yv0 force-pushed the t0yv0/type-sharing branch from 3920161 to 11d4115 Compare September 11, 2024 18:17

t0yv0 added 5 commits September 11, 2024 14:20

Switch to Ref

7231894

Fix the test

e238d07

PR feedback

68dffbe

PR Feedback 2

ac1be90

Fix typo

51091c3

t0yv0 changed the title ~~OmitType and TypePrefixOverride to support sharing types~~ OmitType and TypeName to support sharing types Sep 11, 2024

t0yv0 added 2 commits September 11, 2024 14:35

Fix typo

d76371d

Experimental markers in the comments

0a27d59

t0yv0 requested review from iwahbe and VenelinMartinov September 11, 2024 19:35

iwahbe reviewed Sep 12, 2024

Add example use

1370dde

t0yv0 requested a review from iwahbe September 12, 2024 15:00

iwahbe approved these changes Sep 12, 2024

t0yv0 merged commit f0fee4f into master Sep 12, 2024
11 checks passed

t0yv0 deleted the t0yv0/type-sharing branch September 12, 2024 15:25

This was referenced Sep 13, 2024

Ensure shorter tokens for object types #1118

Open

Support re-rolling unrolled recursive TF types #1468

Open

mjeffryes assigned t0yv0 Sep 13, 2024
