[prototype] return AssetMaterialization #14931

alangenfeld · 2023-06-23T21:53:53Z

explorations for returning AssetMaterialization directly

alangenfeld · 2023-06-23T21:54:05Z

Current dependencies on/for this PR:

master
- PR add indirect execution context access #14954
  - PR [prototype] return AssetMaterialization #14931 👈

This stack of pull requests is managed by Graphite.

alangenfeld · 2023-06-26T15:01:36Z

python_modules/dagster/dagster_tests/asset_defs_tests/test_assets.py

+            # supporting AssetMaterialization without asset_key requires handling it being in a partial state.
+            # could do AssetMaterialization.partial(...) with special partial state obj
+            asset_key=context.asset_key_for_output(),


idea: allow omitting asset_key in the constructor args, but avoid nullability on the object itself by grabbing the current asset_key indirectly (and erroring when that is not possible)

edit: this has since been implemented and is in the PR

sryza · 2023-06-26T22:59:25Z

Have you considered a new MaterializationMetadata type?

alangenfeld · 2023-06-27T15:15:26Z

Have you considered a new MaterializationMetadata type?

Yeah, that's the other option. In previous discussions no one was hot on the idea of adding a new noun. What's your take?

sryza · 2023-06-27T18:54:16Z

My initial lean would be towards MaterializationMetadata, I think. Allowing AssetMaterialization to be in two different states (with asset key or without) feels weird.

It's also a breaking change, right? For anyone depending on the asset key attribute to be non-None.

And it means that anyone using type-checking needs to guard against None whenever they access AssetMaterialization.asset_key.
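To make the type-checking concern concrete, here is a minimal sketch. `NullableMaterialization` and `describe` are hypothetical stand-ins, not Dagster classes; the sketch only assumes that `asset_key` would become `Optional` on the object.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class NullableMaterialization:
    """Hypothetical stand-in for an AssetMaterialization whose asset_key may be None."""

    asset_key: Optional[str] = None


def describe(mat: NullableMaterialization) -> str:
    # Every reader of asset_key now needs this guard; without it, a type
    # checker rejects the .upper() call because the value is Optional[str].
    if mat.asset_key is None:
        return "<asset key not set>"
    return mat.asset_key.upper()
```

Code written against a non-None `asset_key` would skip the guard, which is where the breaking-change worry comes from.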

alangenfeld · 2023-06-27T20:58:21Z

allowing AssetMaterialization to be in two different states (with asset key or without) feels weird.

Agreed, and that is not the case in the current updated state of this prototype PR. We allow omitting asset_key from the constructor in cases where we can infer it, and error otherwise [2].

alangenfeld · 2023-06-27T20:58:35Z

python_modules/dagster/dagster/_core/definitions/events.py

+        elif asset_key is None:
+            current_ctx = get_execution_context()
+            if current_ctx is None:
+                raise DagsterInvariantViolationError(
+                    "Could not infer asset_key, not currently in the context of an execution."
+                )
+            keys = current_ctx.selected_asset_keys
+            if len(keys) != 1:
+                raise DagsterInvariantViolationError(
+                    f"Could not infer asset_key, there are {len(keys)} in the current execution"
+                    " context. Specify the appropriate asset_key."
+                )
+            asset_key = next(iter(keys))


alangenfeld · 2023-06-27T21:00:22Z

How do you feel about [2] and with that does that change your stance on MaterializationMetadata?

sryza · 2023-06-28T22:52:44Z

How do you feel about [2] and with that does that change your stance on MaterializationMetadata?

Ah, I missed this. I'm having trouble putting my finger on any concrete negatives, but this kind of pattern gives me some generalized discomfort – I think it's difficult for users to look at the code and guess how it's working.

alangenfeld · 2023-06-29T17:10:28Z

this kind of pattern gives me some generalized discomfort – I think it's difficult for users to look at the code and guess how it's working.

Makes sense, this is definitely a "may be lesser of two evils" type of proposal.

Probably need to flesh this prototype out to include multi_asset as I think the full picture including how things work there will influence the final call.

sryza · 2023-08-16T22:41:00Z

python_modules/dagster/dagster/_core/execution/plan/execute_step.py

@@ -165,6 +171,15 @@ def _step_output_error_checked_user_event_sequence(
                    f'Emitting implicit Nothing for output "{step_output_def.name}" on {op_label}'
                )
                yield Output(output_name=step_output_def.name, value=None)
+
+            if step_context.is_sda_step and step_context.get_observed_user_asset_mat(


Not sure if this has any consequence with the way that execution currently works, but imagine a world where we could kick off downstream steps as soon as an upstream output is produced, even if the upstream step hasn't finished yet.

In that world, we would ideally yield the Output as soon as the materialization is reported/yielded/returned, right?
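The ordering difference being asked about can be sketched with plain generators. Everything here is hypothetical: events are modeled as `(kind, payload)` tuples rather than Dagster event objects, and the sketch assumes the added check runs only after the user's events are exhausted.

```python
from typing import Iterator, List, Tuple

Event = Tuple[str, str]  # (kind, payload); hypothetical stand-in for Dagster events


def deferred_output(user_events: List[Event]) -> Iterator[Event]:
    """Pass user events through; emit the implicit output only after the step body is done."""
    saw_materialization = False
    for event in user_events:
        if event[0] == "materialization":
            saw_materialization = True
        yield event
    if saw_materialization:
        yield ("output", "result")


def eager_output(user_events: List[Event]) -> Iterator[Event]:
    """Emit the implicit output immediately after the materialization is observed."""
    for event in user_events:
        yield event
        if event[0] == "materialization":
            yield ("output", "result")
```

With the eager variant, a consumer that starts downstream work on the first output event would not have to wait for any remaining work in the upstream step.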

The latest evolution of #14931 & #15392, intentionally aligned with #15928. This PR adds support for a new "Result" return type from assets that do not deal with "Outputs" to be able to communicate materialization metadata.

## How I Tested These Changes

added tests.

alangenfeld force-pushed the al/06-23-_prototype_return_AssetMaterialization branch from 9f29736 to 556c934 Compare June 23, 2023 21:58

alangenfeld changed the base branch from master to al/06-26-add_indirect_execution_context_access June 26, 2023 17:36

alangenfeld force-pushed the al/06-23-_prototype_return_AssetMaterialization branch from 556c934 to 51870da Compare June 26, 2023 17:36

alangenfeld mentioned this pull request Jun 26, 2023

add indirect execution context access #14954

Merged

alangenfeld force-pushed the al/06-26-add_indirect_execution_context_access branch from 37be28c to 8b69394 Compare July 10, 2023 20:31

alangenfeld force-pushed the al/06-23-_prototype_return_AssetMaterialization branch from 51870da to da70d2a Compare July 10, 2023 20:31

sryza reviewed Aug 16, 2023

[prototype] return AssetMaterialization

928618e

alangenfeld force-pushed the al/06-26-add_indirect_execution_context_access branch from 8b69394 to d0752c8 Compare August 18, 2023 14:46

alangenfeld force-pushed the al/06-23-_prototype_return_AssetMaterialization branch from da70d2a to 928618e Compare August 18, 2023 14:46

alangenfeld mentioned this pull request Aug 18, 2023

add MaterializeResult #15932

Merged

alangenfeld closed this Sep 1, 2023
