Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement Nexus Serializer for Temporal Payloads #5126

Merged
merged 5 commits into from
Nov 17, 2023

Conversation

bergundy
Copy link
Member

What changed?

See description

Why?

The server should do best effort conversion between Nexus Content and Temporal Payloads so pure Temporal applications don't have to deal with the separate Nexus Content concept.

How did you test it?

Added tests.

Potential risks

Non-standard content is converter into a Payload with metadata encoding of unknown/nexus-content. Users will need to prepare for that or their Nexus requests may fail.
I believe this is the best option we have to avoid lossy conversion and expose as much information and empower the application.
I considered using binary/plain for non-standard content but I'd rather not have these payloads accidentally slip into user's applications.

Is hotfix candidate?

Far from it.

@bergundy bergundy requested a review from a team as a code owner November 16, 2023 06:57
common/nexus/payload_serializer.go Show resolved Hide resolved
common/nexus/payload_serializer.go Outdated Show resolved Hide resolved
common/nexus/payload_serializer.go Outdated Show resolved Hide resolved
if !isStandardPayload(payload) {
data, err := payload.Marshal()
if err != nil {
return nil, fmt.Errorf("%w: payload marshal error: %w", errSerializer, err)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You may as well just have errSerializer be a string prefix; it doesn't look like we gain anything by wrapping it into the error tree

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Mostly did this to satisfy our linter.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Would avoid a double %w, there's only one true cause. In fact, would avoid errSerializer altogether, it has no value from what I see (but I understand server has a bit of a history with hard to read code by precreating errors often unnecessarily regardless of some opinions about performance or attempting to predict future uses)

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Again, I could make a linter exception here. I was just "following the rules".

common/nexus/payload_serializer.go Outdated Show resolved Hide resolved
common/nexus/payload_serializer_test.go Outdated Show resolved Hide resolved
common/nexus/payload_serializer_test.go Outdated Show resolved Hide resolved
common/nexus/payload_serializer_test.go Show resolved Hide resolved
common/nexus/payload_serializer_test.go Show resolved Hide resolved
commonpb "go.temporal.io/api/common/v1"
)

type nexusPayloadSerializer struct{}
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm going to rename this to payloadSerializer and avoid the stuttering.

common/nexus/payload_serializer.go Show resolved Hide resolved
commonpb "go.temporal.io/api/common/v1"
)

type payloadSerializer struct{}
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What is the purpose of this type? Not that it's wrong, but you check that it implements an interface and everything but nothing uses it and it's not exported

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I still don't know if I want to export it but I'm going to use it in a bit when I implement the Nexus Handler in this code base.

Copy link
Member

@cretz cretz Nov 17, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Would be willing to wait until then to review since how it is used affects how it is written (or at least set a reminder to go review this already-merged piece if it does get merged before the usage of it is done)

if params["format"] == "protobuf" {
payload.Metadata["encoding"] = []byte("json/protobuf")
messageType := params["message-type"]
if messageType != "" {
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't think we should allow proto without message type. It's not very useful over normal JSON and message-type-less protos in Temporal are only supported on older SDKs. Arguably you don't need format and message-type as separate params, just a single protobuf-message-type param is good enough to be seen as proto JSON.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm fine considering protos without message-type non-standard.
I'll also consider merging into a single param.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah, more and more the single param sounds good.

case "application/x-protobuf":
payload.Metadata["encoding"] = []byte("binary/protobuf")
messageType := params["message-type"]
if messageType != "" {
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Same here, we should not allow message-type-less proto binary

common/nexus/payload_serializer.go Show resolved Hide resolved
}
contentType := h.Get("type")
if contentType == "" {
return len(h) == 0 && len(content.Data) == 0
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

So it was confusing to follow this logic down to here, but I think an absent content type with data should be a binary/plain.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I considered that but it would make this conversion asymmetric. Because we can't interpret the intent I think it may be better that this becomes encoding: unknown/nexus-content so then if it's turned into a content object again it doesn't get an application/octet-stream content type.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

An absent content type + data is always asymmetric because it's more loosely defined inbound but we want to explicitly set content type outbound when data is present. So discussion of absent content-type + data only applies in one direction.

if !isStandardPayload(payload) {
data, err := payload.Marshal()
if err != nil {
return nil, fmt.Errorf("%w: payload marshal error: %w", errSerializer, err)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Would avoid a double %w, there's only one true cause. In fact, would avoid errSerializer altogether, it has no value from what I see (but I understand server has a bit of a history with hard to read code by precreating errors often unnecessarily regardless of some opinions about performance or attempting to predict future uses)

}
case "json/protobuf":
content.Header["type"] = fmt.Sprintf("application/json; format=protobuf")
if messageType != "" {
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't think any SDKs leave this off anymore and haven't for a while, I think we can require usage of a somewhat modern SDK version for use with Nexus protos

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm fine with that.

return &nexus.Content{Header: nexus.Header{}}, nil
}

if !isStandardPayload(payload) {
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is confusing too, can you just make this stuff part of the switch below? You can set data in each arm of the switch, there's not that much code reuse here. Just handle each type specifically. I can suggest exact rewrite if it helps.

@cretz
Copy link
Member

cretz commented Nov 17, 2023

(I don't usually do reviews in this repository but I was asked in this case)

Copy link
Member Author

@bergundy bergundy left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the reviews.

commonpb "go.temporal.io/api/common/v1"
)

type payloadSerializer struct{}
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I still don't know if I want to export it but I'm going to use it in a bit when I implement the Nexus Handler in this code base.

common/nexus/payload_serializer.go Show resolved Hide resolved
if params["format"] == "protobuf" {
payload.Metadata["encoding"] = []byte("json/protobuf")
messageType := params["message-type"]
if messageType != "" {
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm fine considering protos without message-type non-standard.
I'll also consider merging into a single param.

}

func isStandardNexusContent(content *nexus.Content) bool {
h := content.Header
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This was supposed to clone the map, I missed that.

}
contentType := h.Get("type")
if contentType == "" {
return len(h) == 0 && len(content.Data) == 0
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I considered that but it would make this conversion asymmetric. Because we can't interpret the intent I think it may be better that this becomes encoding: unknown/nexus-content so then if it's turned into a content object again it doesn't get an application/octet-stream content type.

if !isStandardPayload(payload) {
data, err := payload.Marshal()
if err != nil {
return nil, fmt.Errorf("%w: payload marshal error: %w", errSerializer, err)
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Again, I could make a linter exception here. I was just "following the rules".

}
case "json/protobuf":
content.Header["type"] = fmt.Sprintf("application/json; format=protobuf")
if messageType != "" {
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm fine with that.

@bergundy
Copy link
Member Author

Merged the detection and serialization logic and address the rest of the comments.
PTAL.

Comment on lines +52 to +55
// We assume that encoding is handled by the transport layer and the content is decoded.
delete(h, "encoding")
// Length can safely be ignored.
delete(h, "length")
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why remove these here when you can just filter and not set them in the one place you actually copy headers, the setUnknownNexusContent call?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is mostly to simplify the logic below and sanitize any irrelevant fields.

// Length can safely be ignored.
delete(h, "length")

if len(h) > 1 {
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why do you care what the rest of the headers are so long as the content type is correct? I think you're overthinking here and can just remove this conditional

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't want to lose information in the conversion

if len(h) == 0 && len(content.Data) == 0 {
payload.Metadata["encoding"] = []byte("binary/null")
} else {
setUnknownNexusContent(h, payload.Metadata)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I still think this should be binary/plain

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This could also be content header that's not type, and I'd rather not lose this information.

var errSerializer = errors.New("serializer error")

// Deserialize implements nexus.Serializer.
func (payloadSerializer) Deserialize(content *nexus.Content, v any) error {
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

To me this reads clearer as a future maintainer, but don't have to accept it (just typed real quick, so probably some typos):

// Deserialize implements nexus.Serializer.
func (payloadSerializer) Deserialize(content *nexus.Content, v any) error {
	payloadRef, ok := v.(**commonpb.Payload)
	if !ok {
		return fmt.Errorf("cannot deserialize into %v", v)
	}
	payload := &commonpb.Payload{Metadata: map[string][]byte{}, Data: content.Data}
	*payloadRef = payload

	// Absent content type with no data means null, with data is just binary
	if contentType := content.Header["type"]; contentType == "" {
		if len(content.Data) == 0 {
			payload.Metadata["encoding"] = []byte("binary/null")
		} else {
			payload.Metadata["encoding"] = []byte("binary/plain")
		}
		return nil
	}

	// Different content types make different payloads
	if mediaType, params, err := mime.ParseMediaType(contentType); err == nil {
		switch mediaType {
		case "application/x-temporal-payload":
			if err := payload.Unmarshal(content.Data); err != nil {
				return fmt.Errorf("failed to unmarshal payload: %w", err)
			}
			return nil
		case "application/json":
			if protoMessageType := params["protobuf-message-type"]; protoMessageType != "" {
				payload.Metadata["encoding"] = []byte("json/protobuf")
				payload.Metadata["messageType"] = []byte(protoMessageType)
			} else {
				payload.Metadata["encoding"] = []byte("json/plain")
			}
			return nil
		case "application/x-protobuf":
			if protoMessageType := params["protobuf-message-type"]; protoMessageType != "" {
				payload.Metadata["encoding"] = []byte("binary/protobuf")
				payload.Metadata["messageType"] = []byte(protoMessageType)
				return nil
			}
			return fmt.Errorf("protobuf message type required for application/x-protobuf content type")
		case "application/octet-stream":
			payload.Metadata["encoding"] = []byte("binary/plain")
			return nil
		}
	}

	// Unknown or bad content type
	payload.Metadata["encoding"] = "binary/unknown-nexus"
	for k, v := range content.Header {
		// Don't want length or encoding
		if k != "length" && k != "encoding" {
			payload.Metadata[k] = []byte(v)
		}
	}
	return nil
}

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I prefer my stricter non-lossy version.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the suggestion though.

common/nexus/payload_serializer.go Outdated Show resolved Hide resolved
}
}
case "json/protobuf":
if len(payload.Metadata) != 2 || messageType == "" {
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think, like with the code in deserialize, you're being overly strict on the metadata. If it's our encoding, we can choose to ignore other metadata

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't think you're right here.
If a user customizes serialization to add more information, we should treat their payload as opaque and non standard and avoid losing metadata during the conversion.

case "application/json":
if len(params) == 0 {
payload.Metadata["encoding"] = []byte("json/plain")
} else if len(params) == 2 && params["format"] == "protobuf" && params["message-type"] != "" {
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think you're being overly strict on the mime type params. If it's our content type, we can ignore params we don't care about

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

again, I'd rather not lose this information. Either you're sticking to the standard or you're a custom nexus payload.

@bergundy bergundy merged commit 4bd580f into temporalio:nexus Nov 17, 2023
8 checks passed
@bergundy bergundy deleted the nexus-payload-serializer branch November 17, 2023 23:57
@bergundy
Copy link
Member Author

@cretz I thought about it some more and it may be desirable to be a bit less strict with the conversion but I would like to do that in a way that no information is lost.

In any case, I'll get back to this at some point before merging nexus into main but I'd rather make some more progress before that.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants