SpanEvent and Event through LogRecord should be "unified" #3406

jsuereth · 2023-04-17T20:27:54Z

What are you trying to achieve?

Whether or not a user leverages SpanEvent or LogRecord to fire events should be orthogonal to the representation of the data. If I decide to add an event to a Span, then I should have similar data-model to when I fire off an "unadorned" event via the Event API.

What did you expect to see?

Right now there are fields on LogRecord that do not exist on SpanEvent:

SeverityText
SeverityNumber
Body

I believe this need to be added to (or have a specified encoding in) SpanEvent.

Additional context.

I think we should not try to workaround the issue in particular domains, like: #2926

Instead, allowing events to flow as SpanEvent or LogEvent should both be viable choices.

The text was updated successfully, but these errors were encountered:

jkwatson · 2023-04-17T20:48:46Z

I don't think the as-of-yet-incomplete "event" API will support severities, or bodies. Just names and attributes, like our span events do.

scheler · 2023-04-18T02:13:08Z

In Define the shape of a event by encapsulating the payload in event.data. #2926, we wanted to introduce the notion of payload for Events, via attribute event.data. It is for this payload that @jsuereth and @jack-berg recommend using LogRecord Body instead.
The current Events API and Events SemConv do not talk about Severity. However, in case Events end up needing Severity the recommendation was to use the in-built fields in the LogRecord and not introduce new attributes. That is the reason for including SeverityNumber and SeverityText in this issue. Event API can continue to not have these until the use-cases emerge.
@jsuereth ObservedTimestamp is another field in LogRecord that could be added to SpanEvent. Only the Context fields could be omitted.

joaopgrassi · 2023-04-18T14:27:02Z

Was it considered if we even need the SpanEvent in the long run? I already saw users getting confused on why "events" are shown in two places and questions as : When I use a span event vs when I use a log.

jsuereth · 2023-04-18T15:02:17Z

@joaopgrassi I believe there was discussion here. The main (huge) difference right now is in how SAMPLING works between the two. Span Events are sampled exactly as Spans are. Log-based events tend to be sampled by severity / log-ingestion configuration.

I suspect we need to wait for the unification between these two channels before we could drop SpanEvents, and in practice I expect we'll have to deal with both for quite some time.

tigrannajaryan · 2023-04-26T15:07:01Z

Duplicate/related to #622

tedsuo · 2023-04-26T15:35:50Z

Right now there are fields on LogRecord that do not exist on SpanEvent:

SeverityText

SeverityNumber

Body

Since SpanEvents are a name plus a bag of attributes, it is straight forward to make SpanEvents equivalent to logs. The SpanEvent.Name field can match the event.name log attribute. For log fields where there is no SpanEvent equivalent, we can define SpanEvent attributes using semantic conventions:

log.severity_text
log.severity_number
log.body
For writing non-event logs to SpanEvents, the SpanEvent.name can be log.

This would allow data recorded through either API to be emitted using either data model.

I do not recommend adding more fields to the SpanEvents data model, and I do not recommend adding additional API surface area for exposing additional fields on SpanEvents. This is due to the fact that tracing backends do not understand these fields, and would make no use of them.

Once we have a stable Event API and data model, I recommend that we deprecate the SpanEvents API. This will alleviate confusion. Instead of trying to explain why you would use one API versus another, we just say "SpanEvents are deprecated, use Events."

Under the hood, users can choose where they would like data to be recorded based on whether they have a unified backend, or a separate logging and tracing backend. The SDK can be configured to allow users to emit any data recorded through either API to either data model. This is where the above semantic conventions come into play: translating data from one model to another when you have two separate backends.

jkwatson · 2023-04-26T17:07:49Z

Once we have a stable Event API and data model, I recommend that we deprecate the SpanEvents API. This will alleviate confusion. Instead of trying to explain why you would use one API versus another, we just say "SpanEvents are deprecated, use Events."

I don't know about this... A SpanEvent is explicitly associated to the span to which it is attached. An Event would only potentially be associated with a span via some unknown mechanism. Not all Events that are emitted in the course of a span's execution should be associated to the span, so how would we figure out when we do and do not make this association?

I do not recommend trying to make a SpanEvent and an Event the same thing. Although they are structurally similar, they are not semantically equivalent, and trying to force them to be the same will only lead to user confusion, I fear.

trask · 2023-04-26T19:13:19Z

Not all Events that are emitted in the course of a span's execution should be associated to the span, so how would we figure out when we do and do not make this association?

it seems ok for the default to use the "current span" (you can still ignore the association in your backend event query), and you can opt-out with EventBuilder.setContext(Context.root())

Although they are structurally similar, they are not semantically equivalent

I agree with this. I think of SpanEvent as more semantically equivalent to a Log.

Instead of trying to explain why you would use one API versus another, we just say "SpanEvents are deprecated, use Events."

Maybe we refine this to be "SpanEvents are deprecated, use Events or Logs"?

scheler · 2023-04-27T02:23:57Z

For log fields where there is no SpanEvent equivalent, we can define SpanEvent attributes using semantic conventions:

log.severity_text

log.severity_number

log.body

For writing non-event logs to SpanEvents, the SpanEvent.name can be log.

What would be the type of value for the log.body attribute - any? So now it's between adding additional fields in SpanEvent vs allowing attribute values of type map?

This is due to the fact that tracing backends do not understand these fields, and would make no use of them.

I know very little about tracing backends, but do they understand SpanEvents as they exist today? Does adding additional fields make it worse?

MSNev · 2023-08-22T15:41:18Z

Going over this it seems to me that there is an argument for #2994 to remove the domain and just have the event.name

tedsuo · 2024-07-12T18:14:36Z

Circling back on this. Since we are coming to the end of the Event SIG and this is one of our last remaining issues, let's make a decision so that we can close the SIG down.

Representing Events as SpanEvents

Using the log namespace to define the semantic conventions for representing log fields appears to be straightforwards, except for the log.body attribute. Log body is an Any type that will regularly include nested data. Seems like there are two options.

log.body attribute is represented as as Any type
The advantage of this approach is that it has no additional overhead. There would be no need to expose the Any type in the SpanEvent API.

However, are there any tracing backends in existence that expect an Any type in a SpanEvent? It's not useful to send data that tracing backends do not support. However, all tracing backends could be updated to do something useful with this field. Is it safe to assume that backends would not create errors? What happens if you send Jaeger an Any field? We could assume that backends guard against Any and either drop it or serialize it.

log.body attribute is serialized as a string
Regardless of the type of Log Body, it is stringified as JSON. For the byte type, it is represented as a string with the prefix data:base64.

The advantage of this approach is that it ensures tracing backends do not drop this field. They can choose to deserialize it if they want to. The disadvantage is that there is overhead.

My recommendation is that we should serialize the body field when converting it to an attribute. According to the spec, using the Any type would be a breaking change. Since there's a way to do this without a breaking change, it seems to me that this is the correct decision. But I do not have a strong feeling about it.

Representing Logs as SpanEvents

Events are just LogRecords that require certain attributes to be present. Once we've defined how Events are converted to SpanEvents, it would be trivial to extend this definition to include converting any log to a SpanEvent. There could be an option in the SDK:

No conversion
Convert only Events
Convert all LogRecords

The only additional decision needed to convert all LogRecords would be the value of the SpanEvent Name field. LogRecords that are not Events do not have a name attribute. I propose the value of the Name field should just be log. Seems simple enough.

trask · 2024-08-01T00:01:37Z

Representing Events as SpanEvents

is this only needed for backwards compatibility, i.e. so we can move existing SpanEvents over to Events?

if so, then we could drop the need to convert (log) event body (since existing SpanEvents don't have a body)

it also would solve the problem of what to do with (log) events which aren't linked to a span (since existing SpanEvents are all linked to a span)

tigrannajaryan · 2024-08-23T14:58:53Z

Please also consider adding the Name field to LogRecord. We have in Span Events and we needed it for generic events and added an event.name semconv for it. It will be a backward compatible addition, existing LogRecords will simply have the Name empty and event.name is experimental so we can remove it and replace by Name.

tedsuo · 2024-08-23T17:44:03Z

@tigrannajaryan oh man I would have loved name to be a field a year ago, because I agree that this concept is part of the structure of OTLP. But at this late stage I'm concerned that it would be burdensome. So we have officially switched sides on this issue. 😄

lmolkova · 2024-08-23T19:31:11Z

Having first-class Name in the LogRecord would eliminate the last difference between logs and events and I feel it'd simplify a lot of things, but I expect @tedsuo will object (and I respect it)

MSNev · 2024-08-23T19:49:02Z

and event.name is experimental so we can remove it and replace by Name.

We should NOT remove the event.name attribute as this explicitly calls out that the log record is representing an event and is not just a generic log record.

Whether or not there is a need for another generic log record name attribute should NOT be conflated with the need to explicitly call out that an event is being sent.

lmolkova · 2024-08-23T19:52:14Z

We should NOT remove the event.name attribute as this explicitly calls out that the log record is representing an event and is not just a generic log record.

The non-empty Name conveys the same message.

But it's an important point - if an event is just a named log record, why do we need two notions? I've been in several discussions this week where some people assumed that logs and events are different things and other people assumed that they are the same, but events are named.

Are they different? If so, we need to explain it better (at least provide guidance when someone would report an event vs named log).
If they are the same, let's remove unnecessary notion and simplify things.

MSNev · 2024-08-26T22:05:38Z

They are not the same thing and should not be considered to be the same, events are just "reusing" a Log record as a transport. This entire discussion (Span / Log event unification) is (I believe) also highlighting that the general concept of an event is separate to its actual "transport" mechanism used.

As defined in the Sem conv definition of Events

Semantically, an Event is a named occurrence at an instant in time. It signals that "this thing happened at this time" and provides additional specifics about the occurrence.

And as such they SHOULD be treated differently to just a log generic record.

Now "IF" as part of "transporting" the event via a Log Record the semantic convention for any "Log Name" was "defined" that it is itself namespaced and that an event is defined as being "prefixed" as event.<event.name> then technically that could achieve the same thing, with a potential processing advantage that any backend would not need to deserialize all of the attributes to "find" one called event.name to determine that it's an event. But "just" simply "moving" the existing event.name to a log name verbatim (I believe) will cause (bridging) issues and that is the foundation on my objection to "removing" event.name. Would this "stop" issues -- no, unless the semconv / spec was / is defined from the outset to be namespaced.

The non-empty Name conveys the same message.

This is problematic as I would (assume) that there is (or would be) some log bridge that wants to translate some log "name" from some logging system that would want to use a "Log Name".

Now "IF" the semantic convention for any "Log Name" was to "redefine" that any possible Log Name which identified an event WAS "prefixed" as event.<event.name> then technically that could achieve the same thing, so that any backend would not need to deserialize all of the attributes to "find" one called event.name to determine that it's an event. But "just" simply "moving" the existing event.name to a log name verbatim (I believe) will cause (bridging) issues

trask · 2024-08-26T23:23:41Z

if an event is just a named log record, why do we need two notions?

I'm getting more comfortable with the idea that an event is just a named log record

At the same time, keeping the separate notion of event could have value

it could help explain the OpenTelemetry vision by separating the desired (events) from the more legacy (logs)
it allows us to give users a first-class OpenTelemetry Event API (promoting our vision), while side-stepping the contentiousness of introducing something called a Logging API

MSNev · 2024-08-26T23:45:18Z

Ok, I just asked CoPilot to give me a list of logging systems that support each log record having a "name" (so the potential bridge situation) and this is the list it gave (they are possibly wrong and depending on the existing bridge may not exist), but these are the "potential" name clashing items that I would be concerned about.

Several logging systems allow you to send a “name” with each log record, which can be very useful for categorizing and filtering logs. Here are a few examples:

Python’s logging module: In Python, you can create loggers with specific names using logging.getLogger(name). Each log record can include the logger’s name, which helps in organizing and filtering logs 1 2.
Log4j (Java): Log4j allows you to create named loggers. Each log record can include the name of the logger, which is useful for distinguishing logs from different parts of an application 1.
NLog (C#/.NET): NLog supports named loggers, and you can include the logger’s name in each log record. This helps in categorizing logs and making them easier to search and analyze 1.
Serilog (C#/.NET): Serilog allows you to create loggers with specific names and include these names in each log record. This is particularly useful for structured logging and filtering 1.
Logback (Java): Similar to Log4j, Logback supports named loggers, and you can include the logger’s name in each log record for better organization and filtering 1.

lmolkova · 2024-08-27T16:54:53Z

named loggers are not related to named log.

Logger name goes to the instrumentation scope. Logger name is used for filtering for the category (e.g. enable all logs from com.foo.MyClass with level INFO and above).

The named log record discussed in open-telemetry/semantic-conventions#1339 is a name of the event for this record.

If we create gen_ai events in OpenAI instrumentations using logging facades:

in .NET: scope name: OpenAI.ChatClient, event/log record name would be gen_ai.user.message
in Python: scope name: opentelemetry-instrumentation-openai.chat, event/log record name - gen_ai.user.message

Event/log record name identifies event globally (or users can choose to give custom events names that are unique within the scope - that's unfortunate for these users, but not the end of the world - they can also fix it).

So I don't understand the clashing concern.

jsuereth added spec:logs Related to the specification/logs directory spec:trace Related to the specification/trace directory labels Apr 17, 2023

github-actions bot assigned jack-berg Apr 17, 2023

MSNev mentioned this issue May 1, 2023

Define the shape of a event by encapsulating the payload in event.data. #2926

Closed

MSNev mentioned this issue Aug 22, 2023

Do not add AddLink(), use option to AddEvent() instead #3337

Closed

MSNev added this to Logs SIG Sep 20, 2023

MSNev moved this to In Progress in Logs SIG Sep 20, 2023

MSNev moved this from In Progress to Semantic Conventions in Logs SIG Sep 20, 2023

MSNev moved this from What is an event to In progress in Logs SIG Sep 20, 2023

MSNev moved this from In progress to Related in Logs SIG Sep 20, 2023

trask mentioned this issue Jan 29, 2024

Event payload lives in the LogRecord body open-telemetry/semantic-conventions#566

Merged

3 tasks

lmolkova mentioned this issue Mar 21, 2024

LLM semconv: how to capture prompts and completions open-telemetry/semantic-conventions#829

Closed

This was referenced Apr 24, 2024

Add event.data attribute to provide mapping from span events to events open-telemetry/semantic-conventions#954

Closed

Define span events to events mapping #4023

Closed

austinlparker added the triage:accepted:needs-sponsor Ready to be implemented, but does not yet have a specification sponsor label May 14, 2024

austinlparker unassigned jack-berg May 14, 2024

austinlparker added this to 🔭 Main Backlog Jul 16, 2024

austinlparker moved this to Spec - Accepted in 🔭 Main Backlog Jul 16, 2024

MSNev mentioned this issue Aug 9, 2024

feat: Add support for event body fields open-telemetry/weaver#297

Merged

tigrannajaryan mentioned this issue Aug 23, 2024

logs: Add durable identifier log record attributes open-telemetry/semantic-conventions#1339

Closed

3 tasks

lmolkova mentioned this issue Nov 19, 2024

Add event_name to logs proto open-telemetry/opentelemetry-proto#600

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SpanEvent and Event through LogRecord should be "unified" #3406

SpanEvent and Event through LogRecord should be "unified" #3406

jsuereth commented Apr 17, 2023

jkwatson commented Apr 17, 2023

scheler commented Apr 18, 2023

joaopgrassi commented Apr 18, 2023 •

edited

Loading

jsuereth commented Apr 18, 2023

tigrannajaryan commented Apr 26, 2023

tedsuo commented Apr 26, 2023

jkwatson commented Apr 26, 2023

trask commented Apr 26, 2023

scheler commented Apr 27, 2023

MSNev commented Aug 22, 2023

tedsuo commented Jul 12, 2024 •

edited

Loading

trask commented Aug 1, 2024

tigrannajaryan commented Aug 23, 2024

tedsuo commented Aug 23, 2024

lmolkova commented Aug 23, 2024

MSNev commented Aug 23, 2024

lmolkova commented Aug 23, 2024 •

edited

Loading

MSNev commented Aug 26, 2024

trask commented Aug 26, 2024

MSNev commented Aug 26, 2024

lmolkova commented Aug 27, 2024 •

edited

Loading

SpanEvent and Event through LogRecord should be "unified" #3406

SpanEvent and Event through LogRecord should be "unified" #3406

Comments

jsuereth commented Apr 17, 2023

jkwatson commented Apr 17, 2023

scheler commented Apr 18, 2023

joaopgrassi commented Apr 18, 2023 • edited Loading

jsuereth commented Apr 18, 2023

tigrannajaryan commented Apr 26, 2023

tedsuo commented Apr 26, 2023

jkwatson commented Apr 26, 2023

trask commented Apr 26, 2023

scheler commented Apr 27, 2023

MSNev commented Aug 22, 2023

tedsuo commented Jul 12, 2024 • edited Loading

Representing Events as SpanEvents

Representing Logs as SpanEvents

trask commented Aug 1, 2024

tigrannajaryan commented Aug 23, 2024

tedsuo commented Aug 23, 2024

lmolkova commented Aug 23, 2024

MSNev commented Aug 23, 2024

lmolkova commented Aug 23, 2024 • edited Loading

MSNev commented Aug 26, 2024

trask commented Aug 26, 2024

MSNev commented Aug 26, 2024

lmolkova commented Aug 27, 2024 • edited Loading

joaopgrassi commented Apr 18, 2023 •

edited

Loading

tedsuo commented Jul 12, 2024 •

edited

Loading

lmolkova commented Aug 23, 2024 •

edited

Loading

lmolkova commented Aug 27, 2024 •

edited

Loading