The Redis integration can't distinguish final errors from retried errors #3820

byroot · 2024-08-01T07:35:29Z

There is a bit of an ongoing problem for users of dd-trace-rb: the instrumentation of the Redis gem behaves quite differently depending on whether you are using the 4.x or the 5.x version.

More detail at redis-rb/redis-client#119 (comment), but in short with the 4.x version the instrumentation is hooked above the code in charge of retrying on error, while with 5.x it's below.

Because of this, users upgrading from 4.x to 5.x see a stream of errors that aren't new at all (they just weren't reported before) and think the 5.x version is bugged.
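To make the difference concrete, here is a minimal, self-contained sketch of the two hook positions. It is a toy model under stated assumptions: `ToyClient`, its error lists, and the retry count are all hypothetical and do not reflect the actual redis, redis-client, or dd-trace-rb code.

```ruby
# Toy model only: ToyClient and its hooks are hypothetical, not the redis or
# redis-client API.
class ToyClient
  attr_reader :low_level_errors, :high_level_errors

  def initialize(failures_before_success:, max_attempts: 3)
    @failures_left = failures_before_success
    @max_attempts = max_attempts
    @low_level_errors = []
    @high_level_errors = []
  end

  # Hook placed above the retry loop (the 4.x-style position): it only sees
  # an error if every attempt failed.
  def call(command)
    call_with_retries(command)
  rescue IOError => e
    @high_level_errors << e
    raise
  end

  private

  def call_with_retries(command)
    attempts = 0
    begin
      attempts += 1
      attempt(command)
    rescue IOError
      retry if attempts < @max_attempts
      raise
    end
  end

  # Hook placed below the retry loop (the 5.x-style position): it sees every
  # failed attempt, including ones that are retried successfully.
  def attempt(command)
    send_command(command)
  rescue IOError => e
    @low_level_errors << e
    raise
  end

  # Simulates a connection that fails a fixed number of times, then recovers.
  def send_command(_command)
    if @failures_left > 0
      @failures_left -= 1
      raise IOError, "Broken pipe"
    end
    "OK"
  end
end

client = ToyClient.new(failures_before_success: 1)
result = client.call(%w[PING])
```

In this sketch the call succeeds from the application's point of view, yet the lower hook has recorded one error while the upper hook has recorded none.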

This is no fault of dd-trace-rb, it just uses the official hook points declared by redis-client. As the maintainer of redis-client I'd like to find a solution to this, and I'm totally willing to extend or evolve the hook points to allow dd-trace-rb and similar projects to ignore retried errors.

But to help with that I'd first like to know if there is a precedent for this: is this done for other integrations? Would dd-trace-rb need to just not see errors, or would it need some kind of boolean telling whether the error is final, etc.? This would help me drive what the API change would look like.
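As an illustration of the "boolean telling whether the error is final" option, here is a hedged sketch. Everything in it is an assumption made for the example: `RetryingCaller`, the `on_error` callback, and the `final:` keyword are hypothetical and are not part of redis-client's hook points today.

```ruby
# Hypothetical sketch: neither RetryingCaller nor the `final:` keyword exists
# in redis-client today.
class RetryingCaller
  def initialize(max_attempts:, on_error:)
    @max_attempts = max_attempts
    @on_error = on_error
  end

  # Runs the block, retrying on IOError, and tells the callback whether the
  # error it is seeing will be retried or is the final one.
  def call
    attempts = 0
    begin
      attempts += 1
      yield attempts
    rescue IOError => e
      final = attempts >= @max_attempts
      @on_error.call(e, final: final)
      retry unless final
      raise
    end
  end
end

reported = []
# An instrumentation that only reports errors the application will also see.
tracer_hook = ->(error, final:) { reported << error.message if final }

retrier = RetryingCaller.new(max_attempts: 2, on_error: tracer_hook)

# Fails once, then succeeds: nothing should be reported.
value = retrier.call { |attempt| attempt == 1 ? raise(IOError, "EOF") : "PONG" }
reported_after_recovery = reported.dup

# Fails on every attempt: only the final failure should be reported.
begin
  retrier.call { raise IOError, "EOF" }
rescue IOError
  # The final failure still propagates to the application.
end
```

The alternative, not showing retried errors to the instrumentation at all, would amount to only invoking the callback when `final` is true.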

@ivoanjo @TonyCTHsu please let me know your thoughts.

ivoanjo · 2024-08-01T08:52:06Z

Thanks for letting us know, definitely sucks that we're breaking folks and adding more work for you 😅

Let me sync with the other folks and we'll get back to you asap.

byroot · 2024-08-01T08:58:39Z

> definitely sucks that we're breaking folks and adding more work for you

It really isn't that bad, it's mostly confusion I think.

marcotc · 2024-08-07T20:09:53Z

Just an FYI that we are still looking into it; just trying to get the right people together with all the summer PTOs.

byroot · 2024-08-07T20:56:53Z

No worries. That's much appreciated.

marcotc · 2024-08-21T21:33:50Z

Hey @byroot, I had a chat internally and here's what we came up with:

We think that visibility at a level that abstracts away any automatic retries is the desirable default. We had this behavior when we instrumented version 4.x, but we made the mistake of changing that behavior for 5.x.
Inspecting the middleware call site today, we think that it makes sense that it is invoked on each individual request, even on retried requests, because arbitrary request modifications can be necessary on an individual request level. For observability, we are not interested in that signal, but we see the value in theory.

What we're thinking of doing is going back to the original monkey-patching for 5.x, the same that we do for 4.x.

We don't think it is necessarily required for the redis-client gem to support callbacks at the level we desire, which today would translate to "one high-level Redis API call to one instrumentation signal".
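A rough sketch of what "one high-level Redis API call to one instrumentation signal" can look like with monkey-patching via `Module#prepend`. This is only an illustration under assumptions: `FakeRedis` and `SpanRecorder` are hypothetical stand-ins, not the actual dd-trace-rb patch or the redis gem's internals.

```ruby
# Toy stand-in for a client whose public `call` retries internally.
# FakeRedis and SpanRecorder are hypothetical names, not dd-trace-rb or redis code.
class FakeRedis
  def initialize(failures_before_success)
    @failures_left = failures_before_success
  end

  def call(*command)
    attempts = 0
    begin
      attempts += 1
      raw_call(command)
    rescue IOError
      retry if attempts < 3
      raise
    end
  end

  private

  def raw_call(_command)
    if @failures_left > 0
      @failures_left -= 1
      raise IOError, "Broken pipe"
    end
    "OK"
  end
end

# Prepended in front of FakeRedis#call, so it wraps the whole retry loop and
# records exactly one span per high-level call.
module SpanRecorder
  def self.spans
    @spans ||= []
  end

  def call(*command)
    span = { command: command.first, error: nil }
    super
  rescue IOError => e
    span[:error] = e.class.name
    raise
  ensure
    SpanRecorder.spans << span
  end
end

FakeRedis.prepend(SpanRecorder)

# Two attempts fail and the third succeeds, all inside a single recorded span.
reply = FakeRedis.new(2).call("GET", "key")
```

Because the patch sits on the outermost public method, it never observes the two retried failures; the trade-off is that code calling a lower layer directly would not be instrumented.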

We are happy to expand a conversation as well and hear any feedback.

byroot · 2024-08-21T21:35:57Z

> What we're thinking of doing is going back to the original monkey-patching for 5.x, the same that we do for 4.x.

So you wouldn't instrument raw redis-client usage at all?

marcotc · 2024-10-29T18:47:12Z

> So you wouldn't instrument raw redis-client usage at all?

This is correct. Given that we would rather report Redis requests at a high level, and looking at the code in redis and redis-client, we wouldn't instrument at the redis-client level.

byroot · 2024-10-29T18:48:25Z

Note that redis-client can be used standalone, it's not exclusively a dependency of redis.

byroot added the community and feature-request labels Aug 1, 2024

byroot mentioned this issue Aug 1, 2024

Broken Pipe and EOFErrors redis-rb/redis-client#119

Open

byroot changed the title ~~The Redis integration can't distinguish final errors from final errors~~ The Redis integration can't distinguish final errors from retried errors Aug 1, 2024

liaden mentioned this issue Aug 8, 2024

Supressing errors when handling retries #3835

Open
