Why not dedupe in Thanos query proxy? #4602
pentlander
started this conversation in
General
Replies: 2 comments 1 reply
-
I took an |
-
It seems that if the goal is for this component to actually function as a proxy, this technique could be used to avoid decoding chunks: https://blog.najaryan.net/posts/partial-protobuf-encoding/
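To make the idea concrete, here is a minimal sketch of forwarding length-delimited protobuf fields without decoding their payloads. It uses only the Go standard library and is not Thanos code: `rawField`, `splitFields`, and `appendField` are hypothetical names, and the field numbers (1 for labels, 2 for chunk bytes) are assumptions for illustration, not the real Store API layout.

```go
package main

import (
	"bytes"
	"encoding/binary"
	"errors"
	"fmt"
)

// rawField is one top-level protobuf field whose payload is left undecoded.
type rawField struct {
	Num     uint64 // field number
	Payload []byte // raw bytes, aliasing the input buffer (no copy)
}

// splitFields walks the top level of a protobuf message and returns every
// length-delimited field (wire type 2) without decoding its payload.
// Varint fields (wire type 0) are skipped; other wire types are rejected
// to keep the sketch short.
func splitFields(msg []byte) ([]rawField, error) {
	var out []rawField
	for len(msg) > 0 {
		tag, n := binary.Uvarint(msg)
		if n <= 0 {
			return nil, errors.New("bad tag")
		}
		msg = msg[n:]
		switch tag & 7 {
		case 0: // varint value: read it only to know how many bytes to skip
			_, n := binary.Uvarint(msg)
			if n <= 0 {
				return nil, errors.New("bad varint")
			}
			msg = msg[n:]
		case 2: // length-delimited: keep the payload as opaque bytes
			size, n := binary.Uvarint(msg)
			if n <= 0 || size > uint64(len(msg)-n) {
				return nil, errors.New("bad length")
			}
			end := n + int(size)
			out = append(out, rawField{Num: tag >> 3, Payload: msg[n:end]})
			msg = msg[end:]
		default:
			return nil, fmt.Errorf("unsupported wire type %d", tag&7)
		}
	}
	return out, nil
}

// appendField writes a field back out as tag, length, and untouched payload.
func appendField(dst []byte, f rawField) []byte {
	dst = binary.AppendUvarint(dst, f.Num<<3|2)
	dst = binary.AppendUvarint(dst, uint64(len(f.Payload)))
	return append(dst, f.Payload...)
}

func main() {
	// Hypothetical series message: field 1 = labels, field 2 = chunk bytes.
	msg := appendField(nil, rawField{Num: 1, Payload: []byte("job=api")})
	msg = appendField(msg, rawField{Num: 2, Payload: []byte{0xde, 0xad, 0xbe, 0xef}})

	fields, err := splitFields(msg)
	if err != nil {
		panic(err)
	}

	// Forward every field unchanged; the chunk bytes are never interpreted.
	var forwarded []byte
	for _, f := range fields {
		forwarded = appendField(forwarded, f)
	}
	fmt.Println(len(fields), bytes.Equal(forwarded, msg))
}
```

The trade-off is that a layer forwarding opaque chunk bytes like this cannot deduplicate samples, because deduplication needs the decoded chunk contents.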
-
Thanos Query does not deduplicate series when serving requests through the Store API; it only deduplicates through the HTTP API. Specifically, it says:
I'm wondering what the reasoning behind that is. We're currently running a federated Thanos Query setup and only recently realized that deduplication occurs only at the highest layer. It would be nice to distribute the work more evenly between layers; otherwise we'll just remove the second query layer, since it only adds overhead.
Edit: This seems relevant: #2700, though I don't understand the explanation. proxy.go reads in the Series, which unmarshals the AggrChunks, so the deduplication might as well be performed at the same time. Maybe I'm missing something: are the inner Chunks unmarshalled lazily?