DPLT-1077 Process historical messages via Redis Streams #181

morgsmccauley · 2023-08-16T21:47:17Z

This PR write historical messages to a dedicated Redis Stream per indexer. Redis keys have been slightly restructured to make it easier to distinguish which streams need to be monitored, mainly:

Rather than storing the indexer name, i.e. morgs.near/test, we store the stream name, as this indexer could potentially have two streams, historical and real-time.
storage, which stores the indexer config, is now suffixed under the stream name, so we can have separate storage for both historical and real-time, preventing race conflicts between them

The last point is a bit of a weird one, but I am trying to avoid the scenario where we change the historical indexers code out from underneath it. This would be the case if the developer changes their indexer code while a historical process is still running. Keeping the indexer code consistent throughout a given historical backfill seems most correct.

It would be possible for a developer to start another historical process, while an existing one is ongoing, causing the indexer code to change, as historical processes for a given indexer share the same storage key. But I don't think it's worth actively preventing this, as our ideal state is to stop the existing historical backfill when another is kicked off.

Once an historical process has been kicked off, Runner will always start a process to listen to the stream, there's nothing telling it to stop listening. I may address this in a future PR, or just wait until we have control messages :).

morgsmccauley · 2023-08-17T02:54:06Z

indexer/queryapi_coordinator/src/historical_block_processing.rs

-            new_indexer_function_copy,
-            Opts::parse(),
-        ))
+        tokio::spawn(async move {


Need to ownership of redis_connection_manager to the thread

morgsmccauley · 2023-08-17T02:54:36Z

indexer/storage/src/lib.rs

@@ -84,14 +92,6 @@ pub async fn xadd(
 ) -> anyhow::Result<()> {
    tracing::debug!(target: STORAGE, "XADD: {:?}, {:?}", stream_key, fields);

-    // TODO: Remove stream cap when we finally start processing it
-    redis::cmd("XTRIM")


We now delete the stream messages after they are processed - so this can be removed

morgsmccauley · 2023-08-17T02:55:07Z

runner/src/index.ts

@@ -1,157 +1,57 @@
-import { createClient } from 'redis';


All Redis related actions have been extracted to RedisClient

roshaans

overall looks good to me.

roshaans · 2023-08-17T15:54:02Z

indexer/queryapi_coordinator/src/historical_block_processing.rs

+                storage::sadd(
+                    redis_connection_manager,
+                    storage::STREAMS_SET_KEY,
+                    storage::generate_historical_stream_key(&indexer_function.get_full_name()),


Should we have the ability to distinguish between different historical processes?

Won't using the indexer function's name cause the old historical processes's with the same name to be overwritten?

Yes, and that's intentional. We don't want to have multiple concurrent historical processes for a given indexer. I'll be working on stopping existing processes soon, which is why I didn't bother creating unique streams.

morgsmccauley added 3 commits August 16, 2023 15:12

feat: Write real time messages to dedicated redis stream

4457256

feat: Write historical messages to Redis Streams

ed699c2

refactor: Restructure real time keys to avoid conflicts with historical

b79c418

morgsmccauley requested a review from a team as a code owner August 16, 2023 21:47

morgsmccauley marked this pull request as draft August 16, 2023 21:47

morgsmccauley added 5 commits August 17, 2023 11:38

refactor: Extract redis logic to own class

b4956dc

refactor: Restructure redis keys for easier access

187a234

feat: Use new key structure in runner

059eafc

refactor: Rename redis types

7285e7d

fix: Use correct key to get unprocessed messages

213e252

morgsmccauley changed the title ~~DPLT-1077 Write historical messages to Redis Stream~~ DPLT-1077 Process historical messages via Redis Streams Aug 17, 2023

morgsmccauley added 4 commits August 17, 2023 14:10

test: RedisClient

e5a6b31

refactor: Extract special stream ID to constant

1f83147

refactor: Delete stream messages after processing

fd58ccc

feat: Remove stream size limit as we now delete messages

f416e23

morgsmccauley commented Aug 17, 2023

View reviewed changes

morgsmccauley marked this pull request as ready for review August 17, 2023 02:56

morgsmccauley requested review from a team and removed request for a team August 17, 2023 03:10

feat: Add real-time/historical label to metrics

75b8d8b

morgsmccauley force-pushed the DPLT-1077-process-historical-messages branch from 5ccc003 to 75b8d8b Compare August 17, 2023 03:24

roshaans approved these changes Aug 17, 2023

View reviewed changes

fix: Ensure duration metric is written on failed executions

def5d5f

morgsmccauley merged commit 639064d into main Aug 17, 2023
6 checks passed

morgsmccauley deleted the DPLT-1077-process-historical-messages branch August 17, 2023 20:00

morgsmccauley mentioned this pull request Apr 22, 2024

test stable branch git fix up #687

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DPLT-1077 Process historical messages via Redis Streams #181

DPLT-1077 Process historical messages via Redis Streams #181

morgsmccauley commented Aug 16, 2023 •

edited

Loading

morgsmccauley Aug 17, 2023

morgsmccauley Aug 17, 2023

morgsmccauley Aug 17, 2023

roshaans left a comment

roshaans Aug 17, 2023

morgsmccauley Aug 17, 2023

DPLT-1077 Process historical messages via Redis Streams #181

DPLT-1077 Process historical messages via Redis Streams #181

Conversation

morgsmccauley commented Aug 16, 2023 • edited Loading

morgsmccauley Aug 17, 2023

Choose a reason for hiding this comment

morgsmccauley Aug 17, 2023

Choose a reason for hiding this comment

morgsmccauley Aug 17, 2023

Choose a reason for hiding this comment

roshaans left a comment

Choose a reason for hiding this comment

roshaans Aug 17, 2023

Choose a reason for hiding this comment

morgsmccauley Aug 17, 2023

Choose a reason for hiding this comment

morgsmccauley commented Aug 16, 2023 •

edited

Loading