feat(sequencer-relayer): provide a shutdown controller #889

Fraser999 · 2024-04-01T15:02:17Z

Summary

Adds a shutdown handle for the sequencer-relayer.

Background

We want the ability to invoke the shutdown sequence from main and from tests.

Changes

A new RAII object ShutdownHandle has been added. This cancels the wrapped CancellationToken when the ShutdownHandle is dropped or when its shutdown method is called.
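The behaviour described above can be sketched roughly as follows. This is a std-only illustration, not the code from this PR: the real handle wraps a CancellationToken, whereas `CancelFlag` here is a hypothetical stand-in built on an `AtomicBool`, and the `create` constructor and field names are assumptions made for the example.

```rust
use std::sync::{
    atomic::{AtomicBool, Ordering},
    Arc,
};

/// Hypothetical stand-in for a cancellation token: a shared flag that only
/// ever moves from "not cancelled" to "cancelled".
#[derive(Clone, Default)]
pub struct CancelFlag(Arc<AtomicBool>);

impl CancelFlag {
    pub fn cancel(&self) {
        self.0.store(true, Ordering::SeqCst);
    }

    pub fn is_cancelled(&self) -> bool {
        self.0.load(Ordering::SeqCst)
    }
}

/// RAII handle: cancels the wrapped flag when `shutdown` is called or when
/// the handle is dropped, whichever happens first.
pub struct ShutdownHandle {
    flag: CancelFlag,
}

impl ShutdownHandle {
    /// Returns the handle plus a clone of the flag for the service to observe.
    pub fn create() -> (ShutdownHandle, CancelFlag) {
        let flag = CancelFlag::default();
        (ShutdownHandle { flag: flag.clone() }, flag)
    }

    /// Explicitly requests shutdown. Consuming `self` also runs `Drop`,
    /// which cancels a second time; cancelling twice is harmless.
    pub fn shutdown(self) {
        self.flag.cancel();
    }
}

impl Drop for ShutdownHandle {
    fn drop(&mut self) {
        self.flag.cancel();
    }
}
```

In this sketch the service would poll or await its copy of the flag, while main or a test keeps the handle; letting the handle go out of scope has the same effect as calling `shutdown`.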

SIGTERM handling has been moved to main.rs; on receiving a SIGTERM, shutdown is called on the shutdown handle associated with the relayer.

The relayer's main select! loop was simplified to only contain the two tasks.

Testing

The TestSequencerRelayer was updated to make use of the new shutdown controller. Also manually tested by sending a SIGTERM to a running sequencer-relayer.

Related Issues

Closes #882

crates/astria-sequencer-relayer/Cargo.toml

crates/astria-sequencer-relayer/src/sequencer_relayer.rs

Fraser999 · 2024-04-01T15:21:41Z

crates/astria-sequencer-relayer/tests/blackbox/helper.rs

+            timeout(Duration::from_secs(30), sequencer_relayer)
+                .await
+                .unwrap_or_else(|_| {
+                    panic!("timed out waiting for sequencer relayer to shut down");


I generally avoid putting panicking code in drop impls, but I think it's probably OK here since this is a test-only object. Having the test process abort seems like a better option than just logging, since the log message could easily be missed.

SuperFluffy · 2024-04-02T11:48:51Z

I think this panic makes sense: it means the shutdown logic failed, which we'd otherwise not catch.

There is also std::thread::panicking to check whether the thread is already panicking. You could issue a debug! in that case so it doesn't panic again.

Fraser999 · 2024-04-02

Well, that's what I meant in my comment. I think it's better not to check std::thread::panicking here, since all we'd gain is avoiding the test process aborting. If the thread is already panicking and all we do here is log an error, I think that error will be easy to miss.
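To make the trade-off in this thread concrete, here is a small std-only sketch of both options. `ShutdownCheck` and its fields are hypothetical: the real TestSequencerRelayer::drop waits on the relayer task with a timeout rather than reading a boolean, and the `logged` flag merely stands in for a log line.

```rust
use std::sync::{
    atomic::{AtomicBool, Ordering},
    Arc,
};

/// Hypothetical test-only guard that verifies shutdown when it is dropped.
pub struct ShutdownCheck {
    /// Stand-in for "the relayer task finished within the timeout".
    pub shut_down_cleanly: bool,
    /// If true, only record the failure when the thread is already panicking.
    pub skip_panic_if_panicking: bool,
    /// Set when a failure was recorded instead of raised.
    pub logged: Arc<AtomicBool>,
}

impl Drop for ShutdownCheck {
    fn drop(&mut self) {
        if self.shut_down_cleanly {
            return;
        }
        if self.skip_panic_if_panicking && std::thread::panicking() {
            // A second panic while unwinding would abort the process, so the
            // failure is only recorded here. The cost is that it is easy to
            // overlook, which is the concern raised in the thread.
            self.logged.store(true, Ordering::SeqCst);
            return;
        }
        panic!("timed out waiting for sequencer relayer to shut down");
    }
}
```

With `skip_panic_if_panicking` set to false, the guard always panics on a failed shutdown, which matches the approach the thread settles on: a failing test may abort the process, but the shutdown failure cannot go unnoticed.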

Fraser999 · 2024-04-01T15:24:08Z

crates/astria-sequencer-relayer/tests/blackbox/helper.rs

@@ -305,6 +326,12 @@ pub struct TestSequencerRelayerConfig {

 impl TestSequencerRelayerConfig {
    pub async fn spawn_relayer(self) -> TestSequencerRelayer {
+        assert_ne!(


We need the tokio runtime not to be of the "current thread" flavour; otherwise TestSequencerRelayer::drop hangs at the futures::executor::block_on call.

SuperFluffy · 2024-04-02T11:50:27Z

Interesting. The wiremock mock servers are fine running on current_thread. Do you know where this might be deadlocking? I wonder if we should try to address this problem.

Fraser999 · 2024-04-02

Well, it's not a problem with code currently in the main branch. It's caused by the call to futures::executor::block_on that this PR adds in TestSequencerRelayer::drop.

The async function passed to block_on can't be cancelled or paused, and I believe that stops the relayer from making progress inside the main select! macro, because the single tokio thread is blocked.

SuperFluffy

I very much like handling shutdowns explicitly, so that we can exercise this in every test.

I think this is almost ready to merge, but I would like to explore whether we can both simplify the ShutdownHandle (by moving the signal trap into main.rs and relying on a oneshot channel or a CancellationToken/DropGuard) and make it more explicit (by providing a ShutdownHandle::shutdown method).


crates/astria-sequencer-relayer/src/sequencer_relayer.rs

SuperFluffy

This is neat, thank you!

crates/astria-sequencer-relayer/src/main.rs

crates/astria-sequencer-relayer/src/sequencer_relayer.rs

…#919)

## Summary
Added the `--all-targets` flag to `cargo hack check` to ensure tests can also be built outside the workspace.

## Background
While implementing #889 it was found that sequencer-relayer tests can only be run from the repository root but not from the crate directory itself. The CI job that is supposed to ensure this failed because it was not running with the right flags.

## Changes
- Added `--all-targets` flag to `cargo hack check` in the rust test github workflow.

## Related Issues
Closes #893

## Summary
Conductor now respects shutdown signals it receives during init.

## Background
Conductor's task ignored shutdowns while still initializing. This meant that Conductor would hang for up to 30 seconds.

## Changes
- refactor conductor's constituent long-running tasks to separate initialization and running
- listen for the shutdown signal in all of conductor's tasks

## Testing
Run conductor with endpoints that hang indefinitely and send it SIGTERM. Observe that conductor shuts down quickly. The main operation of conductor is unaffected on the happy path: all blackbox tests run to completion. A proper test for the shutdown logic will be implemented in a follow-up refactor similar to #889.

provide a shutdown controller for the sequencer relayer

673dc27

Fraser999 requested review from joroshiba, noot and SuperFluffy as code owners April 1, 2024 15:02

github-actions bot added the sequencer-relayer pertaining to the astria-sequencer-relayer crate label Apr 1, 2024

Fraser999 commented Apr 1, 2024

crates/astria-sequencer-relayer/Cargo.toml

Fraser999 commented Apr 1, 2024

crates/astria-sequencer-relayer/src/sequencer_relayer.rs

Fraser999 commented Apr 1, 2024

Merge branch 'main' into 882-improve-relayer-shutdown-control

7cd5de2

SuperFluffy mentioned this pull request Apr 2, 2024

Run cargo heck check with --all-targets #893

Closed

SuperFluffy reviewed Apr 2, 2024


simplify and rename the ShutdownController

fc24725

SuperFluffy approved these changes Apr 3, 2024

crates/astria-sequencer-relayer/src/main.rs Outdated

crates/astria-sequencer-relayer/src/main.rs Outdated

crates/astria-sequencer-relayer/src/sequencer_relayer.rs

Fraser999 and others added 2 commits April 3, 2024 19:42

minor changes to sequencer-relayer main

27c3fa0

Merge branch 'main' into 882-improve-relayer-shutdown-control

7203941

Fraser999 added this pull request to the merge queue Apr 3, 2024

Merged via the queue into astriaorg:main with commit 0877d69 Apr 3, 2024
36 checks passed

Fraser999 deleted the 882-improve-relayer-shutdown-control branch April 3, 2024 21:37

SuperFluffy mentioned this pull request Apr 4, 2024

feat(ci): ensure all crate targets can be built outside the workspace #919

Merged

SuperFluffy mentioned this pull request May 17, 2024

feat(conductor): respect shutdown signals during init #1080

Merged
