Execution service 2: separate execution from validation #1536

tsahee · 2023-03-24T17:22:01Z

The block recorder becomes a function of the execution client.
Heavy rewrite of the block validators to support the new architecture, simplifying them considerably along the way.


tsahee · 2023-03-24T17:22:50Z

Made this a draft to avoid accidental merging.

codecov · 2023-03-24T17:40:19Z

Codecov Report

Merging #1536 (329611e) into master (154a17d) will decrease coverage by 3.73%.
The diff coverage is 56.02%.

❗ Current head 329611e differs from pull request most recent head 2fea784. Consider uploading reports for the commit 2fea784 to get more accurate results

@@            Coverage Diff             @@
##           master    #1536      +/-   ##
==========================================
- Coverage   54.49%   50.76%   -3.73%     
==========================================
  Files         221      270      +49     
  Lines       33099    31704    -1395     
  Branches        0      555     +555     
==========================================
- Hits        18036    16095    -1941     
- Misses      12906    13382     +476     
- Partials     2157     2227      +70


PlasmaPower

I have a couple of minor comments and this has a merge conflict now, but it's looking good!


PlasmaPower · 2023-07-08T05:08:54Z

staker/block_validator.go

+	if v.recordSentA < countUint64 {
+		v.recordSentA = countUint64
+	}
+	v.validatedA = countUint64


Don't these all need to be atomic writes?

No, because we're write-holding the reorg lock. There are comments, but I'll make them a little clearer.

We should add comments to created(), recordSent(), and validated() indicating that they need the reorg lock read-held. Right now Validated(t *testing.T) is called in a test without the lock, but we don't seem to enable the race detector for that test so we should be good.
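To illustrate the locking argument in this thread, here is a minimal sketch, not the actual `BlockValidator` code. It assumes a `sync.RWMutex` plays the role of the reorg lock; the `counters` type and the `markValidated`, `reorgTo`, and `readValidated` names are hypothetical. The reorg path takes the write lock, so plain writes cannot race with any reader, while the normal path only holds the read lock and therefore uses atomics.

```go
package main

import (
	"fmt"
	"sync"
	"sync/atomic"
)

// counters is a hypothetical stand-in for the validator's progress fields.
type counters struct {
	reorgMutex  sync.RWMutex
	recordSentA uint64
	validatedA  uint64
}

// validated must be called with reorgMutex at least read-held.
func (c *counters) validated() uint64 {
	return atomic.LoadUint64(&c.validatedA)
}

// markValidated is the normal progress path. Only the read lock is held,
// so other readers may run concurrently and the store must be atomic.
func (c *counters) markValidated(count uint64) {
	c.reorgMutex.RLock()
	defer c.reorgMutex.RUnlock()
	atomic.StoreUint64(&c.validatedA, count)
}

// reorgTo is the reorg path. The write lock excludes every reader,
// so plain (non-atomic) writes are safe here.
func (c *counters) reorgTo(count uint64) {
	c.reorgMutex.Lock()
	defer c.reorgMutex.Unlock()
	if c.recordSentA < count {
		c.recordSentA = count
	}
	c.validatedA = count
}

// readValidated shows a caller taking the read lock before using the accessor.
func (c *counters) readValidated() uint64 {
	c.reorgMutex.RLock()
	defer c.reorgMutex.RUnlock()
	return c.validated()
}

func main() {
	c := &counters{}
	c.markValidated(10)
	c.reorgTo(4)
	fmt.Println(c.readValidated(), c.recordSentA)
}
```

A test that calls the accessor without taking the read lock, as mentioned above, would only be flagged if it ran under the race detector while a writer was active.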

PlasmaPower · 2023-07-08T05:10:55Z

staker/staker.go

+	if err != nil {
+		if errors.Is(err, ErrGlobalStateNotInChain) && s.fatalErr != nil {
+			fatal := fmt.Errorf("latest staked not in chain: %w", err)
+			s.fatalErr <- fatal


I think this fatal error might prevent the node from performing a reorg it needs to do to correct its invalid state (because it shuts down before it's able to reorg). Perhaps this should only be fatal if the stakedInfo's inbox accumulator matches the inbox tracker's record, otherwise just return an error here.

Let's think about this.
We've already read the batch for this data and it's quite old (feed messages won't get us "caught up").
Most likely there was some weird one-off error in calculating state that we can't really recover from.
I think being very noisy with this error is worth having to go through manual hoops in case it is recoverable.

Makes sense. I guess if someone is error-looping on this they could always just disable the staker for a bit until the reorg happens.


PlasmaPower

LGTM

tsahee added 23 commits January 30, 2023 18:42

Merge branch 'execution_separation_initial' into execution-separate-o…

1e4e052

…ver-valiation

Merge branch 'execution_separation_initial' into execution-separate-o…

56eccdc

…ver-valiation

containers: add syncmap

fae1f0f

stopwaiter: optimize iterative 0-time call

86a734e

initial separation of execution from validation

3438753

tracker: add information for AccumulatorNotFound

48afd4b

notfy block recorder of reorgs

fb465a1

validateResult api fixes

03afe10

arbnode: small fixes

93da3b6

block_validator sorting and fixes

7b44435

validator prints logs

7d2aeac

staker: more fixes

ed05af0

validator node in system tests

ad5dce9

blockvalidator tests: multiple txs in batch

d7776fe

validation fixes

d1a60e5

Merge branch 'execution_separation_initial' into execution-separate-o…

19bc3e9

…ver-valiation

Merge branch 'execution_separation_initial' into execution-separate-o…

1a0c8b5

…ver-valiation

Merge branch 'execution_separation_initial' into execution-separate-o…

01141e3

…ver-valiation

fix challenges FindGlobalStateFromMessageCount

796cb6a

stateless_block_validator: use recorder interface

be93c1c

validation_mock improvements

31ffc5c

create mock recorder, store execution run requests

full challenge test: add mocks for various pos-in-batch

30eac54

Merge branch 'execution_separation_initial' into execution-separate-o…

eca1bc0

…ver-valiation

cla-bot bot added the s label Mar 24, 2023

tsahee marked this pull request as draft March 24, 2023 17:22

tsahee changed the title ~~Execution separate 2: separate execution from validation~~ Execution service 2: separate execution from validation Mar 24, 2023

tsahee added 2 commits March 25, 2023 08:10

testChallenge: fix parallelism

69a3f66

dont run MockChallenge tests with race detection

f43c5a1

tsahee and others added 19 commits June 29, 2023 17:25

batch_validator: fix messages when not caught up

63f7063

validation: manage room in client and not server

789f21f

block_validator: call launch in main thread

ebbc572

Merge pull request #1728 from OffchainLabs/validation_count_client

74cffb1

Validation count client

pruner: deleteFromRange return uint64

e96d779

Also solving a bug that caused to return identical first and last

message pruner updates

18e8365

call from staker don't prune batches required for reports

message_pruner: min-batches-left

63df5ae

pruner: fix config options

c09cd9e

calliterativelywith: avoid overhead for duration 0

e193ac0

Merge branch 'execution-separate-over-valiation' into pruner_fixes

9c435ac

CallIterativelyWith: fix trigger val if duration is 0

947f8f7

message pruner: fix bug checking if enough batches left

5b18e37

pruner: minor fixes following review comments

89fda39

staker: move block_validator into notifiers

e7030af

update geth, add recordingDb config

b56c734

message_pruner: dont prune batchmetadata

0f3eb15

validator: dont warn when catching up

3b91538

Merge branch 'pruner_fixes' into recordingdb_features

7465846

block_validator bugfix: delete validation entry when done

9be0397

PlasmaPower reviewed Jul 8, 2023


tsahee and others added 7 commits July 10, 2023 09:00

Merge pull request #1732 from OffchainLabs/pruner_fixes

b9b400a

Message pruner fixes

Merge pull request #1741 from OffchainLabs/recordingdb_features

e61465a

recordingDb: add config, metrics, and size limits

Merge pull request #1747 from OffchainLabs/pruner_fixes

a2bcb67

Pruner fixes

block_validator: don't try to read non-existing batch

266948d

block_validator: fixin PR review comments

67af3b3

Merge remote-tracking branch 'origin/master' into execution-separate-…

314bd0c

…over-valiation

block_validator: add missing error check

902f2ee

PlasmaPower approved these changes Jul 10, 2023


PlasmaPower merged commit 7bba01f into master Jul 10, 2023

PlasmaPower deleted the execution-separate-over-valiation branch July 10, 2023 20:40
