feat(op-batcher): altda->ethda failover #13

samlaf · 2024-11-07T21:19:27Z

Plan is to upstream this to op's repo. Created here to get review from the team first.

This PR builds on top of the feat--multiframe-altda-channel changes (already upstreamed to op's repo, waiting for review there)

It contains 2 commits:

failover test: 61d7578
failover logic: 5712618

Right now failover is done to calldata txs because that was trivial whereas failing over to blobs or their auto mode that switches between blobs and calldata would need a nontrivial refactor and some thinking. Not sure its worth putting effort into this atm given that the whole point of failover is that it should happen very rarely and also not last very long, but let me know if you guys think otherwise.

bxue-l2 · 2024-11-12T23:37:00Z

At the high level, it usually takes us about 1 hour to fix the problem

bxue-l2 · 2024-11-12T23:53:34Z

op-alt-da/damock.go

@@ -130,6 +134,10 @@ func (s *FakeDAServer) HandleGet(w http.ResponseWriter, r *http.Request) {

 func (s *FakeDAServer) HandlePut(w http.ResponseWriter, r *http.Request) {
 	time.Sleep(s.putRequestLatency)
+	if s.failoverCount > 0 {


what is the point to decrement failoverCount, then actually handle the put

Is it just to simplify testing?

bxue-l2 · 2024-11-13T00:01:55Z

op-e2e/system/altda/failover_test.go

+)
+
+// TestBatcher_FailoverToEthDA_FallbackToAltDA tests that the batcher will failover to ethDA
+// if the da-server returns 503, and then fallback to altDA once altDA is available again


we always try altda first for every dispersal, then retry for non-altDA after sufficient retry. Wording seems odd.

bxue-l2 · 2024-11-13T00:04:06Z

op-e2e/system/altda/failover_test.go

+
+	countEthDACommitment := uint64(0)
+
+	// Most likely, sequence of blocks will be: altDA, ethDA, ethDA, altDA, altDA, altDA.


why most likely? And why two ethDA in a row, because we set failoverCount=2, but why altDA in the beginning?

samlaf marked this pull request as draft November 7, 2024 21:19

samlaf force-pushed the samlaf/feat--op-batcher-altda-failover-to-ethda branch from 50e492c to 80728ec Compare November 8, 2024 14:50

test(altda): add test for altda->ethda failover

61d7578

samlaf force-pushed the samlaf/feat--op-batcher-altda-failover-to-ethda branch from adfa7ce to 61d7578 Compare November 11, 2024 06:16

feat(batcher): altda->ethda failover when altda is down

5712618

samlaf changed the base branch from develop to feat--multiframe-altda-channel November 11, 2024 07:07

samlaf marked this pull request as ready for review November 11, 2024 07:07

samlaf requested review from bxue-l2 and epociask November 11, 2024 07:08

bxue-l2 reviewed Nov 13, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(op-batcher): altda->ethda failover #13

feat(op-batcher): altda->ethda failover #13

samlaf commented Nov 7, 2024 •

edited

Loading

bxue-l2 commented Nov 12, 2024

bxue-l2 Nov 12, 2024

bxue-l2 Nov 13, 2024

bxue-l2 Nov 13, 2024

bxue-l2 Nov 13, 2024


		countEthDACommitment := uint64(0)

		// Most likely, sequence of blocks will be: altDA, ethDA, ethDA, altDA, altDA, altDA.

feat(op-batcher): altda->ethda failover #13

Are you sure you want to change the base?

feat(op-batcher): altda->ethda failover #13

Conversation

samlaf commented Nov 7, 2024 • edited Loading

bxue-l2 commented Nov 12, 2024

bxue-l2 Nov 12, 2024

Choose a reason for hiding this comment

bxue-l2 Nov 13, 2024

Choose a reason for hiding this comment

bxue-l2 Nov 13, 2024

Choose a reason for hiding this comment

bxue-l2 Nov 13, 2024

Choose a reason for hiding this comment

samlaf commented Nov 7, 2024 •

edited

Loading