Simplify info client usage #68

cam-schultz · 2023-10-24T18:05:52Z

Why this should be merged

Currently the P-Chain API URL passed in the config is used for two purposes:

To get the canonical validator set for a subnet
To establish peers in the app request network via the info API
The latter case requires that the info API expose the GetNodeIP method on the node. For the public RPC (api.avax.network), this method is disabled.

How this works

Removes the calls to the methods GetNodeIP and GetNodeID on the configured P-Chain API node. Instead, we get the list of peer IPs and IDs from GetPeers, which is enabled on the public RPC server.

Also fixes an edge case in which it's possible to connect to fewer than the intended number of peers on startup.

How this was tested

CI

How is this documented

N/A

michaelkaplan13

🙌 This is a great improvement to make. The changes all LGTM, but do want to note that when trying to use the public API in the relayer configuration now with these changes, I hit the follow error when it is attempting to relay a message:

{"level":"error","timestamp":"2023-10-24T14:46:09.656-0400","logger":"awm-relayer","caller":"relayer/message_relayer.go:411","msg":"Failed to get the canonical subnet validator set","subnetID":"1M439meCk9SDQYJSrDBJJeRntk1FtmSaaZakSeLFvcJonxGUM","error":"failed to fetch validator set (P-Chain Height: 123702, SubnetID: 1M439meCk9SDQYJSrDBJJeRntk1FtmSaaZakSeLFvcJonxGUM): failed to decode client response: the method platform.getValidatorsAt is not available"}

I think these changes are self-standing and still worth moving forward with as is though.

minghinmatthewlam · 2023-10-24T19:19:24Z

peers/app_request_network.go

+		)
+		return nil, nil, err
+	}
+	if len(beaconIPs) < numInitialTestPeers {


why do we need to limit the number of peers to 5 again? IIUC when we do the signature aggregation process it could take longer in terms of us sending out more signature requests, but would increase likelihood of having enough stake threshold in the peers. Are 5 peers generally enough for exceeding stake threshold?

This is just the list of initial peers to manually connect to in order to initialize the app request network. When requesting signatures, we connect to all peers from which we are requesting signatures.

minghinmatthewlam · 2023-10-24T19:22:33Z

separate of PR changes, but was looking at the relayer's network manual track. Supposedly the network will keep attempting the connection with exponential backoffs, should we look at timing out this operation?

minghinmatthewlam

LGTM, left some questions for clarification

bernard-avalabs

Generally looks good to me. Left a few questions.

bernard-avalabs · 2023-10-25T14:31:00Z

peers/app_request_network.go

+			"Failed to find a full set of peers to connect to on startup",
+			zap.Int("connectedPeers", len(beaconIPs)),
+			zap.Int("expectedConnectedPeers", numInitialTestPeers),
+		)


Should we abort if the number of connectedPeers is 0?

In that case, we return an error a few lines above here.

bernard-avalabs · 2023-10-25T14:35:47Z

peers/app_request_network.go

+			"Failed to find a full set of peers to connect to on startup",
+			zap.Int("connectedPeers", len(beaconIPs)),
+			zap.Int("expectedConnectedPeers", numInitialTestPeers),
+		)
 	}

 	for i, beaconIDStr := range beaconIDs {


What if we are unable to connect to any of the initial test peers? Will we select another sample of peers?

That's a good question. As Matt mentioned above, ManuallyTrack will continuously attempt to connect. We should look into what the failure modes for that function call are, and consider how to handle them. Right now if we are unable to connect to any peers on initialization, we will attempt to connect to the subnet validators when relaying a message. So there is some form of redundancy, but an explicit resampling on startup would be more robust.

cam-schultz · 2023-10-25T18:31:11Z

separate of PR changes, but was looking at the relayer's network manual track. Supposedly the network will keep attempting the connection with exponential backoffs, should we look at timing out this operation?

As discussed here, I think we should first understand the failure modes of ManuallyTrack, then consider how to handle them.

cam-schultz added 2 commits October 24, 2023 17:20

only use peer ips

f50ed4b

ensure peer connection on startup

a0c3ca4

cam-schultz requested review from michaelkaplan13, minghinmatthewlam, gwen917, geoff-vball and bernard-avalabs as code owners October 24, 2023 18:05

cam-schultz mentioned this pull request Oct 24, 2023

separate info API cfg #67

Closed

michaelkaplan13 approved these changes Oct 24, 2023

View reviewed changes

minghinmatthewlam reviewed Oct 24, 2023

View reviewed changes

minghinmatthewlam approved these changes Oct 24, 2023

View reviewed changes

bernard-avalabs reviewed Oct 25, 2023

View reviewed changes

cam-schultz merged commit e178b46 into main Oct 25, 2023
7 checks passed

cam-schultz deleted the simplify-info-client-usage branch October 25, 2023 18:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Simplify info client usage #68

Simplify info client usage #68

cam-schultz commented Oct 24, 2023 •

edited

Loading

michaelkaplan13 left a comment

minghinmatthewlam Oct 24, 2023

cam-schultz Oct 25, 2023

minghinmatthewlam commented Oct 24, 2023

minghinmatthewlam left a comment

bernard-avalabs left a comment

bernard-avalabs Oct 25, 2023

cam-schultz Oct 25, 2023

bernard-avalabs Oct 25, 2023

cam-schultz Oct 25, 2023

cam-schultz commented Oct 25, 2023

Simplify info client usage #68

Simplify info client usage #68

Conversation

cam-schultz commented Oct 24, 2023 • edited Loading

Why this should be merged

How this works

How this was tested

How is this documented

michaelkaplan13 left a comment

Choose a reason for hiding this comment

minghinmatthewlam Oct 24, 2023

Choose a reason for hiding this comment

cam-schultz Oct 25, 2023

Choose a reason for hiding this comment

minghinmatthewlam commented Oct 24, 2023

minghinmatthewlam left a comment

Choose a reason for hiding this comment

bernard-avalabs left a comment

Choose a reason for hiding this comment

bernard-avalabs Oct 25, 2023

Choose a reason for hiding this comment

cam-schultz Oct 25, 2023

Choose a reason for hiding this comment

bernard-avalabs Oct 25, 2023

Choose a reason for hiding this comment

cam-schultz Oct 25, 2023

Choose a reason for hiding this comment

cam-schultz commented Oct 25, 2023

cam-schultz commented Oct 24, 2023 •

edited

Loading