chain: use neutrino filters to speed up bitcoind seed recovery #889

Roasbeef · 2023-09-22T01:14:03Z

In this commit, we use neutrino filters to speed up bitcoind seed
recovery. We use the recently created maybeShouldFetchBlock function
to check the filters to see if we need to fetch a block at all. This
saves us from fetching, decoding, then scanning the block contents if we
know nothing is present in them.

At this point, we can also further consolidate the FilterBlocks
methods between the two backends, as they're now identical.
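
As a rough illustration of the filter-first flow described above, the sketch below skips any block whose filter does not match. It is not the actual `maybeShouldFetchBlock` code: `blockMeta`, `scanBatch`, and the two callbacks are hypothetical names used only for this example.

```go
package main

import "fmt"

// blockMeta is a hypothetical stand-in for the per-block metadata in a
// rescan batch.
type blockMeta struct {
	Height int32
	Hash   string
}

// scanBatch walks a batch of blocks and only fetches those whose filter may
// match. The matches and fetch callbacks are assumptions for this sketch,
// not the real btcwallet API.
func scanBatch(blocks []blockMeta, matches func(blockMeta) (bool, error),
	fetch func(blockMeta) error) (int, error) {

	fetched := 0
	for _, blk := range blocks {
		ok, err := matches(blk)
		if err != nil {
			return fetched, fmt.Errorf("filter check failed: %w", err)
		}

		// Filters have no false negatives, so a miss means the block
		// cannot contain anything relevant and can be skipped.
		if !ok {
			continue
		}

		if err := fetch(blk); err != nil {
			return fetched, fmt.Errorf("block fetch failed: %w", err)
		}
		fetched++
	}

	return fetched, nil
}

func main() {
	blocks := []blockMeta{{1, "a"}, {2, "b"}, {3, "c"}}

	// Toy callbacks: only the block at height 2 "matches".
	n, err := scanBatch(
		blocks,
		func(b blockMeta) (bool, error) { return b.Height == 2, nil },
		func(b blockMeta) error { return nil },
	)
	fmt.Println("blocks fetched:", n, "err:", err)
}
```

The saving comes from the `continue` branch: a skipped block is never downloaded, decoded, or scanned.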

Another follow-up here would be to prefetch both filters and blocks. We know that we'll need to fetch every filter, so we can fetch them all in a single batched RPC request. Once we have the filters, we can mostly check them in parallel, but we need to mind the `FoundOutPoints` field: once we discover something like a change address we control, we want to watch it for subsequent spends.
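
A minimal sketch of that idea, under heavy simplification: filters are fetched concurrently up front, but matched in height order so that anything discovered in an earlier block is watched in later ones. A "filter" here is just a slice of strings rather than a GCS filter, and `prefetchFilters`, `scanInOrder`, and the `discover` callback are hypothetical names, not part of btcwallet.

```go
package main

import (
	"fmt"
	"sync"
)

// prefetchFilters fetches every filter up front, one goroutine per block.
// A real implementation might use a single batched RPC call instead.
func prefetchFilters(hashes []string,
	fetch func(string) ([]string, error)) ([][]string, error) {

	filters := make([][]string, len(hashes))
	errs := make([]error, len(hashes))

	var wg sync.WaitGroup
	for i, h := range hashes {
		wg.Add(1)
		go func(i int, h string) {
			defer wg.Done()
			// Each goroutine writes to its own slot only.
			filters[i], errs[i] = fetch(h)
		}(i, h)
	}
	wg.Wait()

	for _, err := range errs {
		if err != nil {
			return nil, err
		}
	}
	return filters, nil
}

// scanInOrder matches filters in height order. When a block matches, the
// items discovered in it (for example a change output) are added to the
// watch list before later blocks are checked.
func scanInOrder(filters [][]string, watch map[string]bool,
	discover func(idx int) []string) []int {

	var matched []int
	for i, filter := range filters {
		hit := false
		for _, item := range filter {
			if watch[item] {
				hit = true
				break
			}
		}
		if !hit {
			continue
		}

		matched = append(matched, i)
		for _, item := range discover(i) {
			watch[item] = true
		}
	}
	return matched
}

func main() {
	// Toy data: block "a" pays addr1, block "c" spends the change from "a".
	data := map[string][]string{
		"a": {"addr1"}, "b": {"other"}, "c": {"change1"},
	}
	filters, err := prefetchFilters(
		[]string{"a", "b", "c"},
		func(h string) ([]string, error) { return data[h], nil },
	)
	if err != nil {
		panic(err)
	}

	watch := map[string]bool{"addr1": true}
	matched := scanInOrder(filters, watch, func(i int) []string {
		if i == 0 {
			return []string{"change1"}
		}
		return nil
	})
	fmt.Println("matched block indexes:", matched)
}
```

With the initial watch list alone, the third block would not match. It is only picked up because the first block's discovery is applied before the third filter is checked, which is why the matching step cannot be fully parallel.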

chain/btcd.go

We also abstract how blocks are fetched in the first place, as bitcoind uses a different name for the RPC to fetch filters.

guggero

Needs a couple of fixes. Was able to get it running with a small patch and can confirm this massively speeds up scanning blocks.

My quick benchmark, scanning 2000 blocks took:
Without this PR: ~1m 44s
With this PR: ~26s

guggero · 2023-10-06T08:27:02Z

chain/btcd.go

@@ -197,6 +197,65 @@ func (c *RPCClient) BlockStamp() (*waddrmgr.BlockStamp, error) {
 	}
 }

+// fetchBlockFilter fetches the GCS filter for a block from the remote node.
+func (c *RPCClient) fetchBlockFilter(blkHash chainhash.Hash,


Hmm, I still can't get over this formatting tbh... Do you really prefer this over:

// fetchBlockFilter fetches the GCS filter for a block from the remote node.
func (c *RPCClient) fetchBlockFilter(
	blkHash chainhash.Hash) (*gcs.Filter, error) {

?

guggero · 2023-10-06T09:18:15Z

chain/bitcoind_client.go

+	}
+
+	resp, err := c.chainConn.client.RawRequest(
+		bitcoindFilterRPC, []json.RawMessage{jsonFilterReq},


This threw an error for me, looks like we need to JSON encode each field individually, not the whole message. This worked for me:

hash, err := json.Marshal(blkHash.String())
if err != nil {
	return nil, fmt.Errorf("cannot marshal hash: %w", err)
}
filterType, err := json.Marshal(bitcoindFilterType)
if err != nil {
	return nil, fmt.Errorf("cannot marshal filter type: %w", err)
}
resp, err := c.chainConn.client.RawRequest(
	bitcoindFilterRPC, []json.RawMessage{hash, filterType},
)
if err != nil {
	return nil, fmt.Errorf("cannot send request: %w", err)
}
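
The same point as a standalone sketch: each positional JSON-RPC parameter becomes its own `json.RawMessage`, rather than one message wrapping the whole request. `marshalParams` is a hypothetical helper and the hash is a placeholder string; `basic` is the filter type named in bitcoind's error message later in this review.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// marshalParams encodes each positional RPC parameter as its own JSON
// value. It is a hypothetical helper for illustration only.
func marshalParams(params ...interface{}) ([]json.RawMessage, error) {
	raw := make([]json.RawMessage, 0, len(params))
	for i, p := range params {
		b, err := json.Marshal(p)
		if err != nil {
			return nil, fmt.Errorf("cannot marshal param %d: %w", i, err)
		}
		raw = append(raw, b)
	}
	return raw, nil
}

func main() {
	// "abcd" is a placeholder, not a real block hash.
	params, err := marshalParams("abcd", "basic")
	if err != nil {
		panic(err)
	}

	// Each entry is a complete JSON value on its own.
	for i, p := range params {
		fmt.Printf("param %d: %s\n", i, p)
	}
}
```

The resulting slice has the shape that `RawRequest` is given in the patch above: one entry per positional argument.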

guggero · 2023-10-06T09:18:43Z

chain/bitcoind_client.go

+		// block, then we don't need to fetch it, as there're no false
+		// negatives.
+		if !shouldFetchBlock {
+			log.Infof("Skipping block height=%d hash=%v, no "+


This is very spammy. Maybe demote to debug?

guggero · 2023-10-06T09:21:08Z

chain/bitcoind_client.go

+			continue
+		}
+
+		log.Infof("Fetching block height=%d hash=%v",


This is quite useful to debug! I synced my mainnet node and found it very interesting that we seem to fetch around 90 blocks per 2000-block scanning batch. Is it possible that the filters have that high of a false positive rate? Because I used an xprv, I didn't have a birthday block encoded, so the scan started at height 481596 (segwit activation), where my wallet definitely didn't have any transactions yet...

Not sure if this is a property of the filters themselves or whether we can optimize something with how we construct the filter matcher on our side? Or maybe we have a weird value (e.g. all zero) that skews the matcher into more false positives?

guggero · 2023-10-06T09:30:58Z

chain/bitcoind_conn.go

+	//
+	// The getblockfilter call was added in version 19.0.0, so we return
+	// for versions >= 190000.
+	return info.Version >= 190000, nil


This just gives us whether bitcoind is new enough to have filters. But don't we also need to detect whether they are enabled? As far as I know, they aren't turned on by default in bitcoind (are they in btcd?).
I just tried this locally with the index disabled and got: `-1: Index is not enabled for filtertype basic`

guggero reviewed Sep 22, 2023

Roasbeef force-pushed the bitcoind-neutrino-filter-rescan branch from 6c9113e to a0b0db1 Compare September 29, 2023 20:29

Roasbeef added 2 commits September 29, 2023 15:39

chain: refactor btcd filter rescan into maybeShouldFetchBlock

b131910

We also abstract how blocks are fetched in the first place, as bitcoind uses a different name for the RPC to fetch filters.

Roasbeef force-pushed the bitcoind-neutrino-filter-rescan branch from a0b0db1 to 2cb1df2 Compare September 29, 2023 20:40

Roasbeef requested a review from guggero September 29, 2023 20:40

guggero reviewed Oct 6, 2023
