feat(electrum): optimize merkle proof validation with batching #1957

Open
wants to merge 3 commits into master

Conversation

LagginTimes (Contributor)

Replaces #1908, originally authored by @Keerthi421.
Fixes #1891.

Description

This PR improves the Electrum client's Merkle proof validation performance, addressing the significant regression in BDK 1.1.0 where full sync time increased from 4s to 26s.

Key improvements:

  • Implemented batch processing for Merkle proof validations.
  • Added Merkle proof caching to prevent redundant network calls.
  • Optimized header handling with pre-fetching and reuse.
  • Modified core functions to use batch operations instead of individual calls.

Also adds reorg-safe eviction of stale proofs: before each Merkle batch, cached block hashes are checked against the current chain, highest height first, and on a mismatch the affected proofs are evicted until the fork point is reached.
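
As a rough illustration of the caching idea, the sketch below checks the cache before hitting the network. The free-function form, the error handling, and the assumption that GetMerkleRes is cloneable are illustrative, not the PR's exact code:

use std::{collections::HashMap, sync::Mutex};

use bitcoin::{BlockHash, Txid};
use electrum_client::{ElectrumApi, Error, GetMerkleRes};

fn fetch_merkle_cached<E: ElectrumApi>(
    inner: &E,
    merkle_cache: &Mutex<HashMap<(Txid, BlockHash), GetMerkleRes>>,
    txid: Txid,
    block_hash: BlockHash,
    height: usize,
) -> Result<GetMerkleRes, Error> {
    // Serve a previously fetched proof from the cache to avoid a network round trip.
    if let Some(res) = merkle_cache.lock().unwrap().get(&(txid, block_hash)) {
        return Ok(res.clone());
    }
    // Otherwise fetch the proof once and remember it for later requests.
    let res = inner.transaction_get_merkle(&txid, height)?;
    merkle_cache
        .lock()
        .unwrap()
        .insert((txid, block_hash), res.clone());
    Ok(res)
}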

Notes to the reviewers

The optimization approach focuses on three main areas:

  1. Reducing network round trips through batched Merkle proof requests.
  2. Minimizing redundant operations with a new Merkle proof cache.
  3. Improving header handling efficiency with pre-fetching.

The batch size is set to 100 as a balance between performance and memory usage. This value can be adjusted based on testing results.
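
For header pre-fetching, the chunking could look roughly like the sketch below, which groups requests through electrum_client's batch_block_header; the constant and helper names are illustrative rather than the PR's actual code:

use std::collections::HashMap;

use bitcoin::block::Header;
use electrum_client::{ElectrumApi, Error};

// Illustrative constant; 100 is the performance/memory trade-off discussed above.
const BATCH_SIZE: usize = 100;

fn prefetch_headers<E: ElectrumApi>(
    inner: &E,
    heights: &[u32],
) -> Result<HashMap<u32, Header>, Error> {
    let mut headers = HashMap::with_capacity(heights.len());
    // Request headers in fixed-size chunks so no single request grows unbounded.
    for chunk in heights.chunks(BATCH_SIZE) {
        let fetched = inner.batch_block_header(chunk.iter().copied())?;
        for (height, header) in chunk.iter().zip(fetched) {
            headers.insert(*height, header);
        }
    }
    Ok(headers)
}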

Changelog notice

  • New Merkle proof cache to prevent redundant network calls.
  • Batch processing for Merkle proof validations.
  • Performance tests to verify sync time improvements.

Checklists

All Submissions:

  • I've signed all my commits
  • I followed the contribution guidelines
  • I ran cargo fmt and cargo clippy before committing

New Features:

  • I've added tests for the new feature
  • I've added docs for the new feature

Bugfixes:

  • This pull request breaks the existing API
  • I've added tests to reproduce the issue which are now passing
  • I'm linking the issue being fixed by this PR

@LagginTimes LagginTimes requested a review from evanlinjin May 15, 2025 19:06
@LagginTimes LagginTimes self-assigned this May 15, 2025
@evanlinjin (Member) left a comment

Thanks for moving this forward.

This is not a full review, but I think it's enough to push this PR in a good direction.

Comment on lines +28 to +29
/// The Merkle proof cache
merkle_cache: Mutex<HashMap<(Txid, BlockHash), GetMerkleRes>>,

It will be more efficient if we cache anchors instead of GetMerkleRes here.
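
A possible shape for that, assuming the ConfirmationBlockTime anchor type from bdk_core and an illustrative struct name:

use std::{collections::HashMap, sync::Mutex};

use bdk_core::ConfirmationBlockTime;
use bitcoin::{BlockHash, Txid};

struct AnchorCache {
    /// Cache the validated anchor rather than the raw GetMerkleRes, so each
    /// proof is fetched and verified at most once and later hits skip both
    /// the network call and the validation work.
    anchors: Mutex<HashMap<(Txid, BlockHash), ConfirmationBlockTime>>,
}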

Comment on lines +566 to +592
/// Remove any proofs for blocks that may have been re-orged out.
///
/// Checks if the latest cached block hash matches the current chain tip. If not, evicts proofs
/// for blocks that were re-orged out, stopping at the fork point.
fn clear_stale_proofs(&self) -> Result<(), Error> {
    let mut cache = self.merkle_cache.lock().unwrap();

    // Collect one (height, old_hash) pair per proof.
    let mut entries: Vec<(u32, BlockHash)> = cache
        .iter()
        .map(|((_, old_hash), res)| (res.block_height as u32, *old_hash))
        .collect();

    // Sort descending and dedup so we only check each height once.
    entries.sort_unstable_by(|a, b| b.0.cmp(&a.0));
    entries.dedup();

    // Evict any stale proofs until fork point is found.
    for (height, old_hash) in entries {
        let current_hash = self.fetch_header(height)?.block_hash();
        if current_hash == old_hash {
            break;
        }
        cache.retain(|&(_txid, bh), _| bh != old_hash);
    }
    Ok(())
}

Reorgs don't happen that often so we won't have much "extra data". This method looks like it's O(n^2). Let's remove it.

Comment on lines +318 to 322
// Batch validate all collected transactions.
if !txs_to_validate.is_empty() {
    let proofs = self.batch_fetch_merkle_proofs(&txs_to_validate)?;
    self.batch_validate_merkle_proofs(tx_update, proofs)?;
}

Instead of having every populate_with_{} method call this internally, it will be more efficient and make more logical sense if we extract this so that we only call it at the end of full_scan and sync.

In other words, populate_with_{} should no longer fetch anchors. Instead, they should either mutate, or return a list of (Txid, BlockId) for which we try to fetch anchors for in a separate step.

It will be even better if full txs are fetched in a separate step too.
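
A rough sketch of that split, with all names here being placeholders rather than the crate's API: each populate step only reports which anchors it needs, and full_scan/sync merge those lists before a single batched fetch-and-validate pass (e.g. the batch_fetch_merkle_proofs / batch_validate_merkle_proofs pair from the diff above):

use bdk_core::BlockId;
use bitcoin::Txid;

/// Illustrative only: what a populate_with_{} step could hand back instead of
/// fetching anchors itself.
struct PopulateOutcome {
    /// Transactions observed during this pass, with the block each was
    /// reported confirmed in; anchors for these are resolved later.
    pending_anchors: Vec<(Txid, BlockId)>,
}

fn merge_pending(outcomes: Vec<PopulateOutcome>) -> Vec<(Txid, BlockId)> {
    // full_scan/sync would concatenate the pending lists from every populate
    // step, then fetch and validate all Merkle proofs in one batched pass.
    let mut pending: Vec<(Txid, BlockId)> = outcomes
        .into_iter()
        .flat_map(|o| o.pending_anchors)
        .collect();
    pending.sort_unstable();
    pending.dedup();
    pending
}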

Successfully merging this pull request may close these issues: Electrum client Performance issues