block.hash() unnecessarily expensive to call · mimblewimble/grin#3372

Repository metrics

Stars: (4,876 stars)
PR merge metrics: (Avg merge 6d 11h) (25 merged PRs in 30d)

Description

We call write() on Writeable when serializing things to bytes. Not just for serializing full data but also when we serialize during a call to hash().

Block hash -> BlockHeader hash -> ProofOfWork hash -> Proof hash

Every call to block.hash() or header.hash() delegates to the following code to produce the data to be hashed -

https://github.com/mimblewimble/grin/blob/098d25e5696d1d97e4fbfabb80f27063caccd0f3/core/src/pow/types.rs#L455-L472

We build the bitvec "on the fly" every time we call hash(). Ideally hash() on any type is as cheap as possible.

I suspect, but have no benchmarks or data to confirm, that this is relatively expensive over time. We make a lot of calls to block.hash() and header.hash().

Possible improvement -

Consider initializing the bitvec when we initially read the Proof instance.
Use this "cached" value when we write it in Hash serialization mode.
Ignore the "cached" value when writing full data.
Make sure nonces or edge_bits cannot be written to or modified in unexpected ways, leaving the cache stale.

Maybe we don't need to cache "hashes" in a general way (yet), maybe simply making block.hash() cheap will give us most of the benefit for the least effort.

Contributor guide

Research direction: Identify where the bitvec is built in Proof serialization and implement caching for hash mode. Ensure cache invalidation when nonces or edge bits change.
Tech stack: rust
Domain: blockchainbackend
Issue type: Performance
Difficulty: 2
Estimated time: 1-3 hours
Activity status: Active
Clarity: Clear
Prerequisites: RustGit
Newbie friendliness: 70

Repository metrics

Description

Contributor guide

Get fresh easy issues in your inbox.