Byzantine Fault Tolerant CRDTs
This work is mainly inspired by Martin Kleppmann's 2022 paper, "Making CRDTs Byzantine Fault Tolerant"[^1], and implements its ideas on top of a simplified Automerge implementation.
The goal is a working prototype that demonstrates, in simple code, the ideas behind:
- An Automerge-like CRDT
- How a primitive list CRDT can be composed to create complex CRDTs like JSON
- How to add Byzantine Fault Tolerance to arbitrary CRDTs (see the sketch below for the core idea)
Unlike most other CRDT implementations, I leave out many performance optimizations that would make the basic algorithm harder to understand.
Check out the accompanying blog post for this project!
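To make the Byzantine-fault-tolerance bullet above concrete, here is a minimal sketch of the kind of construction Kleppmann's paper describes: every op names its causal predecessors by hash and is signed by its author, so the ops form a content-addressed DAG and a Byzantine peer cannot present conflicting histories without producing different hashes. The names (`SignedOp`, `op_hash`, `verify`) are illustrative only, not this repository's actual API, and the sketch assumes the `sha2`, `ed25519-dalek` (v2 with the `rand_core` feature), and `rand` crates.

```rust
// Hedged sketch (not this repo's actual types): a signed, hash-chained op
// in the style of Kleppmann's BFT construction.
use ed25519_dalek::{Signature, Signer, SigningKey, Verifier, VerifyingKey};
use sha2::{Digest, Sha256};

/// An operation that names its causal predecessors by hash, so ops form a
/// content-addressed DAG: equivocating about history changes the hashes.
struct SignedOp {
    payload: Vec<u8>,     // the underlying CRDT operation, serialized
    preds: Vec<[u8; 32]>, // SHA-256 hashes of causally preceding ops
    author: VerifyingKey,
    signature: Signature, // signs hash(payload || preds)
}

fn op_hash(payload: &[u8], preds: &[[u8; 32]]) -> [u8; 32] {
    let mut h = Sha256::new();
    h.update(payload);
    for p in preds {
        h.update(p);
    }
    h.finalize().into()
}

impl SignedOp {
    fn new(key: &SigningKey, payload: Vec<u8>, preds: Vec<[u8; 32]>) -> Self {
        let digest = op_hash(&payload, &preds);
        Self {
            signature: key.sign(&digest),
            author: key.verifying_key(),
            payload,
            preds,
        }
    }

    /// A receiver recomputes the hash and checks the signature before
    /// applying the inner CRDT op; checking that every `preds` hash refers
    /// to an already-verified op is omitted here.
    fn verify(&self) -> bool {
        let digest = op_hash(&self.payload, &self.preds);
        self.author.verify(&digest, &self.signature).is_ok()
    }
}

fn main() {
    let key = SigningKey::generate(&mut rand::rngs::OsRng);
    let op = SignedOp::new(&key, b"insert 'a' after root".to_vec(), vec![]);
    println!("valid: {}", op.verify());
}
```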
Benchmarks
Although this implementation does not optimize for performance, it nonetheless performs quite well.
Benchmarks were run on a 2019 MacBook Pro with a 2.6GHz i7. Numbers are compared against Automerge, which reports its performance benchmarks here
| # Ops | Raw String (JS) | Ours (basic) | Ours (BFT) | Automerge (JS) | Automerge (Rust) |
|---|---|---|---|---|---|
| 10k | n/a | 0.081s | 1.793s | 1.6s | 0.047s |
| 100k | n/a | 9.321s | 38.842s | 43.0s | 0.597s |
| All (259k) | 0.61s | 88.610s | 334.960s | Out of Memory | 1.780s |
| Memory | 0.1MB | 27.6MB | 59.5MB | 880MB | 232.5MB |
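Assuming the `speed` bench target referenced in the Flamegraph section below is the benchmark used for these numbers, the timings can presumably be reproduced with `cargo bench --bench speed`.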
Flamegraph
To generate a flamegraph of where time is spent on macOS, run:
sudo cargo flamegraph --dev --root --bench speed
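Note: this assumes the `cargo-flamegraph` subcommand is installed (`cargo install flamegraph`); the `--root` flag is typically needed on macOS because the underlying dtrace profiler requires root privileges.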
Further Work
This is mostly a learning/instructional project, but there are a few places where performance improvements are obvious:
- This is backed by `std::Vec`, which isn't great for random inserts. Replace it with a B-tree or something else that provides better insert and find performance.
  - Diamond Types and Automerge (Rust) use a B-tree.
  - Yjs is backed by a doubly linked list and caches the last ~5-10 accessed locations (this assumes that most edits happen sequentially; seeks are rare).
  - (Funnily enough, the main performance hit is dominated by `find` and not `insert`; see this flamegraph.)
- Avoid calling `find` so many times. A few Automerge optimizations that were not implemented:
  - Use an index hint (especially for local inserts).
  - Skip the second `find` operation in `integrate` if the sequence number is already larger.
- Improve storage requirements. As of now, a single `Op` weighs in at over 168 bytes. This doesn't even fit in a single cache line!
- Implement 'transactions' for a group of changes that should be considered atomic.
  - This would also speed up Ed25519 signature verification time by batching (see the sketch after this list).
  - For example, a peer might create an atomic 'transaction' that contains a bunch of changes.
- Currently, each character is a single op. Similar to Yjs, we could combine runs of characters into larger entities, as André, Luc, et al.[^2] suggest.
- Implement proper persistence using SQLite or something similar.
- Compile the project to WASM and implement a transport layer so it can be used in the browser. Something similar to Yjs' WebRTC connector could work.
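On the batching point above: `ed25519-dalek` exposes a batch verifier (behind its `batch` feature) that checks many signatures in one pass, which is noticeably faster than verifying them one at a time. The sketch below is illustrative only; `SignedChange` and `verify_transaction` are hypothetical names, not this repository's API.

```rust
// Hedged sketch: how a 'transaction' of ops could amortize signature checks
// with ed25519-dalek's batch verifier (requires the crate's `batch` feature).
use ed25519_dalek::{Signature, Signer, SigningKey, VerifyingKey};

struct SignedChange {
    bytes: Vec<u8>, // serialized CRDT op
    author: VerifyingKey,
    signature: Signature,
}

/// Verify a whole batch of changes at once instead of one signature at a time.
fn verify_transaction(changes: &[SignedChange]) -> bool {
    let messages: Vec<&[u8]> = changes.iter().map(|c| c.bytes.as_slice()).collect();
    let signatures: Vec<Signature> = changes.iter().map(|c| c.signature.clone()).collect();
    let keys: Vec<VerifyingKey> = changes.iter().map(|c| c.author.clone()).collect();
    ed25519_dalek::verify_batch(&messages, &signatures, &keys).is_ok()
}

fn main() {
    let key = SigningKey::generate(&mut rand::rngs::OsRng);
    // Build a dummy 'transaction' of 1000 signed changes from one peer.
    let changes: Vec<SignedChange> = (0..1000)
        .map(|i| {
            let bytes = format!("op {i}").into_bytes();
            SignedChange {
                signature: key.sign(&bytes),
                author: key.verifying_key(),
                bytes,
            }
        })
        .collect();
    assert!(verify_transaction(&changes));
}
```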
Acknowledgements
Thank you to Nalin Bhardwaj for helping me with my cryptography questions and Martin Kleppmann for his teaching materials and lectures which taught me a significant portion of what I've learned about distributed systems and CRDTs.
Footnotes
[^1]: Kleppmann, Martin. "Making CRDTs Byzantine Fault Tolerant." Proceedings of the 9th Workshop on Principles and Practice of Consistency for Distributed Data. 2022.
[^2]: André, Luc, et al. "Supporting adaptable granularity of changes for massive-scale collaborative editing." 9th IEEE International Conference on Collaborative Computing: Networking, Applications and Worksharing. IEEE, 2013.