Byzantine Fault Tolerant CRDTs
This work is mainly inspired by Martin Kleppmann's 2022 paper, "Making CRDTs Byzantine Fault Tolerant"[^1], and implements its ideas on top of a simplified Automerge implementation.
The goal is a working prototype that demonstrates, in simple code, the ideas behind:
- An Automerge-like CRDT
- How a primitive list CRDT can be composed to create complex CRDTs like JSON
- How to add Byzantine Fault Tolerance to arbitrary CRDTs (see the sketch below for the core idea)
Unlike most other CRDT implementations, I leave out many performance optimizations that would make the basic algorithm harder to understand.
Check out the accompanying blog post for this project!
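To make the Byzantine-fault-tolerance bullet above concrete, here is a minimal sketch of the kind of construction Kleppmann's paper describes: every op names its causal predecessors by hash and is signed by its author, so the ops form a content-addressed DAG and a Byzantine peer cannot present conflicting histories without producing different hashes. The names (`SignedOp`, `op_hash`, `verify`) are illustrative only, not this repository's actual API, and the sketch assumes the `sha2`, `ed25519-dalek` (v2 with the `rand_core` feature), and `rand` crates.

```rust
// Hedged sketch (not this repo's actual types): a signed, hash-chained op
// in the style of Kleppmann's BFT construction.
use ed25519_dalek::{Signature, Signer, SigningKey, Verifier, VerifyingKey};
use sha2::{Digest, Sha256};

/// An operation that names its causal predecessors by hash, so ops form a
/// content-addressed DAG: equivocating about history changes the hashes.
struct SignedOp {
    payload: Vec<u8>,     // the underlying CRDT operation, serialized
    preds: Vec<[u8; 32]>, // SHA-256 hashes of causally preceding ops
    author: VerifyingKey,
    signature: Signature, // signs hash(payload || preds)
}

fn op_hash(payload: &[u8], preds: &[[u8; 32]]) -> [u8; 32] {
    let mut h = Sha256::new();
    h.update(payload);
    for p in preds {
        h.update(p);
    }
    h.finalize().into()
}

impl SignedOp {
    fn new(key: &SigningKey, payload: Vec<u8>, preds: Vec<[u8; 32]>) -> Self {
        let digest = op_hash(&payload, &preds);
        Self {
            signature: key.sign(&digest),
            author: key.verifying_key(),
            payload,
            preds,
        }
    }

    /// A receiver recomputes the hash and checks the signature before
    /// applying the inner CRDT op; checking that every `preds` hash refers
    /// to an already-verified op is omitted here.
    fn verify(&self) -> bool {
        let digest = op_hash(&self.payload, &self.preds);
        self.author.verify(&digest, &self.signature).is_ok()
    }
}

fn main() {
    let key = SigningKey::generate(&mut rand::rngs::OsRng);
    let op = SignedOp::new(&key, b"insert 'a' after root".to_vec(), vec![]);
    println!("valid: {}", op.verify());
}
```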
Benchmarks
Although this implementation does not optimize for performance, it nonetheless performs quite well.
Benchmarks were run on a 2019 MacBook Pro with a 2.6GHz i7. Numbers are compared against Automerge, which reports its performance benchmarks here
| # Ops | Raw String (JS) | Ours (basic) | Ours (BFT) | Automerge (JS) | Automerge (Rust) |
|---|---|---|---|---|---|
| 10k | n/a | 0.081s | 1.793s | 1.6s | 0.047s |
| 100k | n/a | 9.321s | 38.842s | 43.0s | 0.597s |
| All (259k) | 0.61s | 88.610s | 334.960s | Out of Memory | 1.780s |
| Memory | 0.1MB | 27.6MB | 59.5MB | 880MB | 232.5MB |
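Assuming the `speed` bench target referenced in the Flamegraph section below is the benchmark used for these numbers, the timings can presumably be reproduced with `cargo bench --bench speed`.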
Flamegraph
To generate a flamegraph of where time is spent on macOS, run:
sudo cargo flamegraph --dev --root --bench speed
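Note: this assumes the `cargo-flamegraph` subcommand is installed (`cargo install flamegraph`); the `--root` flag is typically needed on macOS because the underlying dtrace profiler requires root privileges.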
Further Work
This is mostly a learning/instructional project, but there are a few places where performance improvements are obvious:
- This is backed by `std::Vec`, which isn't great for random inserts. Replace it with a B-tree or something else that provides better insert and find performance.
  - Diamond Types and Automerge (Rust) use a B-tree.
  - Yjs is backed by a doubly linked list and caches the last ~5-10 accessed locations (this assumes that most edits happen sequentially; seeks are rare).
  - (Funnily enough, the main performance hit is dominated by `find` and not `insert`; see this flamegraph.)
- Avoid calling `find` so many times. A few Automerge optimizations that were not implemented:
  - Use an index hint (especially for local inserts).
  - Skip the second `find` operation in `integrate` if the sequence number is already larger.
- Improve storage requirements. As of now, a single `Op` weighs in at over 168 bytes. This doesn't even fit in a single cache line!
- Implement 'transactions' for a group of changes that should be considered atomic.
  - This would also speed up Ed25519 signature verification time by batching (see the sketch after this list).
  - For example, a peer might create an atomic 'transaction' that contains a bunch of changes.
- Currently, each character is a single op. Similar to Yjs, we could combine runs of characters into larger entities, as André, Luc, et al.[^2] suggest.
- Implement proper persistence using SQLite or something similar.
- Compile the project to WASM and implement a transport layer so it can be used in the browser. Something similar to Yjs' WebRTC connector could work.
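On the batching point above: `ed25519-dalek` exposes a batch verifier (behind its `batch` feature) that checks many signatures in one pass, which is noticeably faster than verifying them one at a time. The sketch below is illustrative only; `SignedChange` and `verify_transaction` are hypothetical names, not this repository's API.

```rust
// Hedged sketch: how a 'transaction' of ops could amortize signature checks
// with ed25519-dalek's batch verifier (requires the crate's `batch` feature).
use ed25519_dalek::{Signature, Signer, SigningKey, VerifyingKey};

struct SignedChange {
    bytes: Vec<u8>, // serialized CRDT op
    author: VerifyingKey,
    signature: Signature,
}

/// Verify a whole batch of changes at once instead of one signature at a time.
fn verify_transaction(changes: &[SignedChange]) -> bool {
    let messages: Vec<&[u8]> = changes.iter().map(|c| c.bytes.as_slice()).collect();
    let signatures: Vec<Signature> = changes.iter().map(|c| c.signature.clone()).collect();
    let keys: Vec<VerifyingKey> = changes.iter().map(|c| c.author.clone()).collect();
    ed25519_dalek::verify_batch(&messages, &signatures, &keys).is_ok()
}

fn main() {
    let key = SigningKey::generate(&mut rand::rngs::OsRng);
    // Build a dummy 'transaction' of 1000 signed changes from one peer.
    let changes: Vec<SignedChange> = (0..1000)
        .map(|i| {
            let bytes = format!("op {i}").into_bytes();
            SignedChange {
                signature: key.sign(&bytes),
                author: key.verifying_key(),
                bytes,
            }
        })
        .collect();
    assert!(verify_transaction(&changes));
}
```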
Acknowledgements
Thank you to Nalin Bhardwaj for helping me with my cryptography questions and Martin Kleppmann for his teaching materials and lectures which taught me a significant portion of what I've learned about distributed systems and CRDTs.
Footnotes
[^1]: Kleppmann, Martin. "Making CRDTs Byzantine Fault Tolerant." Proceedings of the 9th Workshop on Principles and Practice of Consistency for Distributed Data. 2022.
[^2]: André, Luc, et al. "Supporting adaptable granularity of changes for massive-scale collaborative editing." 9th IEEE International Conference on Collaborative Computing: Networking, Applications and Worksharing. IEEE, 2013.