Awesome distributed transactions

A curated selection of distributed transactions protocols

Highly Available Transactions

Transactional Causal Consistency

Cure: Strong semantics meets high availability and low latency

Stronger Semantics for Low-Latency Geo-Replicated Storage (aka Eiger)

Don’t Settle for Eventual: Scalable Causal Consistency for Wide-Area Storage with COPS

Stronger than read committed (MAV & RA)

MAV (Monotonic Atomic View) provides "cut isolation" and atomicity. RA (Read Atomic) is MAV that additionally prevents fractured reads.

Highly Available Transactions: Virtues and Limitations

Scalable Atomic Visibility with RAMP Transactions
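To make "fractured reads" concrete, here is a minimal sketch of a checker in the spirit of the definition used in the RAMP paper: a reader sees one write of a transaction but an older version of another key that the same transaction also wrote. The `Version` type, its fields, and `has_fractured_read` are hypothetical names chosen for illustration, and the sketch assumes every writing transaction carries a single totally ordered timestamp.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Version:
    """One version of a key (hypothetical structure, for illustration only)."""
    key: str
    ts: int               # timestamp of the transaction that wrote this version
    siblings: frozenset   # every key written by that same transaction


def has_fractured_read(read_set):
    """Return True if the versions read by ONE transaction are fractured.

    read_set maps key -> Version, as observed by a single reader.
    """
    for v in read_set.values():
        for sibling_key in v.siblings:
            other = read_set.get(sibling_key)
            # The reader saw v's writer on v.key, but an older version of
            # another key that the same writer also wrote.
            if other is not None and other.ts < v.ts:
                return True
    return False


# T1 (ts=1) writes x and y; T2 (ts=2) overwrites both.
keys = frozenset({"x", "y"})
x1, y1 = Version("x", 1, keys), Version("y", 1, keys)
x2, y2 = Version("x", 2, keys), Version("y", 2, keys)

atomic_view = {"x": x2, "y": y2}      # all of T2's writes
fractured_view = {"x": x2, "y": y1}   # T2's x together with T1's y
```

Under these assumptions `has_fractured_read(fractured_view)` reports a violation, while `atomic_view` passes. A real system would attach the sibling metadata at write time instead of constructing it by hand.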

Weaker than snapshot isolation levels (PSI, PC-PSI, NMSI)

Compared to snapshot isolation (SI), PSI, PC-PSI, and NMSI allow the "long fork" anomaly.

Database Replication Using Generalized Snapshot Isolation

Transactional storage for geo-replicated systems (aka Walter, PSI)

I Can’t Believe It’s Not Causal! Scalable Causal Consistency with No Slowdown Cascades (aka Occult, PC-PSI)

Non-Monotonic Snapshot Isolation (aka Jessy, NMSI)

Blotter: Low Latency Transactions for Geo-Replicated Storage (NMSI)
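The "long fork" anomaly mentioned at the top of this section can be illustrated with a small sketch. Two concurrent transactions write different keys, and two readers observe them in opposite orders: one sees only the first write, the other sees only the second. The check below rests on one simplifying assumption: under SI every snapshot is a prefix of a single commit order, so any two snapshots must be comparable by set inclusion. The function name and the representation of a snapshot as a set of transaction ids are hypothetical.

```python
def has_long_fork(snapshots):
    """Return True if two snapshots disagree on the order of commits.

    snapshots is a list of sets; each set holds the ids of the writer
    transactions visible to one reader.
    """
    for i, a in enumerate(snapshots):
        for b in snapshots[i + 1:]:
            # Neither snapshot contains the other, so no single commit
            # order can explain both of them.
            if not (a <= b or b <= a):
                return True
    return False


# T1 writes x and T2 writes y, concurrently.
reader_3 = {"T1"}          # sees T1's write but not T2's
reader_4 = {"T2"}          # sees T2's write but not T1's
reader_5 = {"T1", "T2"}    # sees both writes
```

Here `has_long_fork([reader_3, reader_4])` is true, which SI forbids and the weaker levels above permit, while `has_long_fork([reader_3, reader_5])` is false.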

Snapshot isolation

Large-scale Incremental Processing Using Distributed Transactions and Notifications (aka Percolator)

Serializability

Notes on Data Base Operating Systems (aka 2PC, two-phase commit, it all started there)

Let's put the 2PC coordinator on a consensus protocol (Paxos or Raft):

CockroachDB: The Resilient Geo-Distributed SQL Database (Parallel Commits over Raft)

Spanner: Google’s Globally-Distributed Database (2PC over Paxos)
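For readers new to two-phase commit, the protocol can be sketched in a few lines. This is an in-memory illustration only: `Participant` and `two_phase_commit` are hypothetical names, and the sketch deliberately omits durable logging, timeouts, and coordinator failure. Those omissions are the gap that replicating the coordinator state through consensus is meant to close.

```python
class Participant:
    """A resource manager that votes in phase 1 (hypothetical, in-memory)."""

    def __init__(self, name, will_vote_yes=True):
        self.name = name
        self.will_vote_yes = will_vote_yes
        self.state = "init"

    def prepare(self):
        # A yes vote is a promise to be able to commit later.
        self.state = "prepared" if self.will_vote_yes else "aborted"
        return self.will_vote_yes

    def commit(self):
        self.state = "committed"

    def abort(self):
        self.state = "aborted"


def two_phase_commit(participants):
    """Run both phases and return the decision, "commit" or "abort"."""
    # Phase 1: ask every participant to prepare and collect the votes.
    votes = [p.prepare() for p in participants]
    decision = "commit" if all(votes) else "abort"

    # Phase 2: broadcast the decision to every participant.
    for p in participants:
        if decision == "commit":
            p.commit()
        else:
            p.abort()
    return decision


all_yes = [Participant("a"), Participant("b")]
one_no = [Participant("a"), Participant("b", will_vote_yes=False)]
```

Calling `two_phase_commit(all_yes)` commits on every participant, while `two_phase_commit(one_no)` aborts everywhere, including on the participant that voted yes.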

Concurrency control schemes:

Deterministic ordering:

Blending of replication with transaction processing

Special hardware

The End of a Myth: Distributed Transactions Can Scale (aka NAM-DB, SI)

No compromises: distributed transactions with consistency, availability, and performance (aka FaRM, serializability)

FaSST: Fast, Scalable and Simple Distributed Transactions with Two-Sided (RDMA) Datagram RPCs

Eris: Coordination-Free Consistent Transactions Using In-Network Concurrency Control (serializability, independent transactions, deterministic transaction, OLLP)

Limits of distributed transactions

The SNOW Theorem and Latency-Optimal Read-Only Transactions

Causal Consistency and Latency Optimality: Friend or Foe?

Distributed Transactional Systems Cannot Be Fast

Distributed transactional reads: the strong, the quick, the fresh and the impossible

Testing

Elle: Inferring Isolation Anomalies from Experimental Observations
