• Stars
    star
    9,339
  • Rank 3,831 (Top 0.08 %)
  • Language
  • Created almost 5 years ago
  • Updated 3 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

A list of learning materials to understand databases internals

Awesome Database Learning

A list of learning materials to understand databases internals, including but not limited to:

  • papers
  • blogs
  • courses
  • talks

Please submit a pull request if there is any material that you think should be included in this collection.

Table of Contents

Recommended Courses, Books and Talks

Courses

Books

Talks

Blogs

SQL & Relation Algebra

Courses:

Query Optimizer

Courses:

Blogs:

Planner Models

Blogs:

Papers:

Subquery Optimization

Blogs:

Papers:

Join Order Optimization

Papers:

Functional Dependency & Physical Properties

Thesis:

Papers:

Cost Model

Papers:

Statistics

Papers:

Books:

Query Execution

Courses:

Execution Framework

Papers:

Vectorization vs Compilization

Blogs:

Papers:

Join

Papers:

Hash Table

Courses:

Blogs:

Bloom Filter

Papers:

DDL

Relational Model

Blogs:

Codd's Rules

Blogs:

Relational Data Model

Blogs:

Relational Algebra

Blogs:

ER to Relational Model

Blogs:

SQL - Overview

Blogs:

Transaction

Isolation Levels

Blogs:

Papers:

Concurrency Control

Courses:

Papers:

Network

Courses:

Papers:

Storage

NoSQL Systems

Papers:

Buffer Management

Courses:

Papers:

Disk IO

Blogs:

Papers:

B-Tree

Blogs:

Courses:

Papers:

LSM-Tree

Papers:

Learned Indexes Structures

Papers:

Serializing & RPC

Data Partitioning

Blogs:

Papers:

Replication & Consistency

Blogs:

Papers:

Consensus

Technical report:

  • University of Cambridge Distributed consensus revised, a great paper about Consenssus especially Paxos and Paxos-Related algorithms, by Heidi Howard

Papers:

Scheduling

Blogs:

Papers:

Benchmark & Testing

Blogs:

Papers:

HTAP

Papers:

TLA+

Talks:

More Repositories

1

tidb

TiDB - the open-source, cloud-native, distributed SQL database designed for modern applications.
Go
37,200
star
2

talent-plan

open source training courses about distributed database and distributed systems
Rust
10,112
star
3

docs-cn

TiDB/TiKV/PD 中文文档
Shell
1,811
star
4

ossinsight

Analysis, Comparison, Trends, Rankings of Open Source Software, you can also get insight from more than 7 billion with natural language (powered by OpenAI). Follow us on Twitter: https://twitter.com/ossinsight
TypeScript
1,744
star
5

parser

A MySQL Compatible SQL Parser
Go
1,409
star
6

tidb-operator

TiDB operator creates and manages TiDB clusters running in Kubernetes.
Go
1,221
star
7

tiflash

The analytical engine for TiDB and TiDB Cloud. Try free: https://tidbcloud.com/free-trial
C++
944
star
8

tispark

TiSpark is built for running Apache Spark on top of TiDB/TiKV
Scala
883
star
9

failpoint

An implementation of failpoints for Golang.
Go
816
star
10

go-ycsb

A Go port of Yahoo! Cloud Serving Benchmark (YCSB)
Go
594
star
11

docs

TiDB database documentation. TiDB is an open-source, cloud-native, distributed, MySQL-Compatible database for elastic scale and real-time analytics. Try AI-powered Chat2Query free at : https://www.pingcap.com/tidb-serverless/
Python
584
star
12

dm

Data Migration Platform
Go
456
star
13

tiflow

This repo maintains DM (a data migration platform) and TiCDC (change data capture for TiDB)
Go
426
star
14

tiup

A component manager for TiDB
Go
417
star
15

ossinsight-lite

🚧[WIP] Yet another customizable free GitHub stats dashboard based on TiDB Serverless: https://ossinsight-lite.vercel.app, hand-drawn style.
TypeScript
383
star
16

presentations

367
star
17

tidb-docker-compose

Python
352
star
18

tidb-ansible

Python
326
star
19

tidb-binlog

A tool used to collect and merge tidb's binlog for real-time data backup and synchronization.
Go
292
star
20

tla-plus

TLA
292
star
21

tidb-tools

tidb-tools are some useful tool collections for TiDB.
Go
286
star
22

dumpling

Dumpling is a fast, easy-to-use tool written by Go for dumping data from the database(MySQL, TiDB...) to local/cloud(S3, GCP...) in multifarious formats(SQL, CSV...).
Go
281
star
23

community

TiDB community content
260
star
24

chaos

A test framework for linearizability check with Go
Go
206
star
25

tidb.ai

https://TiDB.AI is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage and LlamaIndex. Open source and free to use.
TypeScript
186
star
26

go-tpc

A toolbox to benchmark TPC workloads in Go
Go
177
star
27

tidb-dashboard

A Web UI for monitoring, diagnosing and managing the TiDB cluster.
TypeScript
175
star
28

kvproto

Protocol buffer files for TiKV
CMake
152
star
29

tidb-lightning

This repository has been moved to https://github.com/pingcap/br
Go
142
star
30

tipocket

A toolkit for testing TiDB
Go
141
star
31

blog-cn

Shell
126
star
32

br

A command-line tool for distributed backup and restoration of the TiDB cluster data
Go
123
star
33

tidb-dev-guide

A comprehensive development guide to help you be more and more familiar with the TiDB community and become an expert finally.
118
star
34

tidb-bench

A Simple Benchmark For TiDB
C
108
star
35

gdocwiki

A wiki based on Google Doc / Drive
TypeScript
102
star
36

tidb-map

A series of maps to help users and contributors
95
star
37

tipb

TiDB protobuf
CMake
92
star
38

style-guide

Style guide for PingCAP and TiKV code
80
star
39

benchmarksql

Unofficial mirror of benchmarksql on github
Java
79
star
40

go-randgen

a QA tool to random generate sql by bnf pattern
Go
75
star
41

mysql-tester

A Golang implementation of MySQL Test Framework
Go
63
star
42

weekly

57
star
43

tidb-prisma-vercel-demo

Virtual online bookstore application demo which you can find books of various categories and rate the books.
TypeScript
56
star
44

tiproxy

Go
56
star
45

advanced-statefulset

Go
55
star
46

blog

Python
49
star
47

docs-tidb-operator

Documentation for TiDB on Kubernetes. TiDB is an open-source, cloud-native, distributed, MySQL-Compatible database for elastic scale and real-time analytics. Try AI-powered Chat2Query free at : https://www.pingcap.com/tidb-serverless/
Python
47
star
48

tikv-client-lib-java

TiKV Java client library
Java
44
star
49

tidiff

A toolset to improve efficiency
Go
41
star
50

meetup

37
star
51

fn

Go
35
star
52

tidb-vector-python

TiDB Vector SDK for Python, including code examples. Join our Discord: https://discord.gg/XzSW23Jg9p
Python
35
star
53

tiunimanager

TiUniManager
Go
34
star
54

thirdparty-ops

This repo is used for the operation and maintenance of third party tools.
Python
32
star
55

dead-mans-switch

A bypass monitoring prober
Go
32
star
56

ng-monitoring

Go
31
star
57

django-tidb

TiDB dialect for Django
Python
30
star
58

tidb-course-201-lab

Lab scripts for the PingCAP training course: TiDB SQL for Developers.
Python
30
star
59

tidb-vision

TiDB data visualization
JavaScript
28
star
60

tidb-inspect-tools

Python
27
star
61

monitoring

Shell
26
star
62

activerecord-tidb-adapter

TiDB adapter for ActiveRecord, allows the use of TiDB as a backend for ActiveRecord and Rails apps.
Ruby
24
star
63

diag

A tool to collect diagnostic data from TiDB Clusters
Go
24
star
64

docs-dm

Documentation for the TiDB Data Migration (DM) tool in both English and Chinese.
Python
23
star
65

LinguFlow

LinguFlow, a low-code tool designed for LLM application development, simplifies the building, debugging, and deployment process for developers.
TypeScript
23
star
66

website-docs

The next generation of PingCAP Docs. Powered by Gatsby ⚛️.
TypeScript
22
star
67

book.tidb.net

JavaScript
22
star
68

kdt

Kernel Debug Toolkit
Shell
20
star
69

log

Go
17
star
70

octopus

A toolkit including many powerful distributed test tools
Go
15
star
71

Auto-GPT-TiDB-Serverless-Plugin

Python
15
star
72

errcode

Go
14
star
73

dbt-tidb

A dbt adapter for TiDB
Python
14
star
74

tidb_workload_analysis

Go
14
star
75

k8s-fluent-bit-stackdriver

Shell
11
star
76

website

The website of PingCAP. Powered by Gatsby ⚛️ and Rocket 🚀.
JavaScript
11
star
77

tpcc-mysql

forked from https://code.launchpad.net/~percona-dev/perconatools/tpcc-mysql
C
11
star
78

tidb-insight

Python
11
star
79

tidb-loadbalance

Java
10
star
80

tso

Timestamp Oracle
Go
9
star
81

tiunimanager-ui

A web UI for TiUniManager
TypeScript
9
star
82

tidb-ctl

TiDB Controller
Go
9
star
83

hackernews-insight

Chat to query Hacker News database, based on Auto-GPT and TiDB Cloud Serverless Database
TypeScript
9
star
84

tidb-cloud-backup

Go
8
star
85

wordpress-tidb-plugin

PHP
8
star
86

docs-appdev

Python
7
star
87

wordpress-tidb-docker

WordPress x TiDB Serverless Tier Cluster
Shell
7
star
88

tidb-academy-labs

6
star
89

etcdv3-gateway

Gateway for etcdv3
Go
6
star
90

tispark-test

C
6
star
91

sysutil

sysutil is a library which implementats the gRPC service Diagnostics and shares the diagnostics functions between TiDB and PD.
Go
6
star
92

sqlalchemy-tidb

Python
5
star
93

oasis

Python
5
star
94

homebrew-brew

Homebrew taps for TiDB
Ruby
5
star
95

mysqlrelay

Go
4
star
96

tidb-lmdb

lmdb as storage engine for tidb
Go
4
star
97

cloud-assets-utils

Cloud assets utils by PingCAP FE.
OCaml
4
star
98

mpdriver

MySQL Protocol Driver, used to record MySQL query commands..
Go
4
star
99

tidb-helper

Shell
3
star
100

vldb-boss-2018

Slides and links for VLDB BOSS 2018
3
star