  • Stars: 350
  • Rank 117,395 (Top 3 %)
  • Language: Python
  • License: Other
  • Created about 14 years ago
  • Updated about 2 years ago

Repository Details

Django DB router for stateful master-slave replication

SUMMARY

Django_replicated is a Django database router designed to support more or less automatic master-slave replication. It keeps an internal state that depends on user intent to read or to write into a database. Depending on this state it automatically uses the right database (master or slave) for all SQL operations.

INSTALLATION

  1. Install the django_replicated distribution using "python setup.py install".

  2. Add import of the default django_replicated settings into your settings.py:

    from django_replicated.settings import *
    
  3. In settings.py configure your master and slave databases in a standard way:

    DATABASES = {
        'default': {
            # ENGINE, HOST, etc.
        },
        'slave1': {
            # ENGINE, HOST, etc.
        },
        'slave2': {
            # ENGINE, HOST, etc.
        },
    }
    
  4. Teach django_replicated which databases are slaves:

    REPLICATED_DATABASE_SLAVES = ['slave1', 'slave2']
    

    The 'default' database is always treated as master.

  5. Configure a replication router:

    DATABASE_ROUTERS = ['django_replicated.router.ReplicationRouter']
    
  6. Configure how long a database is excluded from the list of available databases after an unsuccessful ping:

    REPLICATED_DATABASE_DOWNTIME = 20
    

    The default downtime value is 60 seconds.
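
The effect of the downtime setting can be pictured with a small sketch. This is not django_replicated's own code: the class name `DowntimeTracker`, its methods, and the injectable `clock` are hypothetical names chosen for illustration. Only the behaviour (a database is skipped for a fixed number of seconds after a failed ping) comes from the description above.

```python
import time


class DowntimeTracker:
    """Hypothetical helper: remembers when each database last failed a ping."""

    def __init__(self, downtime=60, clock=time.monotonic):
        self.downtime = downtime      # seconds, mirrors REPLICATED_DATABASE_DOWNTIME
        self.clock = clock            # injectable so the sketch is easy to test
        self.failed_at = {}           # alias -> time of the last failed ping

    def mark_failed(self, alias):
        """Record that a ping to `alias` has just failed."""
        self.failed_at[alias] = self.clock()

    def is_available(self, alias):
        """A database is available if it never failed or its downtime has passed."""
        failed = self.failed_at.get(alias)
        return failed is None or self.clock() - failed >= self.downtime

    def available(self, aliases):
        """Filter `aliases` down to the databases that may be used right now."""
        return [alias for alias in aliases if self.is_available(alias)]
```

With `DowntimeTracker(downtime=20)`, calling `mark_failed('slave1')` would make `available(['slave1', 'slave2'])` skip `'slave1'` for the next 20 seconds.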

USAGE

Django_replicated routes SQL queries into different databases based not only on their type (insert/update/delete vs. select) but also on its own current state. This is done to support the situation in which there are both writes and reads in a single logical operation. If the writes and reads used separate databases, the result would be inconsistent because:

  • when using transactions, the result of the writes will not be delivered to slaves until committed;
  • even in a non-transactional environment, there is always a certain lag before the updates reach slaves.
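
The lag described in the list above can be reproduced with a toy in-memory model. Nothing here touches a real database or django_replicated itself; `ToyMaster` and `ToyReplica` are hypothetical stand-ins that only show why a read sent to a slave right after a write may miss the new data.

```python
class ToyMaster:
    """In-memory stand-in for a master database."""

    def __init__(self):
        self.rows = {}
        self.pending = []             # changes not yet shipped to the replica

    def write(self, key, value):
        self.rows[key] = value
        self.pending.append((key, value))


class ToyReplica:
    """In-memory stand-in for a slave that only sees shipped changes."""

    def __init__(self):
        self.rows = {}

    def catch_up(self, master):
        # Apply everything the master has queued, then clear the queue.
        for key, value in master.pending:
            self.rows[key] = value
        master.pending.clear()


master, replica = ToyMaster(), ToyReplica()
master.write('user:1', 'alice')

stale = replica.rows.get('user:1')    # read routed to the slave: not there yet
fresh = master.rows.get('user:1')     # read routed to the master: visible

replica.catch_up(master)              # replication eventually delivers the change
after_sync = replica.rows.get('user:1')
```

Keeping both the write and the follow-up read on the master avoids the stale result.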

Django_replicated expects you to define what these logical operations are doing: writing/reading or only reading. Then it will try to use slave databases only for purely reading operations.

There are several methods to define those.

Middleware

If your project follows the HTTP principle that GET requests do not change the system (except through side effects), most of the work is done by simply adding a middleware:

MIDDLEWARE_CLASSES = [
    ...
    'django_replicated.middleware.ReplicationMiddleware',
    ...
]

The middleware sets replication state to use slaves during handling of GET and HEAD requests and to use a master otherwise.
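
The rule the middleware applies can be written as a single decision. The names `SAFE_METHODS` and `state_for_method` are hypothetical and exist only in this sketch; the real middleware's internals may differ.

```python
SAFE_METHODS = ('GET', 'HEAD')  # methods assumed to be read-only


def state_for_method(method):
    """Return 'slave' for read-only HTTP methods and 'master' otherwise."""
    return 'slave' if method.upper() in SAFE_METHODS else 'master'
```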

While this is usually enough, there are cases when DB access is not controlled explicitly by your business logic. Good examples are implicit creation of sessions on first access, writing bookkeeping info, or implicit registration of a user account somewhere inside the system. These things can happen at arbitrary moments, including during GET requests.

Generally, django_replicated handles this by always using the master database for write operations. If this is not enough (e.g., if you want to make sure a newly created session is read from the master), you can always instruct Django ORM to use a certain database.
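
Pinning a read to the master can be done with the `using()` method that Django managers and querysets provide. The helper below is a minimal sketch; `get_from_master` and `MASTER_ALIAS` are hypothetical names, not part of django_replicated.

```python
MASTER_ALIAS = 'default'  # django_replicated always treats 'default' as master


def get_from_master(manager, **lookup):
    """Fetch one object through `manager`, pinned to the master database."""
    return manager.using(MASTER_ALIAS).get(**lookup)
```

For the session example, this could be called as `get_from_master(Session.objects, session_key=key)`, with `Session` imported from `django.contrib.sessions.models`.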

Decorators

If reads and writes in your system do not map onto HTTP request methods, you can use decorators to put individual views into master or slave replication mode:

from django_replicated.decorators import use_master, use_slave

@use_master
def my_write_view(request, *args, **kwargs):
    # The master database is used for all DB operations during
    # execution of the view (unless explicitly overridden).
    ...

@use_slave
def my_read_view(request, *args, **kwargs):
    # Same, but with a slave connection.
    ...

GET after POST

There is a special case that needs addressing when working with an asynchronous replication scheme. Replicas can lag behind the master database in receiving updates. In practice, this means that after a POST form submission redirects to a page with the updated data, that page may be served from a slave replica that has not been updated yet. The user then gets the impression that the submit did not work.

To overcome this problem, both ReplicationMiddleware and the decorators support a special technique where handling of a GET request resulting from a redirect after a POST is explicitly routed to the master database.
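
The text above does not say how the redirect is recognised. One plausible way, shown here purely as an assumption, is a short-lived cookie set on the redirect response and consumed by the next request. The cookie name `just_updated` and both function names are hypothetical.

```python
FLAG = 'just_updated'  # hypothetical cookie name


def choose_state(method, cookies):
    """Pick 'master' for writes and for a GET that follows a redirected POST."""
    if method not in ('GET', 'HEAD'):
        return 'master'
    return 'master' if cookies.get(FLAG) == 'true' else 'slave'


def update_cookies(method, status_code, cookies):
    """Return the cookies to send back: set the flag on a redirect after a write."""
    cookies = dict(cookies)
    if method not in ('GET', 'HEAD') and status_code in (301, 302, 303):
        cookies[FLAG] = 'true'
    else:
        cookies.pop(FLAG, None)  # the flag is only good for one follow-up request
    return cookies
```

In this sketch the GET that immediately follows the redirect reads from the master, and the request after that goes back to a slave.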

Global overrides

In some cases, it might be necessary to override how the middleware chooses a target database based on the HTTP request method. For example, you might want to route certain POST requests to a slave if you know that the request handler does not do any writes. The settings variable REPLICATED_VIEWS_OVERRIDES maps view names (urlpatterns names), view import paths, or URL paths to database names:

REPLICATED_VIEWS_OVERRIDES = {
    'api-store-event': 'slave',
    'app.views.do_smthg': 'master',
    '/admin/*': 'master',
    '/users/': 'slave',
}
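
How such a mapping might be consulted is sketched below. The lookup order and the use of shell-style wildcards via `fnmatchcase` are assumptions made for illustration, and `override_for` is a hypothetical function, not the library's API.

```python
from fnmatch import fnmatchcase

OVERRIDES = {
    'api-store-event': 'slave',
    'app.views.do_smthg': 'master',
    '/admin/*': 'master',
    '/users/': 'slave',
}


def override_for(path, view_name=None, import_path=None, overrides=OVERRIDES):
    """Return the overriding state for a request, or None if nothing matches."""
    for key, state in overrides.items():
        # Keys may name a URL pattern or a view's dotted import path...
        if key in (view_name, import_path):
            return state
        # ...or be a URL path, optionally ending in a wildcard.
        if key.startswith('/') and fnmatchcase(path, key):
            return state
    return None
```

With the mapping above, `/admin/auth/user/` resolves to `'master'`, while a path with no matching entry returns `None` and falls back to the method-based rule.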

CHANGELOG

2.0 Backward incompatible changes

  • Default django_replicated.settings file was added.

  • Some settings variables were renamed:

      DATABASE_SLAVES -> REPLICATED_DATABASE_SLAVES
      DATABASE_DOWNTIME -> REPLICATED_DATABASE_DOWNTIME
    
  • Another setting variable was deleted:

      REPLICATED_SELECT_READ_ONLY
    
  • Router import path changed to django_replicated.router.ReplicationRouter.

  • Ability to disable state switching with utils.disable_state_change() was removed.

  • Database checkers moved to dbchecker.py module.

  • db_is_not_read_only check renamed to db_is_writable.

  • Added state checking before writes. Enabled by default.

  • Relations between objects in the same master-slave db set are now allowed.
