• Stars
    star
    340
  • Rank 124,317 (Top 3 %)
  • Language
    Perl
  • License
    Other
  • Created almost 9 years ago
  • Updated 5 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

PostgreSQL Automatic Failover: High-Availibility for Postgres, based on Pacemaker and Corosync.

PostgreSQL Automatic Failover

High-Availibility for Postgres, based on industry references Pacemaker and Corosync.

Description

Pacemaker is nowadays the industry reference for High Availability. In the same fashion than for Systemd, all Linux distributions moved (or are moving) to this unique Pacemaker+Corosync stack, removing all other existing high availability stacks (CMAN, RGManager, OpenAIS, ...). It is able to detect failure on various services and automatically decide to failover the failing resource to another node when possible.

To be able to manage a specific service resource, Pacemaker interact with it through a so-called "Resource Agent". Resource agents must comply to the OCF specification which define what they must implement (start, stop, promote, etc), how they should behave and inform Pacemaker of their results.

PostgreSQL Automatic Failover is a new OCF resource Agent dedicated to PostgreSQL. Its original wish is to keep a clear limit between the Pacemaker administration and the PostgreSQL one, to keep things simple, documented and yet powerful.

Once your PostgreSQL cluster built using internal streaming replication, PAF is able to expose to Pacemaker what is the current status of the PostgreSQL instance on each node: master, slave, stopped, catching up, etc. Should a failure occurs on the master, Pacemaker will try to recover it by default. Should the failure be non-recoverable, PAF allows the slaves to be able to elect the best of them (the closest one to the old master) and promote it as the new master. All of this thanks to the robust, feature-full and most importantly experienced project: Pacemaker.

For information about how to install this agent, see INSTALL.md.

Setup and requirements

PAF supports PostgreSQL 9.3 and higher. It has been extensively tested under CentOS 6 and 7 in various scenario.

PAF has been written to give to the administrator the maximum control over their PostgreSQL configuration and architecture. Thus, you are 100% responsible for the master/slave creations and their setup. The agent will NOT edit your setup. It only requires you to follow these pre-requisites:

  • slave must be in hot_standby (accept read-only connections) ;
  • the following parameters must be configured in the appropriate place :
    • standby_mode = on (for PostgreSQL 11 and before)
    • recovery_target_timeline = 'latest'
    • primary_conninfo wih application_name set to the node name as seen in Pacemaker.
  • these last parameters has been merged inside the instance configuration file with PostgreSQL 12. For PostgreSQL 11 and before, you must provide a recovery.conf template file.

When setting up the resource in Pacemaker, here are the available parameters you can set:

  • bindir: location of the PostgreSQL binaries (default: /usr/bin)
  • pgdata: location of the PGDATA of your instance (default: /var/lib/pgsql/data)
  • datadir: path to the directory set in data_directory from your postgresql.conf file. This parameter has same default than PostgreSQL itself: the pgdata parameter value. Unless you have a special PostgreSQL setup and you understand this parameter, ignore it
  • pghost: the socket directory or IP address to use to connect to the local instance (default: /tmp or /var/run/postgresql for DEBIAN)
  • pgport: the port to connect to the local instance (default: 5432)
  • recovery_template: only for PostgreSQL 11 and before. The local template that will be copied as the PGDATA/recovery.conf file. This file must not exist on any node for PostgreSQL 12 and after. (default: $PGDATA/recovery.conf.pcmk)
  • start_opts: Additional arguments given to the postgres process on startup. See "postgres --help" for available options. Useful when the postgresql.conf file is not in the data directory (PGDATA), eg.: -c config_file=/etc/postgresql/9.3/main/postgresql.conf
  • system_user: the system owner of your instance's process (default: postgres)
  • maxlag: maximum lag allowed on a standby before we set a negative master score on it. The calculation is based on the difference between the current xlog location on the master and the write location on the standby. (default: 0, which disables this feature)

For a demonstration about how to setup a cluster, see http://clusterlabs.github.io/PAF/documentation.html.

More Repositories

1

pacemaker

Pacemaker is an advanced, scalable High-Availability cluster resource manager
C
1,035
star
2

resource-agents

Combined repository of OCF agents from the RHCS and Linux-HA projects
Shell
491
star
3

pcs

Pacemaker command line interface and GUI
Python
249
star
4

libqb

libqb is a library providing high performance logging, tracing, ipc, and poll.
C
165
star
5

hawk

A web-based GUI for managing and monitoring the Pacemaker High-Availability cluster resource manager
JavaScript
151
star
6

crmsh

Command-line interface for High-Availability cluster management on GNU/Linux systems.
Python
131
star
7

fence-agents

Fence agents
Python
104
star
8

ha_cluster_exporter

Prometheus exporter for Pacemaker based Linux HA clusters
Go
79
star
9

striker

The Anvil! Intelligent Availabilityâ„¢ Platform - Striker UI and ScanCore decision engine
Perl
26
star
10

OCF-spec

C
20
star
11

cluster-glue

Reusable Cluster Components ("glue")
C
18
star
12

fence-virt

Virtual Machine I/O Fencing
12
star
13

hawk-apiserver

Next generation HA cluster interface
Go
12
star
14

pacemaker-1.0

Code for the older 1.0 series of Pacemaker
Graphviz (DOT)
12
star
15

pcs-web-ui

Web interface for pcs
TypeScript
10
star
16

go-pacemaker

Go API for the Pacemaker cluster resource manager
Go
7
star
17

clusterlabs-www

Content for clusterlabs.org
HTML
5
star
18

anvil

The Anvil! Intelligent Availabilityâ„¢ Platform, mark 3
Perl
5
star
19

high-laughability

Quotes from our HA overlords
4
star
20

nagios-agents-metadata

OCF metadata files to enable use of nagios agents in pacemaker.
3
star
21

anvil-external-deps

collection of scripts and spec files for Anvil!
Shell
2
star