Benchmarking PostgreSQL on FreeBSD, CentOS, Ubuntu, Debian, and openSUSE

Hello, Habr! I present to you a translation of the article “PostgreSQL benchmark on FreeBSD, CentOS, Ubuntu Debian and openSUSE” by Martin Kováčik. It looks at how the PostgreSQL 10.1 DBMS performs under conditions close to real production deployments on several Unix-like systems.

In this post, I am going to show the test results of the recently released PostgreSQL 10.1. I checked the DB on these OSs (all 64-bit):

  • Ubuntu 16.04, kernel 4.10.0-38-generic
  • openSUSE 42.3, kernel 4.4.87-25-default
  • CentOS 7.4, kernel 3.10.0-693.2.2.el7.x86_64
  • Debian 9.2, kernel 4.9.0-4-amd64
  • FreeBSD 11.1

Test methodology


The goal of the test was to measure the performance of PostgreSQL in conditions similar to (typical) production deployments:

  • clients connect through a connection pooler to avoid constant reconnecting to the database (I did not use an actual connection pooler; I simply did not pass the -C flag to pgbench, so each client keeps one persistent connection; see the example after this list)
  • clients connect over the network, not through a Unix socket
  • the PostgreSQL data directory is located on a RAID 1 mirror
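
For illustration, the difference comes down to the -C flag; the host, user, and duration below are placeholder values, not the ones used in the benchmark:

# with -C, pgbench opens a new connection for every transaction:
pgbench -h 192.168.1.120 -U pgbench -C -c 10 -T 60 -S pgbench
# without -C, each client keeps a single persistent connection for the
# whole run, which behaves much like going through a connection pooler:
pgbench -h 192.168.1.120 -U pgbench -c 10 -T 60 -S pgbench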

For each of the tested OSs, a control database of ~74 GB was created:

pgbench -i -s 5000 pgbench
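
At scale factor 5000, pgbench_accounts contains 5000 × 100,000 = 500 million rows, which works out to roughly 74 GB on disk including indexes. As a sanity check (the host and user here are just examples), the resulting size can be confirmed from psql:

psql -h 192.168.1.120 -U pgbench -d pgbench \
  -c "SELECT pg_size_pretty(pg_database_size('pgbench'));"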

The test infrastructure consisted of two dedicated servers connected to a 1 Gbps network:

  • EX41-SSD: Intel i7-6700, 4 cores / 8 threads, 32 GB DDR4 RAM; used to generate SQL queries with pgbench
  • PX121-SSD: Intel Xeon E5-1650 v3, 6 cores / 12 threads, 256 GB DDR4 ECC RAM, 2 × 480 GB SATA 6 Gb/s data-center series SSDs; used as the PostgreSQL server

I was interested in these test combinations:

  • 32 GB read-only: read-only test (SELECTs only, no data modification), the data set does not fit into the PostgreSQL cache
  • 200 GB read-only: read-only test, the data set fits into the PostgreSQL cache
  • 32 GB TPC-B: read-write test, the data set does not fit into the PostgreSQL cache
  • 200 GB TPC-B: read-write test, the data set fits into the PostgreSQL cache

pgbench setup


The load was generated with pgbench 10.1 running on a separate FreeBSD 11.1 machine. The test script consisted of three parts: vacuum + warm-up, a read-only test, and a read-write test. Before each read-write test the pgbench tables were vacuumed (the -v flag). During the tests I gradually increased the number of clients accessing the database.

#!/bin/sh
THREADS=8
DURATION=1800
PGIP=192.168.1.120
# warmup
pgbench -h ${PGIP} -U pgbench -j ${THREADS} -c 10 -T ${DURATION} -S -v pgbench
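# read-only runs: -S restricts pgbench to SELECT-only transactions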
for clients in 1 10 20 30 40 50 60 70 80 90 100 110 120
do
  echo "RO ${clients}"
  pgbench -h ${PGIP} -U pgbench -j ${THREADS} -c ${clients} -T ${DURATION} -S pgbench > pgbench_ro_${clients}.log
done
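# read-write runs: default TPC-B-like transactions, tables vacuumed before each run (-v)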
for clients in 1 10 20 30 40 50 60 70 80 90 100 110 120
do
  echo "RW ${clients}"
  pgbench -h ${PGIP} -U pgbench -j ${THREADS} -c ${clients} -T ${DURATION} -v pgbench > pgbench_rw_${clients}.log
done

PostgreSQL Server Settings


For the Linux distributions, PostgreSQL was installed on an ext4 file system on a RAID 1 mirror (software RAID using mdraid) of two SSDs, with atime disabled. On FreeBSD, the OpenZFS file system was used on the same two SSDs in a mirrored (RAID 1) configuration. The ZFS dataset holding the PostgreSQL data was created with the following parameters:

zfs get recordsize,logbias,primarycache,atime,compression zroot/var/db/postgres
NAME                   PROPERTY      VALUE         SOURCE
zroot/var/db/postgres  recordsize    8K            local
zroot/var/db/postgres  logbias       throughput    local
zroot/var/db/postgres  primarycache  all           default
zroot/var/db/postgres  atime         off           inherited from zroot
zroot/var/db/postgres  compression   lz4           local
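
For reference, a dataset with these properties could be created roughly like this (a sketch, assuming the same zroot pool layout as above):

# atime=off is inherited from the zroot pool, primarycache stays at its default
zfs create -o recordsize=8K -o logbias=throughput -o compression=lz4 \
  zroot/var/db/postgres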

The PostgreSQL server configuration was the same on all operating systems except for file paths (each operating system uses its own directory layout). The contents of the postgresql.conf file (basic settings) for the 32 GB instance:

autovacuum = off
default_statistics_target = 100
maintenance_work_mem = 1GB
checkpoint_completion_target = 0.9
effective_cache_size = 24GB
work_mem = 104MB
wal_buffers = 16MB
shared_buffers = 8GB
max_connections = 300

The contents of the postgresql.conf file for the 200 GB instance:

autovacuum = off
default_statistics_target = 100
maintenance_work_mem = 2GB
checkpoint_completion_target = 0.9
effective_cache_size = 144GB
work_mem = 640MB
wal_buffers = 16MB
shared_buffers = 48GB
max_connections = 300
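
After editing postgresql.conf and restarting the server, the active values can be double-checked from psql (the host and user below are examples):

psql -h 192.168.1.120 -U pgbench -d pgbench \
  -c "SELECT name, setting, unit FROM pg_settings
      WHERE name IN ('shared_buffers', 'work_mem', 'effective_cache_size');"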

Benchmarking


I tested PostgreSQL on five different operating systems in two modes, read-only and TPC-B (read-write), with two different memory profiles. The test of each OS took about 30 hours (not counting the time needed to set up the OS). The results of each pgbench run were saved for later evaluation.
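
Each saved log ends with pgbench's summary block, so the TPS figures for all runs can be pulled out with a one-liner like this (assuming the file names produced by the script above):

grep -H "excluding connections" pgbench_ro_*.log pgbench_rw_*.log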

Results - Read Only










Results - TPC-B










Summary


The test showed that the difference between the various GNU/Linux distributions is not very significant. openSUSE 42.3 was the fastest operating system in the read-only test, while FreeBSD ran about 40% slower. Unfortunately, I did not figure out what caused such mediocre FreeBSD performance.

A more realistic picture of PostgreSQL performance emerged in the read-write (TPC-B) test. Among the GNU/Linux distributions, CentOS 7.4 was the fastest and Debian 9.2 the slowest. I was pleasantly surprised by FreeBSD 11.1, which ran more than twice as fast as the best Linux, even though FreeBSD used ZFS, a copy-on-write file system. I assumed this difference was caused by the overhead of software RAID on Linux, so I ran three more TPC-B tests with 100 concurrent clients, this time without software RAID:

  • FreeBSD 11.1 + UFS: 5623.86 TPS
  • FreeBSD 11.1 + ZFS: 8331.85 TPS
  • CentOS 7.4 + ext4: 8987.65 TPS

The results show the inefficiency of Linux software RAID (or the efficiency of ZFS RAID). The performance of CentOS 7.4 without software RAID is only slightly higher than that of FreeBSD 11.1 with ZFS RAID (for TPC-B with 100 concurrent clients).
