Thursday, May 1, 2025

The impact of innodb_doublewrite_pages in MySQL 8.0.41

After reading a blog post from JFG on changes to innodb_doublewrite_pages and bug 111353, I wanted to understand their impact on the Insert Benchmark using a large server.

I test the impact of:

  • using a larger (non-default) value for innodb_doublewrite_pages
  • disabling the doublewrite buffer

tl;dr

  • Using a larger value for innodb_doublewrite_pages improves QPS by up to 10%.
  • Disabling the InnoDB doublewrite buffer is great for performance, but bad for durability, because without it a torn (partially written) page after a crash might not be recoverable. I don't suggest you do this in production.

Builds, configuration and hardware

I compiled upstream MySQL 8.0.41 from source.

The server is an ax162-s from Hetzner with a 48-core AMD CPU (SMT disabled) and 128G of RAM. It uses Ubuntu 22.04 and storage is ext4 using SW RAID 1 over 2 locally attached NVMe devices. More details on it are here. At list prices, a similar server from Google Cloud costs about 10X more than from Hetzner.

The MySQL configuration files are:
  • cz11a_c32r128 - the base configuration file; it does not set innodb_doublewrite_pages, so it gets the default (innodb_doublewrite_pages=8)
  • cz11e_c32r128 - adds innodb_doublewrite_pages=128 to the base config
  • cz11f_c32r128 - adds innodb_doublewrite=0 to the base config (disables doublewrite)
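
To confirm what a running server uses, the settings can be checked from SQL. This is a minimal sketch; the cz11e/cz11f lines in the comments are the deltas from the base config described above:

  -- list the doublewrite settings in effect on the running server
  SHOW GLOBAL VARIABLES LIKE 'innodb_doublewrite%';
  -- cz11e_c32r128 adds to my.cnf: innodb_doublewrite_pages=128
  -- cz11f_c32r128 adds to my.cnf: innodb_doublewrite=0
  -- note: innodb_doublewrite_pages is not dynamic, so it must be set
  -- in my.cnf before the server starts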

The Benchmark

The benchmark is explained here and is run with 20 clients and a table per client for an IO-bound workload. The database is larger than memory: 20 tables with 200M rows per table, or 4B rows in total, versus 128G of RAM.

The benchmark steps are:

  • l.i0
    • insert 200 million rows per table in PK order. The table has a PK index but no secondary indexes. There is one connection per client.
  • l.x
    • create 3 secondary indexes per table. There is one connection per client.
  • l.i1
    • use 2 connections/client. One inserts 4M rows per table and the other does deletes at the same rate as the inserts. Each transaction modifies 50 rows (big transactions). This step is run for a fixed number of inserts, so the run time varies depending on the insert rate.
  • l.i2
    • like l.i1 but each transaction modifies 5 rows (small transactions) and 1M rows are inserted and deleted per table.
    • Wait for X seconds after the step finishes to reduce variance during the read-write benchmark steps that follow. The value of X is a function of the table size.
  • qr100
  • use 3 connections/client. One does range queries and performance is reported for this. The second does 100 inserts/s and the third does 100 deletes/s. The second and third are less busy than the first. The range queries use covering secondary indexes (see the sketch after this list). This step is run for 1800 seconds. If the target insert rate is not sustained then that is considered to be an SLA failure. If the target insert rate is sustained then the step does the same number of inserts for all systems tested.
  • qp100
    • like qr100 except uses point queries on the PK index
  • qr500
    • like qr100 but the insert and delete rates are increased from 100/s to 500/s
  • qp500
    • like qp100 but the insert and delete rates are increased from 100/s to 500/s
  • qr1000
    • like qr100 but the insert and delete rates are increased from 100/s to 1000/s
  • qp1000
    • like qp100 but the insert and delete rates are increased from 100/s to 1000/s
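
To make the query steps concrete, below is a hypothetical sketch of the two query shapes. The table and column names are made up and the benchmark's actual SQL may differ:

  -- qp100, qp500, qp1000: point query on the PK index
  SELECT * FROM t1 WHERE pk_id = 12345;
  -- qr100, qr500, qr1000: short range query answered from a covering
  -- secondary index, so no lookup back to the PK index is needed
  SELECT k1, pk_id FROM t1 WHERE k1 BETWEEN 100 AND 200 LIMIT 10;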

Results: overview

The performance report is here.

The summary section in the performance report has 3 tables. The first shows absolute throughput for each config and benchmark step. The second shows throughput relative to the result in the first row of the table, which here is the base config. The third shows the background insert rate for benchmark steps that have background inserts; all systems sustained the target rates. The second table makes it easy to see the impact of the config changes, and the third makes it easy to see which configs failed to meet the SLA.

Below I use relative QPS to explain how performance changes. It is (QPS for $me / QPS for $base), where $me is the result with the cz11e_c32r128 or cz11f_c32r128 config and $base is the result from the cz11a_c32r128 config. The configs are explained above: cz11e_c32r128 increases innodb_doublewrite_pages and cz11f_c32r128 disables the doublewrite buffer.

When relative QPS is > 1.0 then performance improved relative to the base config. When it is < 1.0 then there is a regression. The Q in relative QPS measures:
  • insert/s for l.i0, l.i1, l.i2
  • indexed rows/s for l.x
  • range queries/s for qr100, qr500, qr1000
  • point queries/s for qp100, qp500, qp1000
Below I use colors to highlight the relative QPS values with red for <= 0.95, green for >= 1.05 and grey for values between 0.95 and 1.05.
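
As a worked example with made-up numbers: if the base config sustains 1000 point queries/s on a step and cz11f_c32r128 sustains 1420, then relative QPS = 1420 / 1000 = 1.42, which is >= 1.05 and gets green.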

Results: more IO-bound

The performance summary is here.

From the cz11e_c32r128 config that increases innodb_doublewrite_pages to 128:
  • the impact on write-heavy steps is mixed: create index (the l.x step) was ~7% slower and l.i2 was ~10% faster
  • the impact on range query + write steps is small: the improvements were 0%, 0% and 4% for qr100, qr500 and qr1000. Note that these steps are not as IO-bound as the point query + write steps because the range queries do only ~0.3 reads per query (see here).
  • the impact on point query + write steps is larger: the improvements were 3%, 8% and 9% for qp100, qp500 and qp1000. These benchmark steps are much more IO-bound than the steps that do range queries.
From the cz11f_c32r128 config that disables the InnoDB doublewrite buffer:
  • the impact on write-heavy steps ranges from 1% to 36% faster
  • the impact on range query + write steps is mostly small: the improvements were 0%, 2% and 15% for qr100, qr500 and qr1000. As above, these steps are less IO-bound because the range queries do only ~0.3 reads per query (see here).
  • the impact on point query + write steps is larger: the improvements were 14%, 41% and 42% for qp100, qp500 and qp1000.
