Thursday, June 27, 2024

The impact of link time optimization for MySQL with sysbench

This post has results to show the benefit from using link time optimization for MySQL. That is enabled via the CMake option -DWITH_LTO=ON.

tl;dr

  • A typical improvement is ~5% more QPS from link time optimization
  • On the small servers (PN53, SER4) the benefit from link-time optimization was larger for InnoDB than for MyRocks. On the medium server (C2D) the benefit was similar for MyRocks and InnoDB.
Builds

I used InnoDB from MySQL 8.0.37 and MyRocks from FB MySQL compiled on git sha 65644b82c which uses RocksDB 9.3.1 and was latest as of June 12, 2024. The compiler was gcc 11.4.0.

Hardware

I tested on three servers:
  • SER4 - Beelink SER 4700u (see here) with 8 cores and a Ryzen 7 4700u CPU 
  • PN53 - ASUS ExpertCenter PN53 (see here) with 8 cores and an AMD Ryzen 7 7735HS CPU
  • C2D - a c2d-highcpu-32 instance type on GCP (c2d high-CPU) with 32 vCPU and SMT disabled so there are 16 cores
All servers use Ubuntu 22.04 with ext4. 

Benchmark

I used sysbench and my usage is explained here. There are 42 microbenchmarks and most test only 1 type of SQL statement. The database is cached by MyRocks and InnoDB.

The benchmark is run with:
  • SER4, PN53 - 1 thread, 1 table and 30M rows
  • C2D - 12 threads, 8 tables and 10M rows per table
  • each microbenchmark runs for 300 seconds if read-only and 600 seconds otherwise
  • prepared statements were enabled
The command lines for my helper scripts were:
# PN53, SER4
bash r.sh 1 30000000 300 600 nvme0n1 1 1 1
# C2D
bash r.sh 8 10000000 300 600 md0 1 1 12

Results

For the results below I split the 42 microbenchmarks into 5 groups -- 2 for point queries, 2 for range queries, 1 for writes. For the range query microbenchmarks, part 1 has queries that don't do aggregation while part 2 has queries that do aggregation. The spreadsheet with all data and charts is here. For each group I present a chart and a table with summary statistics.

All of the charts have relative throughput on the y-axis where that is (QPS with LTO) / (QPS without LTO) and with LTO means link-time optimization was enabled. The y-axis doesn't start at 0 to improve readability. When the relative throughput is > 1 then that version of MySQL with link-time optimization is faster.

The legend under the x-axis truncates the names I use for the microbenchmarks and I don't know how to fix that other than sharing links (see above) to the Google Sheets I used.

Results: SER4

Summary statistics for MyRocks

minmaxavgmedian
point-10.981.081.031.03
point-21.031.051.031.03
range-10.901.051.001.02
range-20.981.051.021.02
writes1.031.101.061.07

Summary statistics for InnoDB

minmaxavgmedian
point-11.051.121.071.06
point-21.051.061.061.06
range-11.051.141.091.09
range-21.061.081.071.07
writes1.031.091.081.08

There are two charts per section -- the first for MyRocks, the second for InnoDB.

Point queries, part 1
Point queries, part 2
Range queries, part 1
Range queries, part 2
Writes

Results: PN53

Summary statistics for MyRocks

minmaxavgmedian
point-11.031.151.071.05
point-21.031.081.051.05
range-10.951.061.041.04
range-21.021.081.051.05
writes1.051.071.061.06

Summary statistics for InnoDB

minmaxavgmedian
point-11.061.201.101.08
point-21.061.091.071.07
range-11.061.071.061.06
range-21.021.051.041.04
writes1.011.081.051.06

There are two charts per section -- the first for MyRocks, the second for InnoDB. 

Point queries, part 1
Point queries, part 2
Range queries, part 1
Range queries, part 2
Writes

Results: C2D

Summary statistics for MyRocks

minmaxavgmedian
point-10.971.201.041.04
point-21.011.061.041.05
range-11.011.071.041.05
range-21.031.071.041.03
writes1.011.071.031.03

Summary statistics for InnoDB

minmaxavgmedian
point-11.041.061.051.05
point-21.041.051.041.04
range-11.041.061.051.04
range-21.011.051.031.04
writes1.011.041.031.03

There are two charts per section -- the first for MyRocks, the second for InnoDB. The charts use the name Medium server instead of C2D.

Point queries, part 1
Point queries, part 2
Range queries, part 1
Range queries, part 2
Writes


No comments:

Post a Comment

RocksDB on a big server: LRU vs hyperclock, v2

This post show that RocksDB has gotten much faster over time for the read-heavy benchmarks that I use. I recently shared results from a lar...