tag:blogger.com,1999:blog-91495239278647510872024-03-18T11:43:35.965-07:00Small DatumMark Callaghanhttp://www.blogger.com/profile/09590445221922043181noreply@blogger.comBlogger623125tag:blogger.com,1999:blog-9149523927864751087.post-21002423753827524952024-03-18T10:08:00.000-07:002024-03-18T10:08:58.002-07:00Comparing Postgres and MySQL on the insert benchmark with a small server<p>My primary goal with the benchmarks I run has been to identify performance regressions, especially ones that can be fixed to make open source databases better. And so I focus on comparing old and new versions of one DBMS at a time to identify where things get better or worse. But here I compare Postgres with MySQL (InnoDB & MyRocks) to show that neither is the best for the <a href="https://smalldatum.blogspot.com/2023/12/updates-for-insert-benchmark-december.html">Insert Benchmark</a> -- all are good, but none are perfect.<br /><br />The per-DBMS results are here for <a href="https://smalldatum.blogspot.com/2024/03/trying-to-tune-postgres-for-insert.html">Postgres</a>, <a href="https://smalldatum.blogspot.com/2024/03/yes-another-insert-benchmark-result.html">InnoDB</a> and <a href="https://smalldatum.blogspot.com/2024/03/yes-another-insert-benchmark-result_17.html">MyRocks</a>. Those posts also have links to the configurations and builds that I used. This post shares the same result but makes it easier to compare across DBMS. <br /><br />Results here are from a small server (8 cores) with a low concurrency workload (1 client, <= 3 concurrent connections). 
Results from a larger server are pending and might not be the same as what I share here.</p><p>Summary of throughput for the IO-bound workload</p><p></p><ul style="text-align: left;"><li>Initial load in key order (l.i0)</li><ul><li>Postgres is fastest</li></ul><li>Write-only with secondary index maintenance (l.i1, l.i2)</li><ul><li>MyRocks is fastest</li></ul><li>Range queries (qr100, qr500, qr1000)</li><ul><li>Postgres is fastest</li></ul><li>Point queries (qp100, qp500, qp1000)</li><ul><li>MyRocks is fastest, Postgres failed to sustain the target write rate for qp1000</li></ul></ul><div>Summary of efficiency for the IO-bound workload</div><ul style="text-align: left;"><li>Space efficiency</li><ul><li>MyRocks is best, Postgres/InnoDB used ~3.5X/~3X more space</li></ul><li>Write efficiency</li><ul><li>MyRocks is best; on the l.i1 benchmark step Postgres and InnoDB write ~9X and ~80X more KB to storage per insert than MyRocks.</li></ul><li>Read efficiency</li><ul><li>MyRocks is the best, which might surprise people. Both InnoDB and Postgres do more read IO per query for both point and range queries. Bloom filters and less space amplification might explain this.</li></ul></ul><div>Summary of throughput over time</div><div><ul style="text-align: left;"><li>All DBMS have noise (variance) in some cases. 
Results for MyRocks aren't any worse than for Postgres or InnoDB.</li></ul></div><p></p><p></p><div><div><b>Build + Configuration</b></div><div><br />Versions tested<br /><ul style="text-align: left;"><li><span style="text-align: right;">pg162_def.cx9a2a_bee</span></li><ul><li>Postgres 16.2 and the cx9a2a_bee config</li></ul><li>my8036_rel.cz10a_bee</li><ul><li>Upstream MySQL 8.0.36 with InnoDB and the cz10a_bee config</li></ul><li><span style="text-align: right;">fbmy8028_rel_221222.cza1_bee</span></li><ul><li>MyRocks 8.0.28 from code as of 2023-12-22 at git hash 2ad105fc, RocksDB 8.7.0 at git hash 29005f0b, cza1_bee config</li><li>Compression is enabled, which saves space at the cost of more CPU</li></ul></ul></div><div>The config files <a href="https://github.com/mdcallag/mytools/tree/master/bench/conf/arc/mar24.bee.pg">are here</a>.</div></div><div><br /></div><div><b>The Benchmark</b></div><div><div><div><br /></div><div>The benchmark is run with 1 client. It is <a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-postgres-9x-to.html">explained here</a> and was run in two setups:</div><div><ul><li>cached - database has 30M rows and fits in memory</li><li>IO-bound - database has 800M rows and is larger than memory</li></ul></div><div>The test server was named SER4 in the previous report. It has 8 cores, 16G RAM, Ubuntu 22.04 and XFS using 1 m.2 device.</div><div><br />The benchmark steps are:<div><p></p><div><ul><li>l.i0</li><ul><li>insert X rows per table in PK order. The table has a PK index but no secondary indexes. There is one connection per client. X is 30M for cached and 800M for IO-bound.</li></ul><li>l.x</li><ul><li>create 3 secondary indexes per table. There is one connection per client.</li></ul><li>l.i1</li><ul><li>use 2 connections/client. One inserts Y rows and the other does deletes at the same rate as the inserts. Each transaction modifies 50 rows (big transactions). 
This step is run for a fixed number of inserts, so the run time varies depending on the insert rate. Y is 80M for cached and 4M for IO-bound.</li></ul><li>l.i2</li><ul><li>like l.i1 but each transaction modifies 5 rows (small transactions) and Y is 20M for cached and 1M for IO-bound.</li><li>Wait for X seconds after the step finishes to reduce variance during the read-write benchmark steps that follow.</li></ul><li>qr100</li><ul><li>use 3 connections/client. One does range queries for Z seconds and performance is reported for this. The second does 100 inserts/s and the third does 100 deletes/s. The second and third are less busy than the first. The range queries use covering secondary indexes. This step is run for a fixed amount of time. If the target insert rate is not sustained then that is considered to be an SLA failure. If the target insert rate is sustained then the step does the same number of inserts for all systems tested. Z is 3600 for cached and 1800 for IO-bound.</li></ul><li>qp100</li><ul><li>like qr100 except uses point queries on the PK index</li></ul><li>qr500</li><ul><li>like qr100 but the insert and delete rates are increased from 100/s to 500/s</li></ul><li>qp500</li><ul><li>like qp100 but the insert and delete rates are increased from 100/s to 500/s</li></ul><li>qr1000</li><ul><li>like qr100 but the insert and delete rates are increased from 100/s to 1000/s</li></ul><li>qp1000</li><ul><li>like qp100 but the insert and delete rates are increased from 100/s to 1000/s</li></ul></ul><div><div><b>Results: throughput</b></div><div><br /></div><div>The performance reports are here <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.mem.bee.some/all.html">for cached</a> and <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/all.html">for IO-bound</a>.</div></div></div></div></div></div><div><br /></div><div><div>The summary has 3 tables. The first shows absolute throughput for each DBMS tested and benchmark step. 
The second has throughput relative to the DBMS on the first row of the table. The third shows the background insert rate for benchmark steps with background inserts and whether all systems sustained the target rates. The second table makes it easy to compare performance across DBMS.</div><div><br /></div><div>Below I use relative QPS to explain how performance differs. It is: (QPS for $me / QPS for $base) where $me is the DBMS being compared and $base is the base case. When relative QPS is > 1.0 then $me is faster than the base case. When it is < 1.0 then $me is slower. The Q in relative QPS measures: </div><div><ul><li>insert/s for l.i0, l.i1, l.i2</li><li>indexed rows/s for l.x</li><li>range queries/s for qr100, qr500, qr1000</li><li>point queries/s for qp100, qp500, qp1000</li></ul><div>Below I use colors to highlight the relative QPS values with <span style="background-color: #f4cccc;">red</span> for <= 0.95, <span style="background-color: #d9ead3;">green</span> for >= 1.05 and <span style="background-color: #eeeeee;">grey</span> for values between 0.95 and 1.05.</div><div><br /></div><div>From the summary <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.mem.bee.some/all.html#summary">for cached</a>:</div><div><ul style="text-align: left;"><li>the base case is Postgres 16.2, numbers in red mean Postgres is faster</li><li>comparing InnoDB and MyRocks with the base case</li><ul><li>l.i0</li><ul><li>InnoDB - relative QPS is <span style="background-color: #f4cccc;">0.75</span></li><li><span style="background-color: white;">MyRocks - relative QPS is </span><span style="background-color: #f4cccc;">0.77</span></li></ul><li>l.x - I ignore this for now</li><li>l.i1, l.i2</li><ul><li>InnoDB - relative QPS is <span style="background-color: #f4cccc;">0.86</span>, <span style="background-color: #d9ead3;">1.63</span></li><li>MyRocks - relative QPS is <span style="background-color: #d9ead3;">1.12</span>, <span style="background-color: 
#d9ead3;">1.47</span></li></ul><li>qr100, qr500, qr1000</li><ul><li>InnoDB - relative QPS is <span style="background-color: #f4cccc;">0.40</span>, <span style="background-color: #f4cccc;">0.42</span>, <span style="background-color: #f4cccc;">0.41</span></li><li><span style="background-color: white;">MyRocks - relative QPS is </span><span style="background-color: #f4cccc;">0.19</span><span style="background-color: white;">, </span><span style="background-color: #f4cccc;">0.16</span><span style="background-color: white;">, </span><span style="background-color: #f4cccc;">0.16</span></li></ul><li>qp100, qp500, qp1000</li><ul><li>InnoDB - relative QPS is <span style="background-color: #f4cccc;">0.82</span><span style="background-color: white;">, </span><span style="background-color: #f4cccc;">0.81</span><span style="background-color: white;">, </span><span style="background-color: #f4cccc;">0.82</span></li><li><span style="background-color: white;">MyRocks - relative QPS is </span><span style="background-color: #f4cccc;">0.71</span><span style="background-color: white;">, </span><span style="background-color: #f4cccc;">0.70</span><span style="background-color: white;">, </span><span style="background-color: #f4cccc;">0.69</span></li></ul></ul></ul><div><div>From the summary <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/all.html#summary">for IO-bound</a>:</div><div><ul><li>the base case is Postgres 16.2, numbers in red mean Postgres is faster</li><li>comparing InnoDB and MyRocks with the base case</li><ul><li>l.i0</li><ul><li>InnoDB - relative QPS is <span style="background-color: #f4cccc;">0.74</span></li><li><span style="background-color: white;">MyRocks - relative QPS is </span><span style="background-color: #f4cccc;">0.77</span></li></ul><li>l.x - I ignore this for now</li><li>l.i1, l.i2</li><ul><li>InnoDB - relative QPS is <span style="background-color: #f4cccc;">0.83</span>, <span style="background-color: 
#d9ead3;">18.35</span></li><li>MyRocks - relative QPS is <span style="background-color: #d9ead3;">11.45</span>, <span style="background-color: #d9ead3;">73.55</span></li></ul><li>qr100, qr500, qr1000</li><ul><li>InnoDB - relative QPS is <span style="background-color: #f4cccc;">0.42</span>, <span style="background-color: #f4cccc;">0.47</span>, <span style="background-color: #f4cccc;">0.55</span></li><li><span style="background-color: white;">MyRocks - relative QPS is </span><span style="background-color: #f4cccc;">0.07</span><span style="background-color: white;">, </span><span style="background-color: #f4cccc;">0.06</span><span style="background-color: white;">, </span><span style="background-color: #f4cccc;">0.06</span></li></ul><li>qp100, qp500, qp1000</li><ul><li>InnoDB - relative QPS is <span style="background-color: #d9ead3;">1.56</span>, <span style="background-color: #d9ead3;">1.46</span>, <span style="background-color: #d9ead3;">1.44</span></li><li><span style="background-color: white;">MyRocks - relative QPS is </span><span style="background-color: #d9ead3;">2.15</span><span style="background-color: white;">, </span><span style="background-color: #d9ead3;">2.13</span><span style="background-color: white;">, </span><span style="background-color: #d9ead3;">2.21</span></li><li><span style="background-color: white;">Postgres failed to sustain the target write rate during qp1000. The target was ~1000/s and it sustained 927/s.</span></li></ul></ul></ul></div></div></div><div><b>Results: efficiency</b></div></div></div></div><div><br />Here I focus on the results from the IO-bound workload. The <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/all.html#l.i0.metrics">efficiency section</a> of the IO-bound perf report has a lot of information.</div><div><br /></div><div>At test end (after qp1000.L6) the database size in GB is 192.6 for Postgres, 166.4 for InnoDB and 54.8 for MyRocks. 
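As a quick sanity check, those sizes translate into space-amplification ratios relative to MyRocks (a minimal Python sketch; the only inputs are the sizes reported above):

```python
# Database sizes in GB at test end (after qp1000.L6), from the report above
sizes_gb = {"Postgres": 192.6, "InnoDB": 166.4, "MyRocks": 54.8}

base = sizes_gb["MyRocks"]
for dbms, size_gb in sizes_gb.items():
    # Ratio of each DBMS's database size to the MyRocks size
    print(f"{dbms}: {size_gb / base:.2f}X")
# Postgres: 3.51X, InnoDB: 3.04X, MyRocks: 1.00X
```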
Compared to MyRocks, Postgres uses ~3.5X more space and InnoDB uses ~3X more space. Compression is enabled for MyRocks, which saves space at the cost of more CPU.</div><div><br /></div><div>Explaining l.i0 - load in key order</div><div><ul style="text-align: left;"><li>Data <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/all.html#l.i0.metrics">is here</a></li><li>Postgres uses the least CPU per statement (see cpupq, CPU per query). It is ~1.2X larger with InnoDB and MyRocks. CPU probably explains the perf difference.</li><li>MyRocks writes the least to storage per statement (see wkbpi, KB written per insert)</li></ul><div>Explaining l.i1 - write-only, 50 rows/commit</div></div><div><ul style="text-align: left;"><li>Data <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/all.html#l.i1.metrics">is here</a></li><li>MyRocks does the fewest reads from storage per statement (see rpq, reads per query). The rate is ~278X larger for Postgres and ~859X larger for InnoDB. The ratio is so large because non-unique secondary index maintenance <a href="https://smalldatum.blogspot.com/2017/09/write-heavy-workloads-with-myrocks.html">is read free</a> for MyRocks. The <a href="https://smalldatum.blogspot.com/2023/02/the-value-of-innodb-change-buffer.html">InnoDB change buffer</a> provides a similar but less significant benefit (I enabled the change buffer for these tests). Alas, with Postgres the leaf pages for secondary indexes must undergo read-modify-write as the <a href="https://www.postgresql.org/docs/current/storage-hot.html">heap-only tuple optimization</a> can't be used for this schema.</li><li>MyRocks uses the least CPU per statement (see cpupq, CPU per query). It is ~3X larger with Postgres and ~5X larger with InnoDB.</li><li>MyRocks has the best write efficiency (see wkbpi, KB written to storage per insert). 
It is ~9X larger for Postgres and ~80X larger for InnoDB.</li></ul><div><div>Explaining l.i2 - write-only, 5 rows/commit</div><div><ul><li>Data <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/all.html#l.i2.metrics">is here</a></li><li>Results are similar to l.i1 above with one exception. The CPU overhead for Postgres was ~3X larger than MyRocks for l.i1 but here it is more than 20X larger because of <a href="https://www.google.com/search?q=site%3Asmalldatum.blogspot.com+get_actual_variable_range">the problem</a> with the optimizer spending too much time in get_actual_variable_range.</li></ul><div>Explaining range queries - qr100, qr500, qr1000</div></div></div></div><div><ul style="text-align: left;"><li>Data <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/all.html#qr100.L1.metrics">is here</a></li><li>The read IO overhead is similar for Postgres and MyRocks (see rpq, reads per query) while it is ~8X larger for InnoDB. A standard hand-waving analysis would predict that MyRocks wasn't going to be as read IO efficient as Postgres, but prefix bloom filters and less space amplification help it here.</li><li>Postgres has the smallest CPU overhead (see cpupq, CPU per query). It is ~2.6X larger for InnoDB and ~15X larger for MyRocks. I hope to explain why MyRocks uses so much more CPU.</li></ul><div>Explaining point queries - qp100, qp500, qp1000</div></div><div><ul style="text-align: left;"><li>Data <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/all.html#qp100.L2.metrics">is here</a></li><li>MyRocks has the best read IO efficiency (see rpq, reads per query). It is ~2.2X and ~1.3X larger for Postgres and InnoDB. 
Bloom filters and less space amplification might explain this.</li><li>All DBMS have a similar CPU overhead (see cpupq, CPU per query).</li></ul><div><b>Results: throughput over time</b></div></div><div><br /></div><div>Explaining l.i0 - load in key order</div><div><ul style="text-align: left;"><li>Data is here for <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.l.i0.html#pg162_def.cx9a2a_bee.ips">Postgres</a>, <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.l.i0.html#my8036_rel.cz10a_bee.ips">InnoDB</a> and <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.l.i0.html#fbmy8028_rel_221222.cza1_bee.ips">MyRocks</a></li><li>Results are stable for all DBMS but MyRocks has the most noise</li></ul><div><div>Explaining l.i1 - write-only, 50 rows/commit</div><div><ul><li>Data is here for <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.l.i1.html#pg162_def.cx9a2a_bee.ips">Postgres</a>, <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.l.i1.html#my8036_rel.cz10a_bee.ips">InnoDB</a> and <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.l.i1.html#fbmy8028_rel_221222.cza1_bee.ips">MyRocks</a></li><li>The insert/s rate declines over time for Postgres, which is expected, but grows over time for InnoDB from ~1000/s to ~3000/s. 
I assume that InnoDB initially suffers more from page splits and performance improves as they become less frequent over time.</li></ul><div><div>Explaining l.i2 - write-only, 5 rows/commit</div><div><ul><li>Data is here for <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.l.i2.html#pg162_def.cx9a2a_bee.ips">Postgres</a>, <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.l.i2.html#my8036_rel.cz10a_bee.ips">InnoDB</a> and <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.l.i2.html#fbmy8028_rel_221222.cza1_bee.ips">MyRocks</a></li><li>The insert/s and delete/s rates for Postgres decrease slightly over time. I assume the issue is that the optimizer CPU overhead for delete statements grows over time, which is apparent on the chart for max delete response time (<a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.l.i2.html#pg162_def.cx9a2a_bee.ips">start here</a> and scroll down).</li></ul><div>Explaining range queries - qr100, qr500, qr1000</div></div></div></div><div><ul style="text-align: left;"><li>Data is here for <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qr100.L1.html">qr100</a>, <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qr500.L3.html">qr500</a> and <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qr1000.L5.html">qr1000</a></li><li>For qr100 measured at 1-second intervals</li><ul><li>For Postgres the <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qr100.L1.html#pg162_def.cx9a2a_bee.qps">query rate</a> has noise (ranges between 8000/s and 12000/s), the <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qr100.L1.html#pg162_def.cx9a2a_bee.imax">max insert response time</a> is noisy and the <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qr100.L1.html#pg162_def.cx9a2a_bee.dmax">max delete 
response time</a> is stable but grows over time.</li><li>For InnoDB the <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qr100.L1.html#my8036_rel.cz10a_bee.qps">query rate</a> is stable, the <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qr100.L1.html#my8036_rel.cz10a_bee.imax">max insert response time</a> is noisy and the <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qr100.L1.html#my8036_rel.cz10a_bee.dmax">max delete response time</a> is stable.</li><li>For MyRocks the <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qr100.L1.html#fbmy8028_rel_221222.cza1_bee.qps">query rate</a> has noise (ranges between 500/s and 600/s), the <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qr100.L1.html#fbmy8028_rel_221222.cza1_bee.imax">max insert response time</a> is stable and the <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qr100.L1.html#fbmy8028_rel_221222.cza1_bee.dmax">max delete response time</a> is stable.</li></ul><li>For qr1000 measured at 1-second intervals</li><ul><li>For Postgres the <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qr1000.L5.html#pg162_def.cx9a2a_bee.qps">query rate</a> has noise and declines over time, the <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qr1000.L5.html#pg162_def.cx9a2a_bee.imax">max insert response time</a> is noisy and the <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qr1000.L5.html#pg162_def.cx9a2a_bee.dmax">max delete response time</a> is stable but grows over time.</li><li>For InnoDB the <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qr1000.L5.html#my8036_rel.cz10a_bee.qps">query rate</a> is stable, the <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qr1000.L5.html#my8036_rel.cz10a_bee.imax">max 
insert response time</a> is noisy and the <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qr1000.L5.html#my8036_rel.cz10a_bee.dmax">max delete response time</a> is noisy.</li><li>For MyRocks the <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qr1000.L5.html#fbmy8028_rel_221222.cza1_bee.qps">query rate</a> is noisy, the <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qr1000.L5.html#fbmy8028_rel_221222.cza1_bee.imax">max insert response time</a> is noisy and the <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qr1000.L5.html#fbmy8028_rel_221222.cza1_bee.dmax">max delete response time</a> is noisy.</li></ul></ul></div><div><div>Explaining point queries - qp100, qp500, qp1000</div></div><div><ul style="text-align: left;"><li>Data is here for <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qp100.L2.html">qp100</a>, <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qp500.L4.html">qp500</a> and <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qp1000.L6.html">qp1000</a></li><li>For qp100 measured at 1-second intervals</li><ul><li>For Postgres the <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qp100.L2.html#pg162_def.cx9a2a_bee.qps">query rate</a> has noise, the <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qp100.L2.html#pg162_def.cx9a2a_bee.imax">max insert response time</a> has noise, the <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qp100.L2.html#pg162_def.cx9a2a_bee.dmax">max delete response time</a> has noise and is growing</li><li>For InnoDB the <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qp100.L2.html#my8036_rel.cz10a_bee.qps">query rate</a> is stable, the <a 
href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qp100.L2.html#my8036_rel.cz10a_bee.imax">max insert response time</a> has noise, the <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qp100.L2.html#my8036_rel.cz10a_bee.dmax">max delete response time</a> is stable</li><li>For MyRocks the <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qp100.L2.html#fbmy8028_rel_221222.cza1_bee.qps">query rate</a> is stable, the <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qp100.L2.html#fbmy8028_rel_221222.cza1_bee.imax">max insert response time</a> has noise, the <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qp100.L2.html#fbmy8028_rel_221222.cza1_bee.dmax">max delete response time</a> has noise</li></ul><li>For qp1000 measured at 1-second intervals</li><ul><li>For Postgres the <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qp1000.L6.html#pg162_def.cx9a2a_bee.qps">query rate</a> has noise, the <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qp1000.L6.html#pg162_def.cx9a2a_bee.imax">max insert response time</a> has too much noise especially from 1200s to 1600s which explains why it failed to sustain the target insert and delete rates, the <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qp1000.L6.html#pg162_def.cx9a2a_bee.dmax">max delete response time</a> is stable and growing.</li><li>For InnoDB the <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qp1000.L6.html#my8036_rel.cz10a_bee.qps">query rate</a> is stable, the <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qp1000.L6.html#my8036_rel.cz10a_bee.imax">max insert response time</a> has too much noise, the <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qp1000.L6.html#my8036_rel.cz10a_bee.dmax">max delete 
response time</a> has much noise</li><li>For MyRocks the <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qp1000.L6.html#fbmy8028_rel_221222.cza1_bee.qps">query rate</a> is stable, the <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qp1000.L6.html#fbmy8028_rel_221222.cza1_bee.imax">max insert response time</a> has noise, the <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.some/tput.qp1000.L6.html#fbmy8028_rel_221222.cza1_bee.dmax">max delete response time</a> has noise</li></ul></ul><div><br /></div></div></div></div><div><br /></div><div><br /></div><div><br /></div><div><br /></div><div><br /></div><div><br /></div><div><br /></div>Mark Callaghanhttp://www.blogger.com/profile/09590445221922043181noreply@blogger.com0tag:blogger.com,1999:blog-9149523927864751087.post-60663765973636363432024-03-17T18:51:00.000-07:002024-03-18T08:45:25.562-07:00Yet another Insert Benchmark result: MyRocks, MySQL and a small server<p>While trying to explain a Postgres performance problem I repeated the <a href="https://smalldatum.blogspot.com/2023/12/updates-for-insert-benchmark-december.html">Insert Benchmark</a> on a small server for MyRocks from MySQL 5.6 and 8.0. This post explains those results. 
The previous report for a cached workload <a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-myrocks-56-and_12.html">is here</a>.</p><p>tl;dr</p><p></p><ul style="text-align: left;"><li>Disclaimers</li><ul><li>The low-concurrency results here are worse than the results from a bigger server with more concurrency because results here depend more on CPU overheads and MySQL code paths keep growing, while on the bigger server the cost of new CPU overheads is offset by other improvements.</li><li>Some of the regressions here are similar to what I measure for InnoDB and the problem is likely code above the storage engine layer.</li><li>For MyRocks 8.0.28 compared to 5.6.35</li><ul><li>Results for most benchmark steps aren't surprising: MyRocks 8.0.28 gets between 80% and 95% of the throughput compared to MyRocks 5.6.35</li><li>Results for the qr1000.L5 benchmark step with the IO-bound workload are odd. MyRocks 8.0.28 gets only 39% of the throughput compared to MyRocks 5.6.35. <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.fbmy/all.html#qr1000.L5.metrics">From the metrics</a> I see that MyRocks 8.0.28 does ~2X more read IO/query (see rpq) and uses ~2X more CPU/query (see cpupq). I have yet to explain this.</li></ul></ul></ul><p></p><div><b>Build + Configuration</b></div><div><br />This report has results for MyRocks 5.6.35 and 8.0.28. 
The cza1_bee config was used and it <a href="https://github.com/mdcallag/mytools/tree/master/bench/conf/arc/mar24.bee.pg">is here</a>.</div><div><br /></div><div>The builds tested are:</div><div><ul style="text-align: left;"><li>fbmy5635_rel_202203072101.cza1_bee</li><ul><li>MyRocks 5.6.35 from code as of 2022-03-07 at git hash e7d976ee with RocksDB 6.28.2, cza1_bee config</li></ul><li>fbmy5635_rel_20230529_850.cza1_bee</li><ul><li>MyRocks 5.6.35 from code as of 2023-05-29 at git hash b739eac1 with RocksDB 8.5.0, cza1_bee config</li></ul><li>fbmy8028_rel_20220829_752.cza1_bee</li><ul><li>MyRocks 8.0.28 from code as of 2022-08-29 at git hash a35c8dfeab, RocksDB 7.5.2, cza1_bee config</li></ul><li>fbmy8028_rel_20230619_831.cza1_bee</li><ul><li>MyRocks 8.0.28 from code as of 2023-06-19 at git hash 6164cf0274, RocksDB 8.3.1, cza1_bee config</li></ul><li>fbmy8028_rel_221222.cza1_bee</li><ul><li>MyRocks 8.0.28 from code as of 2023-12-22 at git hash 2ad105fc, RocksDB 8.7.0 at git hash 29005f0b, cza1_bee config</li></ul><li>fbmy8028_rel_231222_870.cza1_bee_cfx</li><ul><li>MyRocks 8.0.28 from code as of 2023-12-22 at git hash 2ad105fc, RocksDB 8.7.0 at git hash 29005f0b, cza1_bee config, indexes use a separate column family</li></ul></ul><div><div><b>The Benchmark</b></div><div><div><br /></div><div>The benchmark is run with 1 client. It is <a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-postgres-9x-to.html">explained here</a> and was run in two setups:</div><div><ul><li>cached - database has 30M rows and fits in memory</li><li>IO-bound - database has 800M rows and is larger than memory</li></ul></div><div>The test server was named SER4 in the previous report. It has 8 cores, 16G RAM, Ubuntu 22.04 and XFS using 1 m.2 device.</div><div><br />The benchmark steps are:<div><p></p><div><ul><li>l.i0</li><ul><li>insert X rows per table in PK order. The table has a PK index but no secondary indexes. There is one connection per client. 
X is 30M for cached and 800M for IO-bound.</li></ul><li>l.x</li><ul><li>create 3 secondary indexes per table. There is one connection per client.</li></ul><li>l.i1</li><ul><li>use 2 connections/client. One inserts Y rows and the other does deletes at the same rate as the inserts. Each transaction modifies 50 rows (big transactions). This step is run for a fixed number of inserts, so the run time varies depending on the insert rate. Y is 80M for cached and 4M for IO-bound.</li></ul><li>l.i2</li><ul><li>like l.i1 but each transaction modifies 5 rows (small transactions) and Y is 20M for cached and 1M for IO-bound.</li><li>Wait for X seconds after the step finishes to reduce variance during the read-write benchmark steps that follow.</li></ul><li>qr100</li><ul><li>use 3 connections/client. One does range queries for Z seconds and performance is reported for this. The second does 100 inserts/s and the third does 100 deletes/s. The second and third are less busy than the first. The range queries use covering secondary indexes. This step is run for a fixed amount of time. If the target insert rate is not sustained then that is considered to be an SLA failure. If the target insert rate is sustained then the step does the same number of inserts for all systems tested. 
Z is 3600 for cached and 1800 for IO-bound.</li></ul><li>qp100</li><ul><li>like qr100 except uses point queries on the PK index</li></ul><li>qr500</li><ul><li>like qr100 but the insert and delete rates are increased from 100/s to 500/s</li></ul><li>qp500</li><ul><li>like qp100 but the insert and delete rates are increased from 100/s to 500/s</li></ul><li>qr1000</li><ul><li>like qr100 but the insert and delete rates are increased from 100/s to 1000/s</li></ul><li>qp1000</li><ul><li>like qp100 but the insert and delete rates are increased from 100/s to 1000/s</li></ul></ul><div><div><b>Results</b></div><div><br /></div><div>The performance reports are here <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.mem.bee.fbmy/all.html">for cached</a> and <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.fbmy/all.html">for IO-bound</a>.</div></div></div></div></div></div><div><br /></div><div><div>The summary has 3 tables. The first shows absolute throughput by DBMS tested X benchmark step. The second has throughput relative to the version on the first row of the table. The third shows the background insert rate for benchmark steps with background inserts and all systems sustained the target rates. The second table makes it easy to see how performance changes over time.</div><div><br /></div><div>Below I use relative QPS to explain how performance changes. It is: (QPS for $me / QPS for $base) where $me is my version and $base is the version of the base case. When relative QPS is > 1.0 then performance improved over time. When it is < 1.0 then there are regressions. 
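As a sketch of this calculation (the function names are mine, not part of the benchmark tooling; the thresholds are the ones used for the color coding in the summaries):

```python
def relative_qps(qps_me: float, qps_base: float) -> float:
    """Relative QPS = (QPS for $me / QPS for $base)."""
    return qps_me / qps_base

def color(rel_qps: float) -> str:
    """Color coding for relative QPS in the summary tables."""
    if rel_qps <= 0.95:
        return "red"    # regression
    if rel_qps >= 1.05:
        return "green"  # improvement
    return "grey"       # within the noise

# Example with made-up throughput numbers:
rel = relative_qps(720.0, 1000.0)
print(rel, color(rel))  # 0.72 red
```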
The Q in relative QPS measures: </div><div><ul><li>insert/s for l.i0, l.i1, l.i2</li><li>indexed rows/s for l.x</li><li>range queries/s for qr100, qr500, qr1000</li><li>point queries/s for qp100, qp500, qp1000</li></ul><div>Below I use colors to highlight the relative QPS values with <span style="background-color: #f4cccc;">red</span> for <= 0.95, <span style="background-color: #d9ead3;">green</span> for >= 1.05 and <span style="background-color: #eeeeee;">grey</span> for values between 0.95 and 1.05.</div><div><br /></div><div>From the summary <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.mem.bee.fbmy/all.html#summary">for cached</a>:</div><div><ul style="text-align: left;"><li>the base case is fbmy5635_rel_202203072101 (MyRocks 5.6.35 from 2022)</li><li>comparing fbmy8028_rel_231222_870 (latest MyRocks 8.0.28) with the base case</li><ul><li>l.i0</li><ul><li>relative QPS is <span style="background-color: #f4cccc;">0.72</span><span style="background-color: white;"> in </span>fbmy8028_rel_231222_870</li></ul><li>l.x - I ignore this for now</li><li>l.i1, l.i2</li><ul><li>relative QPS is <span style="background-color: #f4cccc;">0.85</span>, <span style="background-color: #f4cccc;">0.82 </span><span style="background-color: white;">in </span>fbmy8028_rel_231222_870</li></ul><li>qr100, qr500, qr1000</li><ul><li>relative QPS is <span style="background-color: #d9ead3;">1.06</span>, <span style="background-color: #d9ead3;">1.24</span>, <span style="background-color: #d9ead3;">1.06</span> <span style="background-color: white;">in </span>fbmy8028_rel_231222_870</li></ul><li>qp100, qp500, qp1000</li><ul><li>relative QPS is <span style="background-color: #eeeeee;">0.96</span>, <span style="background-color: #f4cccc;">0.95</span>, <span style="background-color: #eeeeee;">0.96</span> <span style="background-color: white;">in </span>fbmy8028_rel_231222_870</li></ul></ul></ul><div><div>From the summary <a 
href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.fbmy/all.html#summary">for IO-bound</a>:</div><div><ul style="text-align: left;"><li>the base case is fbmy5635_rel_202203072101 (MyRocks 5.6.35 from 2022)</li><li>comparing fbmy8028_rel_231222_870 (latest MyRocks 8.0.28) with the base case</li><ul><li>l.i0</li><ul><li>relative QPS is <span style="background-color: #f4cccc;">0.72</span><span style="background-color: white;"> in </span>fbmy8028_rel_231222_870</li></ul><li>l.x - I ignore this for now</li><li>l.i1, l.i2</li><ul><li>relative QPS is <span style="background-color: #f4cccc;">0.86</span>, <span style="background-color: #f4cccc;">0.80 </span><span style="background-color: white;">in </span>fbmy8028_rel_231222_870</li></ul><li>qr100, qr500, qr1000</li><ul><li>relative QPS is <span style="background-color: #f4cccc;">0.80</span>, <span style="background-color: #f4cccc;">0.80</span>, <span style="background-color: #f4cccc;">0.39</span> <span style="background-color: white;">in </span>fbmy8028_rel_231222_870</li><li>the 0.39 value is an outlier. <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.fbmy/all.html#qr1000.L5.metrics">From the metrics</a> I see that MyRocks 8.0.28 does ~2X more read IO/query (see rpq) and uses ~2X more CPU/query (see cpupq). 
I have yet to explain this.</li></ul><li>qp100, qp500, qp1000</li><ul><li>relative QPS is <span style="background-color: #f4cccc;">0.94</span>, <span style="background-color: #f4cccc;">0.94</span>, <span style="background-color: #f4cccc;">0.94</span> <span style="background-color: white;">in </span>fbmy8028_rel_231222_870</li></ul></ul></ul></div></div></div><div><br /></div></div></div></div></div>Mark Callaghanhttp://www.blogger.com/profile/09590445221922043181noreply@blogger.com0tag:blogger.com,1999:blog-9149523927864751087.post-56102891640153116512024-03-17T18:22:00.000-07:002024-03-18T08:45:33.068-07:00Yet another Insert Benchmark result: MySQL, InnoDB and a small server<p>While trying to explain a Postgres performance problem I repeated the <a href="https://smalldatum.blogspot.com/2023/12/updates-for-insert-benchmark-december.html">Insert Benchmark</a> on a small server for InnoDB from MySQL 5.6, 5.7 and 8.0. This post explains those results. Previous reports are here for <a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-innodbmysql-56.html">cached</a> and <a href="https://smalldatum.blogspot.com/2024/02/updated-insert-benchmark-innodbmysql-56.html">IO-bound</a> workloads and the results here are similar.</p><p>tl;dr</p><p></p><ul style="text-align: left;"><li>Disclaimer - the low-concurrency results here are worse than the results from a bigger server with more concurrency because the result here depends more on CPU overheads and MySQL keeps on growing code paths, while on the bigger server the cost from new CPU overheads is offset by other improvements.</li><li>There are significant regressions from 5.6 to 5.7 and again from 5.7 to 8.0</li></ul><div><b>Build + Configuration</b></div><div><br />This report has results for InnoDB with MySQL 5.6.51, 5.7.44 and 8.0.36. 
The cz10a_bee config was used and it <a href="https://github.com/mdcallag/mytools/tree/master/bench/conf/arc/mar24.bee.pg">is here</a>.</div><p></p><div><b>The Benchmark</b></div><div><div><br /></div><div>The benchmark is run with 1 client. It is <a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-postgres-9x-to.html">explained here</a> and was run in two setups</div><div><ul><li>cached - database has 30M rows and fits in memory</li><li>IO-bound - database has 800M rows and is larger than memory.</li></ul></div><div>The test server was named SER4 in the previous report. It has 8 cores, 16G RAM, Ubuntu 22.04 and XFS using 1 m.2 device.</div><div><br />The benchmark steps are:<div><p></p><div><ul><li>l.i0</li><ul><li>insert X million rows per table in PK order. The table has a PK index but no secondary indexes. There is one connection per client. X is 30M for cached and 800M for IO-bound.</li></ul><li>l.x</li><ul><li>create 3 secondary indexes per table. There is one connection per client.</li></ul><li>l.i1</li><ul><li>use 2 connections/client. One inserts Y rows and the other does deletes at the same rate as the inserts. Each transaction modifies 50 rows (big transactions). This step is run for a fixed number of inserts, so the run time varies depending on the insert rate. Y is 80M for cached and 4M for IO-bound.</li></ul><li>l.i2</li><ul><li>like l.i1 but each transaction modifies 5 rows (small transactions) and Y is 20M for cached and 1M for IO-bound.</li><li>Wait for X seconds after the step finishes to reduce variance during the read-write benchmark steps that follow.</li></ul><li>qr100</li><ul><li>use 3 connections/client. One does range queries for Z seconds and performance is reported for this. The second does 100 inserts/s and the third does 100 deletes/s. The second and third are less busy than the first. The range queries use covering secondary indexes. This step is run for a fixed amount of time. 
If the target insert rate is not sustained then that is considered to be an SLA failure. If the target insert rate is sustained then the step does the same number of inserts for all systems tested. Z is 3600 for cached and 1800 for IO-bound.</li></ul><li>qp100</li><ul><li>like qr100 except uses point queries on the PK index</li></ul><li>qr500</li><ul><li>like qr100 but the insert and delete rates are increased from 100/s to 500/s</li></ul><li>qp500</li><ul><li>like qp100 but the insert and delete rates are increased from 100/s to 500/s</li></ul><li>qr1000</li><ul><li>like qr100 but the insert and delete rates are increased from 100/s to 1000/s</li></ul><li>qp1000</li><ul><li>like qp100 but the insert and delete rates are increased from 100/s to 1000/s</li></ul></ul><div><div><b>Results</b></div><div><br /></div><div>The performance reports are here <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.mem.bee.inno/all.html">for cached</a> and <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.inno/all.html">for IO-bound</a>.</div></div></div></div></div></div><div><br /></div><div><div>The summary has 3 tables. The first shows absolute throughput by DBMS tested X benchmark step. The second has throughput relative to the version on the first row of the table. The third shows the background insert rate for benchmark steps with background inserts and all systems sustained the target rates. The second table makes it easy to see how performance changes over time.</div><div><br /></div><div>Below I use relative QPS to explain how performance changes. It is: (QPS for $me / QPS for $base) where $me is my version and $base is the version of the base case. When relative QPS is > 1.0 then performance improved over time. When it is < 1.0 then there are regressions. 
The Q in relative QPS measures: </div><div><ul><li>insert/s for l.i0, l.i1, l.i2</li><li>indexed rows/s for l.x</li><li>range queries/s for qr100, qr500, qr1000</li><li>point queries/s for qp100, qp500, qp1000</li></ul><div>Below I use colors to highlight the relative QPS values with <span style="background-color: #f4cccc;">red</span> for <= 0.95, <span style="background-color: #d9ead3;">green</span> for >= 1.05 and <span style="background-color: #eeeeee;">grey</span> for values between 0.95 and 1.05.</div><div><br /></div><div>From the summary <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.mem.bee.inno/all.html#summary">for cached</a>:</div><div><ul style="text-align: left;"><li>the base case is MySQL 5.6.51</li><li>comparing 5.7.44 and 8.0.36 with 5.6.51 shows large regressions</li><ul><li>l.i0</li><ul><li>relative QPS is <span style="background-color: #f4cccc;">0.84</span> in 5.7.44</li><li>relative QPS is <span style="background-color: #f4cccc;">0.57</span> in 8.0.36</li></ul><li>l.x - I ignore this for now</li><li>l.i1, l.i2</li><ul><li>relative QPS is <span style="background-color: #d9ead3;">1.11</span>, <span style="background-color: #f4cccc;">0.88</span> in 5.7.44</li><li>relative QPS is <span style="background-color: #f4cccc;">0.91</span>, <span style="background-color: #f4cccc;">0.73</span> in 8.0.36</li></ul><li>qr100, qr500, qr1000</li><ul><li>relative QPS is <span style="background-color: #f4cccc;">0.73</span>, <span style="background-color: #f4cccc;">0.72</span>, <span style="background-color: #f4cccc;">0.74</span> in 5.7.44</li><li>relative QPS is <span style="background-color: #f4cccc;">0.63</span>, <span style="background-color: #f4cccc;">0.63</span>, <span style="background-color: #f4cccc;">0.63</span> in 8.0.36</li></ul><li>qp100, qp500, qp1000</li><ul><li>relative QPS is <span style="background-color: #f4cccc;">0.83</span>, <span style="background-color: #f4cccc;">0.83</span>, <span style="background-color: #f4cccc;">0.82</span> in 
5.7.44</li><li>relative QPS is <span style="background-color: #f4cccc;">0.63</span>, <span style="background-color: #f4cccc;">0.61</span>, <span style="background-color: #f4cccc;">0.62</span> in 8.0.36</li></ul></ul></ul></div><div>From the summary <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.inno/all.html#summary">for IO-bound</a>:</div></div></div><div><ul style="text-align: left;"><li>the base case is MySQL 5.6.51</li><li>comparing 5.7.44 and 8.0.36 with 5.6.51 shows large regressions</li><ul><li>l.i0</li><ul><li>relative QPS is <span style="background-color: #f4cccc;">0.86</span> in 5.7.44</li><li>relative QPS is <span style="background-color: #f4cccc;">0.59</span> in 8.0.36</li></ul><li>l.x - I ignore this for now</li><li>l.i1, l.i2</li><ul><li>relative QPS is <span style="background-color: #d9ead3;">1.30</span>, <span style="background-color: #d9ead3;">1.26</span> in 5.7.44</li><li>relative QPS is <span style="background-color: #d9ead3;">1.30</span>, <span style="background-color: #d9ead3;">1.16</span> in 8.0.36</li></ul><li>qr100, qr500, qr1000</li><ul><li>relative QPS is <span style="background-color: #f4cccc;">0.76</span>, <span style="background-color: #f4cccc;">0.86</span>, <span style="background-color: #f4cccc;">0.94</span> in 5.7.44</li><li>relative QPS is <span style="background-color: #f4cccc;">0.70</span>, <span style="background-color: #f4cccc;">0.81</span>, <span style="background-color: #f4cccc;">0.89</span> in 8.0.36</li></ul><li>qp100, qp500, qp1000</li><ul><li>relative QPS is <span style="background-color: #eeeeee;">0.98</span>, <span style="background-color: #eeeeee;">0.99</span>, <span style="background-color: #eeeeee;">1.02</span> in 5.7.44</li><li>relative QPS is <span style="background-color: #f4cccc;">0.94</span>, <span style="background-color: #eeeeee;">0.96</span>, <span style="background-color: #eeeeee;">1.02</span> in 8.0.36</li></ul></ul></ul></div>Mark 
Callaghanhttp://www.blogger.com/profile/09590445221922043181noreply@blogger.com0tag:blogger.com,1999:blog-9149523927864751087.post-91435903966843095612024-03-17T12:42:00.000-07:002024-03-18T11:43:03.110-07:00Trying to tune Postgres for the Insert Benchmark: small server<p>Last year I spent much time trying to tune the Postgres configs I use to improve results for the <a href="https://smalldatum.blogspot.com/2023/12/updates-for-insert-benchmark-december.html">Insert Benchmark</a>. While this was a good education for me I wasn't able to get significant improvements. After writing about another perf problem with Postgres (optimizer spends too much time on DELETE statements in a special circumstance) I revisited the tuning but didn't make things significantly better.<br /><br />The results here are from Postgres 16.2 and a small server (8 CPU cores) with a low concurrency workload. Previous benchmark reports for Postgres on this setup are here for <a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-postgres-9x-to_24.html">cached</a> and <a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-postgres-9x-to_27.html">IO-bound</a> runs.</p><p>tl;dr</p><p></p><ul><li>I have yet to fix this problem via tuning</li></ul><div><b>The Problem</b></div><div><br />The performance problem is explained <a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-postgres-9x-to_10.html">here</a> and <a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-postgres-9x-to_27.html">here</a>. The issue is that the optimizer spends too much time on DELETE statements under special circumstances. 
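The delete-from-the-tail pattern that triggers this can be sketched as below. The table name (t), column name (pk_col) and constants are illustrative, not the benchmark client's actual schema:

```python
# Hypothetical sketch of the queue pattern described above: the benchmark
# inserts at the head of the PK range while deleting the same number of rows
# from the tail. Each DELETE uses PK bounds, so the optimizer probes the
# index for the actual min value and must skip dead entries left by prior
# deletes until vacuum removes them.
def next_statements(head, tail, rows_per_txn):
    values = ", ".join("({})".format(head + i) for i in range(rows_per_txn))
    insert = "INSERT INTO t (pk_col) VALUES " + values
    delete = ("DELETE FROM t WHERE pk_col > {} AND pk_col < {}"
              .format(tail - 1, tail + rows_per_txn))
    # the head and tail both advance, so the live PK range keeps moving
    return insert, delete, head + rows_per_txn, tail + rows_per_txn

ins, dele, head, tail = next_statements(head=1000, tail=0, rows_per_txn=5)
```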
In this case the optimizer reads from the index to determine the true min or max value of the column referenced in the WHERE clause. When there are too many deleted index entries that have yet to be removed by vacuum, the optimizer spends too much time scanning past them.</div><div><br /></div><div>The problem shows up on the l.i2 benchmark step. The benchmark client sustains the same rate for inserts/s and delete/s so if deletes are too slow then the insert rate will also be too slow. The ratio of delete/s (and insert/s) for l.i2 relative to l.i1 is ~0.2 for the cached workload and ~0.05 for the IO-bound workload. <br /><br />The l.i1 benchmark step deletes more rows/statement so the optimizer overhead is more significant on the l.i2 step. The ratios are much larger for InnoDB and MyRocks (they have perf problems, just not this perf problem).</div><div><br /></div><div>The circumstances are:<br /></div><div><ul style="text-align: left;"><li>the table has a queue pattern (insert to one end, delete from the other)</li><li>the DELETE statements have <i>WHERE pk_col > $low-const and pk_col < $high-const</i> where $low-const and $high-const are integer constants and there is a PK on pk_col</li></ul><div>This workload creates much MVCC garbage that is co-located in the PK index and that is a much bigger problem for Postgres than for InnoDB or MyRocks. <br /><br />I hope for a Postgres storage engine that provides MVCC without vacuum. In theory, more frequent vacuum might help and the perf overhead from frequent vacuum might be OK for the heap table given the usage of visibility bits. But when vacuum then has to do a full index scan (no visibility bits there) then that is a huge cost which limits vacuum frequency.</div></div><p></p><p><b>Build + Configuration</b></p><div><div>See the <a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-postgres-9x-to.html">previous report</a> for more details. 
I used Postgres 16.2.</div><div><br />The configuration files for the SER4 server are in subdirectories <a href="https://github.com/mdcallag/mytools/tree/master/bench/conf/arc/mar24.bee.pg">from here</a>. Using the suffixes that distinguish the config file names, they are:</div><div><ul style="text-align: left;"><li>cx9a2_bee - base config</li><li>cx9a2a_bee - adds autovacuum_vacuum_cost_delay= 1ms</li><li>cx9a2b_bee - adds autovacuum_vacuum_cost_delay= 0</li><li>cx9a2c_bee - adds autovacuum_naptime= 1s</li><li>cx9a2e_bee - adds autovacuum_vacuum_scale_factor= 0.01</li><li>cx9a2f_bee - adds autovacuum_vacuum_insert_scale_factor= 0.01</li><li>cx9a2g_bee - adds autovacuum_vacuum_cost_limit= 8000</li><li>cx9a2acef_bee - combines cx9a2a, cx9a2c, cx9a2e, cx9a2f configs</li><li>cx9a2bcef_bee - combines cx9a2b, cx9a2c, cx9a2e, cx9a2f configs</li></ul></div><div><b>The Benchmark</b></div><div><br /></div><div>The benchmark is run with 1 client. It is <a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-postgres-9x-to.html">explained here</a> and was run in two setups</div><div><ul style="text-align: left;"><li>cached - database has 30M rows and fits in memory</li><li>IO-bound - database has 800M rows and is larger than memory.</li></ul></div><div>The test was run on two small servers that I have at home:</div><div><ul style="text-align: left;"><li>SER4 - Beelink SER4 with 8 cores, 16G RAM, Ubuntu 22.04 and XFS using 1 m.2 device</li><li>SER7 - Beelink SER7 with 8 cores, 32G RAM, Ubuntu 22.04 and XFS using 1 m.2 device. The CPU on the SER7 is a lot faster than the SER4.</li></ul></div><div>The benchmark steps are:<div><p></p><div><ul><li>l.i0</li><ul><li>insert X million rows per table in PK order. The table has a PK index but no secondary indexes. There is one connection per client. For SER4, X is 30M for cached and 800M for IO-bound. 
For SER7, X is 60M for cached and 800M for IO-bound.</li></ul><li>l.x</li><ul><li>create 3 secondary indexes per table. There is one connection per client.</li></ul><li>l.i1</li><ul><li>use 2 connections/client. One inserts Y rows and the other does deletes at the same rate as the inserts. Each transaction modifies 50 rows (big transactions). This step is run for a fixed number of inserts, so the run time varies depending on the insert rate. Y is 80M for cached and 4M for IO-bound.</li></ul><li>l.i2</li><ul><li>like l.i1 but each transaction modifies 5 rows (small transactions) and Y is 20M for cached and 1M for IO-bound.</li><li>Wait for X seconds after the step finishes to reduce variance during the read-write benchmark steps that follow.</li></ul><li>qr100</li><ul><li>use 3 connections/client. One does range queries for Z seconds and performance is reported for this. The second does 100 inserts/s and the third does 100 deletes/s. The second and third are less busy than the first. The range queries use covering secondary indexes. This step is run for a fixed amount of time. If the target insert rate is not sustained then that is considered to be an SLA failure. If the target insert rate is sustained then the step does the same number of inserts for all systems tested. 
Z is 3600 for cached and 1800 for IO-bound.</li></ul><li>qp100</li><ul><li>like qr100 except uses point queries on the PK index</li></ul><li>qr500</li><ul><li>like qr100 but the insert and delete rates are increased from 100/s to 500/s</li></ul><li>qp500</li><ul><li>like qp100 but the insert and delete rates are increased from 100/s to 500/s</li></ul><li>qr1000</li><ul><li>like qr100 but the insert and delete rates are increased from 100/s to 1000/s</li></ul><li>qp1000</li><ul><li>like qp100 but the insert and delete rates are increased from 100/s to 1000/s</li></ul></ul><div><div><b>Results: SER4 server</b></div><div><br /></div><div>The performance reports are here <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.mem.bee.pg/all.html#summary">for cached</a> and <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.pg/all.html#summary">for IO-bound</a>.</div><div><br /></div><div>The summary has 3 tables. The first shows absolute throughput by DBMS tested X benchmark step. The second has throughput relative to the version on the first row of the table. The third shows the background insert rate for benchmark steps with background inserts and all systems sustained the target rates. The second table makes it easy to see how performance changes over time.</div><div><br /></div><div>Below I use relative QPS to explain how performance changes. It is: (QPS for $me / QPS for $base) where $me is my version and $base is the version of the base case. When relative QPS is > 1.0 then performance improved over time. When it is < 1.0 then there are regressions. 
The Q in relative QPS measures: </div><div><ul><li>insert/s for l.i0, l.i1, l.i2</li><li>indexed rows/s for l.x</li><li>range queries/s for qr100, qr500, qr1000</li><li>point queries/s for qp100, qp500, qp1000</li></ul><div>Below I use colors to highlight the relative QPS values with <span style="background-color: #f4cccc;">red</span> for <= 0.95, <span style="background-color: #d9ead3;">green</span> for >= 1.05 and <span style="background-color: #eeeeee;">grey</span> for values between 0.95 and 1.05.</div><div><br /></div><div>From the summaries for <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.mem.bee.pg/all.html#summary">cached</a> and for <a href="https://mdcallag.github.io/reports/24_03_17.1u.1tno.io.bee.pg/all.html#summary">IO-bound</a>:</div></div><div><ul style="text-align: left;"><li>The base case uses the cx9a2_bee config</li><li>The different config files have no impact on performance for the l.i0 and l.x benchmark steps. They have a small impact for the qr* and qp* (read+write) benchmark steps. Because the impact is non-existent to small I ignore those to focus on l.i1 and l.i2.</li></ul><div>For l.i1 and l.i2 with a cached workload the different config files have some impact</div><ul style="text-align: left;"><li>The relative QPS, where Q means delete (and insert), ranges from 0.76 to 1.34 meaning a few made things slower and the best improved the delete/s rate by ~1.34X</li><li>The delete/s ratio for l.i2 vs l.i1 is 0.221 for the base case and the best improvement might be from the cx9a2f_bee config where the ratio increases to 0.265. 
But I was hoping to improve the ratio to 0.5 or larger so I was disappointed.</li></ul><div><div>For l.i1 and l.i2 with an IO-bound workload the different config files have no benefit</div><ul><li>Postgres 16.2 does ~2000 delete/s for the l.i1 step vs ~100/s for the l.i2 step</li></ul><div><div><b>Results: SER7 server</b></div><div><br /></div><div>The performance reports are here <a href="https://mdcallag.github.io/reports/24_03_18.1u.1tno.mem.ser7.pg/all.html">for cached</a> and <a href="https://mdcallag.github.io/reports/24_03_18.1u.1tno.io.ser7.pg/all.html">for IO-bound</a>. Results from the SER7 match results from the SER4 described above so I won't explain them.</div></div></div></div></div></div></div></div></div><div><br /></div>Mark Callaghanhttp://www.blogger.com/profile/09590445221922043181noreply@blogger.com0tag:blogger.com,1999:blog-9149523927864751087.post-76412388786064965992024-02-19T13:10:00.000-08:002024-02-20T08:13:06.952-08:00Perf regressions in Postgres from 9.0 to 16 with sysbench and a small server<p>This has results for sysbench vs Postgres on a small server. I have results for versions from 9.0 through 16. My <a href="https://smalldatum.blogspot.com/2023/09/perf-regressions-in-mysql-from-5621-to_18.html">last report</a> only went back to Postgres 11. The goal is to document where things get faster or slower over time for a low-concurrency and CPU-bound workload. The focus is on CPU regressions. 
</p><p>My results here aren't universal, but you have to start somewhere:</p><p></p><ul><li>The microbenchmarks here mostly measure CPU overheads</li><li>Things won't look the same with an IO-bound workload</li><li>Things won't look the same with a workload that has more concurrency </li><li>Things won't look the same with a workload that has complex queries</li></ul><div><b>Summaries</b></div><div><b><br /></b></div><div>Sections after this explain how the microbenchmark results are grouped.</div><div><br /></div><div><div>Comparing Postgres 16.2 with 9.0.23:<br /><ul><li>point query, part 1</li><ul><li>Postgres 16.2 is faster than 9.0.23 for all but one microbenchmark</li></ul><li>point query, part 2</li><ul><li>Postgres 16.2 is faster than 9.0.23 for all microbenchmarks</li></ul><li>range query, part 1 & part 2</li><ul><li>About half of the microbenchmarks are ~20% slower in 16.2 vs 9.0.23</li><li>The big regression occurs between 9.0 and 9.1</li><li>For part 2 where aggregation is done the problem is worse for shorter range scans</li></ul><li>writes</li><ul><li>Postgres 16.2 is faster than 9.0.23 for all microbenchmarks</li></ul></ul><div><br /></div><div><div>Comparing Postgres 16.2 with 10.23</div><div><ul><li>Postgres 16.2 is faster than 10.23 for all microbenchmarks</li></ul><div><br /></div><div><div>Comparing Postgres 16.2 with 14.10</div><div><ul><li>point query, part 1</li><ul><li>Postgres 16.2 is at most 4% slower than 14.10</li></ul><li>point query, part 2</li><ul><li>Postgres 16.2 is at most 1% slower than 14.10</li></ul><li>range query, part 1</li><ul><li>Postgres 16.2 is at most 5% slower than 14.10</li></ul><li>range query, part 2</li><ul><li>Postgres 16.2 is as fast or faster than 14.10</li></ul><li>writes</li><ul><li>Postgres 16.2 is at most 1% slower than 14.10</li></ul></ul></div></div></div></div></div><div><b>Build + Configuration</b></div></div><div><div><br /></div><div>I used these versions: 9.0.23, 9.1.24, 9.2.24, 9.3.25, 9.4.26, 9.5.25, 
9.6.24, 10.23, 11.22, 12.17, 13.13, 14.10, 14.11, 15.5, 15.6, 16.1 and 16.2.<br /><br />The configuration files are in the subdirectories named pg9, pg10, pg11, pg12, pg13, pg14, pg15 and pg16 <a href="https://github.com/mdcallag/mytools/tree/master/bench/conf/nuc8i7.ub1804">from here</a>. They are named <i>conf.diff.cx9a2_bee</i>.</div></div><div><br /></div><div><div><b>Benchmarks</b></div><div><br />I used sysbench and my usage is <a href="http://smalldatum.blogspot.com/2017/02/using-modern-sysbench-to-compare.html">explained here</a>. There are 42 microbenchmarks and each tests ~1 type of SQL statement and is run for 1200 seconds.</div><div><br /></div><div>Tests were run on a small server I have at home (<a href="http://smalldatum.blogspot.com/2022/10/small-servers-for-performance-testing-v4.html">see here</a>). The server is an SER4 from Beelink with 8 cores, 16G of RAM and 1 m.2 storage device with XFS and Ubuntu 22.04. The test tables are cached by Postgres.<br /><br />The benchmark is run with:<br /><ul><li>one connection</li><li>30M rows and a database cached by Postgres</li><li>each microbenchmark runs for 1200 seconds</li><li>prepared statements were enabled</li></ul></div><div>The command line was: <span style="font-family: courier;">bash r.sh 1 30000000 1200 1200 nvme0n1 1 1 1</span></div></div><div><span style="font-family: courier;"><br /></span></div><div><span><div style="font-family: Times;"><b>Results</b></div><div style="font-family: Times;"><br /></div><div style="font-family: Times;">For the results below I split the microbenchmarks into 5 groups -- 2 for point queries, 2 for range queries, 1 for writes. For the range query microbenchmarks, part 1 has queries that don't do aggregation while part 2 has queries that do aggregation. Unfortunately, I included the full scan microbenchmark (scan_range=100) in part 2 but it doesn't do aggregation. 
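The per-group summary statistics reported later (min, max, avg, median, stdev over a group's relative QPS values) can be computed as in this sketch; the sample values are hypothetical:

```python
# Sketch of the per-group summary statistics over relative QPS values.
# The input list is made up, not data from the reports.
import statistics

def summarize(rel_qps):
    return {
        "min": min(rel_qps),
        "max": max(rel_qps),
        "avg": statistics.mean(rel_qps),
        "median": statistics.median(rel_qps),
        "stdev": statistics.stdev(rel_qps),
    }

point_1 = [0.92, 1.10, 1.20, 1.25, 1.32]  # hypothetical relative QPS values
s = summarize(point_1)
```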
The spreadsheet with all data and charts <a href="https://docs.google.com/spreadsheets/d/12re8b7ZZJF8URErz1k7U85dg_hswocqNTm2MoOtG3MI/edit?usp=sharing">is here</a> and is easier to read.</div><div style="font-family: Times;"><br />All of the charts have relative throughput on the y-axis where that is (QPS for $me) / (QPS for $base), where $me is a version (for example 9.1.24) and $base is the base version. The base version is specified below and one of 9.0.23, 10.23 and 14.10 depending on what I am comparing. The y-axis doesn't start at 0 to improve readability.</div><div style="font-family: Times;"><br /></div><div style="font-family: Times;">The legend under the x-axis truncates the names I use for the microbenchmark and I don't know how to fix that other than <a href="https://docs.google.com/spreadsheets/d/12re8b7ZZJF8URErz1k7U85dg_hswocqNTm2MoOtG3MI/edit?usp=sharing">sharing the link</a> to the Google Sheet I used. Files I used to create the spreadsheets <a href="https://github.com/mdcallag/mytools/tree/master/bench/arc/feb24.1u.bee.sb.pg.mem">are here</a>.</div><div style="font-family: Times;"><br /></div><div style="font-family: Times;"><b>Results: from 9.0 through 16.2</b></div><div style="font-family: Times;"><br /></div><div style="font-family: Times;">Summary:<br /><ul style="text-align: left;"><li>point query, part 1</li><ul><li>Postgres 16.2 is faster than 9.0.23 for all but one microbenchmark</li></ul><li>point query, part 2</li><ul><li>Postgres 16.2 is faster than 9.0.23 for all microbenchmarks</li></ul><li>range query, part 1 & part 2</li><ul><li>About half of the microbenchmarks are ~20% slower in 16.2 vs 9.0.23</li><li>The big regression occurs between 9.0 and 9.1</li><li>For part 2 where aggregation is done the problem is worse for shorter range scans</li></ul><li>writes</li><ul><li>Postgres 16.2 is faster than 9.0.23 for all microbenchmarks</li></ul></ul><div>This table has summary statistics from Postgres 16.2 for each microbenchmark group. 
The numbers represent the relative QPS (relative to 9.0.23) and a value > 1 means that 16.2 is faster than 9.0.23.</div><div><br /></div><div><table border="1" cellpadding="3" cellspacing="0" style="border-collapse: collapse;"><tbody><tr><td></td><td>min</td><td>max</td><td>avg</td><td>median</td><td>stdev</td></tr><tr><td>point-1</td><td align="right">0.92</td><td align="right">1.32</td><td align="right">1.18</td><td align="right">1.20</td><td align="right">0.12</td></tr><tr><td>point-2</td><td align="right">1.07</td><td align="right">1.18</td><td align="right">1.12</td><td align="right">1.13</td><td align="right">0.04</td></tr><tr><td>range-1</td><td align="right">0.77</td><td align="right">1.68</td><td align="right">1.09</td><td align="right">1.00</td><td align="right">0.37</td></tr><tr><td>range-2</td><td align="right">0.78</td><td align="right">1.34</td><td align="right">1.01</td><td align="right">0.85</td><td align="right">0.25</td></tr><tr><td>writes</td><td align="right">1.11</td><td align="right">4.64</td><td align="right">2.21</td><td align="right">1.97</td><td align="right">1.10</td></tr></tbody></table></div></div><div class="separator" style="clear: both; font-family: courier; text-align: center;"><a 
href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhx3xLLsdbHfZQpnLTNsYiG8r3iFFVE269MTSmhPyvwfzvM8iZifIcIc52AfIPQZCa83MFuW8oG-IUyp7OJuKLtEoZyBRYzQVIYl4I4I0ICYuDkbiyZrtj5e1GqkHSA0iy_FIPi1vPm8KK3EfhgZwE0QgcWf0U2xG93D7eF2ChnLH-QB2Q2HoL32o5UFySW/s600/Point%20query,%20part%201_%20QPS%20relative%20to%20PG%209.0.23.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhx3xLLsdbHfZQpnLTNsYiG8r3iFFVE269MTSmhPyvwfzvM8iZifIcIc52AfIPQZCa83MFuW8oG-IUyp7OJuKLtEoZyBRYzQVIYl4I4I0ICYuDkbiyZrtj5e1GqkHSA0iy_FIPi1vPm8KK3EfhgZwE0QgcWf0U2xG93D7eF2ChnLH-QB2Q2HoL32o5UFySW/w640-h396/Point%20query,%20part%201_%20QPS%20relative%20to%20PG%209.0.23.png" width="640" /></a></div><div class="separator" style="clear: both; font-family: courier; text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEit9YB4db8zzONIWKNdE4oh7jQmqofTGPExECkI1oJhzX0bhgGjSKxnPhJFle85d5VIbc_gPXj4iaorJK2b5MSZV5N2Q6fdB52OdQf4T-gCIYpDdRs969f6rTbT31VfSkV1GOiADABCEkwCiH4JRxSBGz8PqwR-yt27gl-nVsweMg8RS9oR5-vE5UsQWAiF/s600/Point%20query,%20part%202_%20QPS%20relative%20to%20PG%209.0.23.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEit9YB4db8zzONIWKNdE4oh7jQmqofTGPExECkI1oJhzX0bhgGjSKxnPhJFle85d5VIbc_gPXj4iaorJK2b5MSZV5N2Q6fdB52OdQf4T-gCIYpDdRs969f6rTbT31VfSkV1GOiADABCEkwCiH4JRxSBGz8PqwR-yt27gl-nVsweMg8RS9oR5-vE5UsQWAiF/w640-h396/Point%20query,%20part%202_%20QPS%20relative%20to%20PG%209.0.23.png" width="640" /></a></div><div class="separator" style="clear: both; font-family: courier; text-align: center;"><a 
href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEglv6pkRQhSlzOCCkAlk-Rh6skYYAwDNitj2oLGughIFNg23TSstLjksZ81gjD6HEWl1rIe2st8Oay9080RGCnxtJcfxd_I1xdGmG14IOII9RQzV6Rs53huNegsAM-ds4rTRAaNKVaL16PxiJpHaNXh3EFdXEl9MrL_Lv13Mlp-uwF8ZlfdTnQT_y4ga44N/s600/Range%20query,%20part%201_%20QPS%20relative%20to%20PG%209.0.23.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEglv6pkRQhSlzOCCkAlk-Rh6skYYAwDNitj2oLGughIFNg23TSstLjksZ81gjD6HEWl1rIe2st8Oay9080RGCnxtJcfxd_I1xdGmG14IOII9RQzV6Rs53huNegsAM-ds4rTRAaNKVaL16PxiJpHaNXh3EFdXEl9MrL_Lv13Mlp-uwF8ZlfdTnQT_y4ga44N/w640-h396/Range%20query,%20part%201_%20QPS%20relative%20to%20PG%209.0.23.png" width="640" /></a></div><div class="separator" style="clear: both; font-family: courier; text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiE55AARbRksCszqpxvHMhoZjWerNgTJ6YUPA9gXRUgA2dGGCwRwMEidV7aKtLZ4RCd6f7ZuwPoNVufZWQhCuPuwD_mkDmDnS43bYWtMj_IglkK-yHLguZAqslKWJndz7Zi_7dTt_mjAIm0HAJbla4EhsmvzjefoHDlmmKZgx7uM3dRecgqnwhFdvCgoIaH/s600/Range%20query,%20part%202_%20QPS%20relative%20to%20PG%209.0.23.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiE55AARbRksCszqpxvHMhoZjWerNgTJ6YUPA9gXRUgA2dGGCwRwMEidV7aKtLZ4RCd6f7ZuwPoNVufZWQhCuPuwD_mkDmDnS43bYWtMj_IglkK-yHLguZAqslKWJndz7Zi_7dTt_mjAIm0HAJbla4EhsmvzjefoHDlmmKZgx7uM3dRecgqnwhFdvCgoIaH/w640-h396/Range%20query,%20part%202_%20QPS%20relative%20to%20PG%209.0.23.png" width="640" /></a></div><div class="separator" style="clear: both; font-family: courier; text-align: center;"><a 
href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjllhBRTPr5MZKGV3wOpRXaCjJJyjeiGL01vglMeTXqosfhAnE4u3110AeNuWj0pVnlN8hHr3zDF7P5o6K1xt6kgOJWEY_e0kwsL54QUvBoZI05iBwU_Kw2YEbShR8qEkSWiKOK5HDCaNQK1GdrhIqTJ8bWo6t0vCF4XLM5zY-P-eCtuz5b05FaPO8o0WQb/s600/Writes_%20QPS%20relative%20to%20PG%209.0.23.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjllhBRTPr5MZKGV3wOpRXaCjJJyjeiGL01vglMeTXqosfhAnE4u3110AeNuWj0pVnlN8hHr3zDF7P5o6K1xt6kgOJWEY_e0kwsL54QUvBoZI05iBwU_Kw2YEbShR8qEkSWiKOK5HDCaNQK1GdrhIqTJ8bWo6t0vCF4XLM5zY-P-eCtuz5b05FaPO8o0WQb/w640-h396/Writes_%20QPS%20relative%20to%20PG%209.0.23.png" width="640" /></a></div><div style="font-family: Times;"><b>Results: from 10.23 through 16.2</b></div><div style="font-family: Times;"><br /></div><div style="font-family: Times;">Summary</div><div style="font-family: Times;"><ul style="text-align: left;"><li>Postgres 16.2 is faster than 10.23 for all microbenchmarks</li></ul><div>This table has summary statistics from Postgres 16.2 for each microbenchmark group. 
The numbers represent the relative QPS (relative to 10.23) and a value > 1 means that 16.2 is faster than 10.23.</div><div><br /></div><div><table border="1" cellpadding="3" cellspacing="0" style="border-collapse: collapse;"><tbody><tr><td></td><td>min</td><td>max</td><td>avg</td><td>median</td><td>stdev</td></tr><tr><td>point-1</td><td align="right">1.02</td><td align="right">1.11</td><td align="right">1.07</td><td align="right">1.08</td><td align="right">0.03</td></tr><tr><td>point-2</td><td align="right">1.04</td><td align="right">1.08</td><td align="right">1.06</td><td align="right">1.06</td><td align="right">0.01</td></tr><tr><td>range-1</td><td align="right">1.07</td><td align="right">1.13</td><td align="right">1.10</td><td align="right">1.10</td><td align="right">0.02</td></tr><tr><td>range-2</td><td align="right">1.04</td><td align="right">1.09</td><td align="right">1.06</td><td align="right">1.05</td><td align="right">0.02</td></tr><tr><td>writes</td><td align="right">1.02</td><td align="right">1.15</td><td align="right">1.08</td><td align="right">1.06</td><td align="right">0.04</td></tr></tbody></table></div></div><div style="font-family: Times;"><div class="separator" style="clear: both; text-align: center;"><a 
href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhu360iae-DX82eB4RhUzZi8nsgVrEeRIzwRRzfze-JW-LT3hm_6yWED71MsArMMLSoox5ylxlZVRxNxso9VomhbO-W7mMm9fvTccTHhUSqrZ3ZVGMM4FK2A1OgJS-6s6hLHTpbVu-Z9PeX6k62bUCbT1Wegm3kLwEjZgRCykDLZWsMvhyphenhyphen2keSTQxkDGiTy/s600/Point%20query,%20part%201_%20QPS%20relative%20to%20PG%2010.23.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhu360iae-DX82eB4RhUzZi8nsgVrEeRIzwRRzfze-JW-LT3hm_6yWED71MsArMMLSoox5ylxlZVRxNxso9VomhbO-W7mMm9fvTccTHhUSqrZ3ZVGMM4FK2A1OgJS-6s6hLHTpbVu-Z9PeX6k62bUCbT1Wegm3kLwEjZgRCykDLZWsMvhyphenhyphen2keSTQxkDGiTy/w640-h396/Point%20query,%20part%201_%20QPS%20relative%20to%20PG%2010.23.png" width="640" /></a></div><div class="separator" style="clear: both; text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiDgTQqMQMcnFyEdgMyw5FFLepxOyrFthmHe5jVL8mbNtUEWirFfja0PyAdVKtv0puoks_mwcXU4x2a024WYG5TJTVS6zcmZW2M_tO9cICS9wSNl6Ni1TOn40oqW-3BQbwrMqEPE0JqAstL1acbKraExaxVj_zQotTBfO0Zu4vmmhwIdlEH9ado35M4ajvF/s600/Point%20query,%20part%202_%20QPS%20relative%20to%20PG%2010.23.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiDgTQqMQMcnFyEdgMyw5FFLepxOyrFthmHe5jVL8mbNtUEWirFfja0PyAdVKtv0puoks_mwcXU4x2a024WYG5TJTVS6zcmZW2M_tO9cICS9wSNl6Ni1TOn40oqW-3BQbwrMqEPE0JqAstL1acbKraExaxVj_zQotTBfO0Zu4vmmhwIdlEH9ado35M4ajvF/w640-h396/Point%20query,%20part%202_%20QPS%20relative%20to%20PG%2010.23.png" width="640" /></a></div><div class="separator" style="clear: both; text-align: center;"><a 
href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEi4IeE1BUMMXGKga6y2icv32bh2SS3kSzGxq2yold9b2MMRzwehDs8gdQhoxHKHFFNnxrNEGf4thIhIVzsxMJ-0n5J_K-k1v7zstqIV18QPmB0sevUHTpBIVubpKl_tlppZiIbY5W58PdPuT1ht5qTy4p1M8TffW2guscN2edTKv79db0Nf-kq_CY2PCRIU/s600/Range%20query,%20part%201_%20QPS%20relative%20to%20PG%2010.23.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEi4IeE1BUMMXGKga6y2icv32bh2SS3kSzGxq2yold9b2MMRzwehDs8gdQhoxHKHFFNnxrNEGf4thIhIVzsxMJ-0n5J_K-k1v7zstqIV18QPmB0sevUHTpBIVubpKl_tlppZiIbY5W58PdPuT1ht5qTy4p1M8TffW2guscN2edTKv79db0Nf-kq_CY2PCRIU/w640-h396/Range%20query,%20part%201_%20QPS%20relative%20to%20PG%2010.23.png" width="640" /></a></div><div class="separator" style="clear: both; text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjCRwkGN6yLExSc5c7F-t_lD4xuyyr1EHs91LE8YRoSvEilvHM4tFTqFO9Jlu0heeloAr1nIm_Jb4RDj7X5Ac47QKaM-OMjWtg1q1cwe2b1_1TyJ2rjPwuHQTs-P0PILHUxc0QxWUIWfTeF3Sbcy6F-lJ8WwPO0fYvDTphyphenhyphenUAZ89TY_3iZzwujbfnvfTPWf/s600/Range%20query,%20part%202_%20QPS%20relative%20to%20PG%2010.23.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjCRwkGN6yLExSc5c7F-t_lD4xuyyr1EHs91LE8YRoSvEilvHM4tFTqFO9Jlu0heeloAr1nIm_Jb4RDj7X5Ac47QKaM-OMjWtg1q1cwe2b1_1TyJ2rjPwuHQTs-P0PILHUxc0QxWUIWfTeF3Sbcy6F-lJ8WwPO0fYvDTphyphenhyphenUAZ89TY_3iZzwujbfnvfTPWf/w640-h396/Range%20query,%20part%202_%20QPS%20relative%20to%20PG%2010.23.png" width="640" /></a></div><div class="separator" style="clear: both; text-align: center;"><a 
href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiMei5_2jGMsv45eQ2UhL0UixSeKr8TvlVwscNuteQ-ixVEvKa8AG7PL88-A78KSXAhGiRaDAcxerKifQLfpF2uEXYsvPUHtErPeUiuQzBRgj5QtVXBHjYgU9oFnwP91dJMZpgoJTPDUNVRX59deJlNA65IirA-DL8D2yRL3DiXgMR_5MwbBukXSjHrn7SU/s600/Writes_%20QPS%20relative%20to%20PG%2010.23.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiMei5_2jGMsv45eQ2UhL0UixSeKr8TvlVwscNuteQ-ixVEvKa8AG7PL88-A78KSXAhGiRaDAcxerKifQLfpF2uEXYsvPUHtErPeUiuQzBRgj5QtVXBHjYgU9oFnwP91dJMZpgoJTPDUNVRX59deJlNA65IirA-DL8D2yRL3DiXgMR_5MwbBukXSjHrn7SU/w640-h396/Writes_%20QPS%20relative%20to%20PG%2010.23.png" width="640" /></a></div></div><div style="font-family: Times;"><br /></div><div style="font-family: Times;"><b>Results: 14.10, 14.11, 15.5, 15.6, 16.1, 16.2</b></div><div style="font-family: Times;"><br /></div><div style="font-family: Times;">Summary</div><div style="font-family: Times;"><ul style="text-align: left;"><li>point query, part 1</li><ul><li>Postgres 16.2 is at most 4% slower than 14.10</li></ul><li>point query, part 2</li><ul><li>Postgres 16.2 is at most 1% slower than 14.10</li></ul><li>range query, part 1</li><ul><li>Postgres 16.2 is at most 5% slower than 14.10</li></ul><li>range query, part 2</li><ul><li>Postgres 16.2 is as fast or faster than 14.10</li></ul><li>writes</li><ul><li>Postgres 16.2 is at most 1% slower than 14.10</li></ul></ul><div>This table has summary statistics from Postgres 16.2 for each microbenchmark group. 
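The "at most N% slower" statements in the summary above follow directly from the minimum relative QPS per group. A small sketch of that conversion, using the minimums from the 16.2 vs 14.10 comparison:

```python
# Convert a relative QPS (QPS for new version / QPS for base version)
# into a "percent slower" figure for the new version.
def pct_slower(relative_qps: float) -> float:
    """Return how much slower the new version is, as a percentage."""
    return round((1.0 - relative_qps) * 100.0, 1)

# point query, part 1: min relative QPS is 0.96 -> at most 4% slower
assert pct_slower(0.96) == 4.0
# range query, part 1: min relative QPS is 0.95 -> at most 5% slower
assert pct_slower(0.95) == 5.0
```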
The numbers represent the relative QPS (relative to 14.10) and a value > 1 means that 16.2 is faster than 14.10.<br /><br /></div><div><table border="1" cellpadding="3" cellspacing="0" style="border-collapse: collapse;"><tbody><tr><td></td><td>min</td><td>max</td><td>avg</td><td>median</td><td>stdev</td></tr><tr><td>point-1</td><td align="right">0.96</td><td align="right">1.07</td><td align="right">1.00</td><td align="right">1.00</td><td align="right">0.03</td></tr><tr><td>point-2</td><td align="right">0.99</td><td align="right">1.00</td><td align="right">1.00</td><td align="right">1.00</td><td align="right">0.01</td></tr><tr><td>range-1</td><td align="right">0.95</td><td align="right">1.00</td><td align="right">0.98</td><td align="right">0.99</td><td align="right">0.02</td></tr><tr><td>range-2</td><td align="right">1.00</td><td align="right">1.07</td><td align="right">1.02</td><td align="right">1.00</td><td align="right">0.03</td></tr><tr><td>writes</td><td align="right">0.99</td><td align="right">1.04</td><td align="right">1.01</td><td align="right">1.02</td><td align="right">0.01</td></tr></tbody></table></div></div><div class="separator" style="clear: both; font-family: courier; text-align: center;"><br /></div><div class="separator" style="clear: both; text-align: left;"><br /></div><div class="separator" style="clear: both; text-align: left;"><div class="separator" style="clear: both; text-align: center;"><a 
href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEipz0XvtjwRYEZ9curT5BwB15cTfmgid_RTxPEs9dI2mHwIdzwhTdY1fd6aXNuoN88tEzKUJKt6omjtBdzdqjotSp2kayWgmT5HT9nAwBpibNvfF3N1T4bJZekK7UL72gj_hLGWZAxHa5eKU8kKOesZkbYDiPPRJMKZdDqcQwBK3XUgztM1XFV6Dq9qkLt7/s600/Point%20query,%20part%201_%20QPS%20relative%20to%20PG%2014.10.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEipz0XvtjwRYEZ9curT5BwB15cTfmgid_RTxPEs9dI2mHwIdzwhTdY1fd6aXNuoN88tEzKUJKt6omjtBdzdqjotSp2kayWgmT5HT9nAwBpibNvfF3N1T4bJZekK7UL72gj_hLGWZAxHa5eKU8kKOesZkbYDiPPRJMKZdDqcQwBK3XUgztM1XFV6Dq9qkLt7/w640-h396/Point%20query,%20part%201_%20QPS%20relative%20to%20PG%2014.10.png" width="640" /></a></div></div><div class="separator" style="clear: both; text-align: left;"><div class="separator" style="clear: both; text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjpFTNBpZk3D-8FxEw9w32PaZ3I7Y8nZAjp_vm_Ooboo3NKOH3iyFac_RBorU3KioCVkf41EdWQOjSbGS6AsW0D8EnO2rfM3JhkoI2IXhjhpNaYIDjCgVdWw9HL_KNU-Ac6rJWc1kgx_yvsB-ERP1tJHKcG-o-EeKEiWngO2FxeGhGURahgga6lAqi-pmiz/s600/Point%20query,%20part%202_%20QPS%20relative%20to%20PG%2014.10.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjpFTNBpZk3D-8FxEw9w32PaZ3I7Y8nZAjp_vm_Ooboo3NKOH3iyFac_RBorU3KioCVkf41EdWQOjSbGS6AsW0D8EnO2rfM3JhkoI2IXhjhpNaYIDjCgVdWw9HL_KNU-Ac6rJWc1kgx_yvsB-ERP1tJHKcG-o-EeKEiWngO2FxeGhGURahgga6lAqi-pmiz/w640-h396/Point%20query,%20part%202_%20QPS%20relative%20to%20PG%2014.10.png" width="640" /></a></div></div><div class="separator" style="clear: both; text-align: left;"><div class="separator" style="clear: both; text-align: center;"><a 
href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiiiVJjl4FYIDW7d60aHjhpQ1BC63MKrA_qWZFvUNjgryTIJYKJE4XA9j2YGiarmG5OwrLr89pBEHQQovkxtR9dCJ38pVMCfz1erl9tJJ5uDAwiixsKZAQoUOiinFoevCDR-h5kLwJDbI5EMWndrDjTcELKBqyBtjUcHSdLyEvVeQ4I0vQM8nOsFJz8S-zX/s600/Range%20query,%20part%201_%20QPS%20relative%20to%20PG%2014.10.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiiiVJjl4FYIDW7d60aHjhpQ1BC63MKrA_qWZFvUNjgryTIJYKJE4XA9j2YGiarmG5OwrLr89pBEHQQovkxtR9dCJ38pVMCfz1erl9tJJ5uDAwiixsKZAQoUOiinFoevCDR-h5kLwJDbI5EMWndrDjTcELKBqyBtjUcHSdLyEvVeQ4I0vQM8nOsFJz8S-zX/w640-h396/Range%20query,%20part%201_%20QPS%20relative%20to%20PG%2014.10.png" width="640" /></a></div></div><div class="separator" style="clear: both; text-align: left;"><div class="separator" style="clear: both; text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEg2ELvMD5GAqDVa9XEJHHWtLIDiN_0z0T2_YakT6QL4UbM8wvIDedIaknDrKV7vtasXd5f2Jx5RBSy_K4VCG7a5YOfq9Drr8qwxKEOCQT_si9ewgwHP9YbSMY0QTyuWFsSXDdAq9cxt6Gyhdxp_xvhrMsYxgEisms_Fc4mLX3oDFA2dCdmowVSZul8xll_P/s600/Range%20query,%20part%202_%20QPS%20relative%20to%20PG%2014.10.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEg2ELvMD5GAqDVa9XEJHHWtLIDiN_0z0T2_YakT6QL4UbM8wvIDedIaknDrKV7vtasXd5f2Jx5RBSy_K4VCG7a5YOfq9Drr8qwxKEOCQT_si9ewgwHP9YbSMY0QTyuWFsSXDdAq9cxt6Gyhdxp_xvhrMsYxgEisms_Fc4mLX3oDFA2dCdmowVSZul8xll_P/w640-h396/Range%20query,%20part%202_%20QPS%20relative%20to%20PG%2014.10.png" width="640" /></a></div></div><div class="separator" style="clear: both; text-align: left;"><div class="separator" style="clear: both; text-align: center;"><a 
href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEj5y9kAm-nt-anVMh0sRyPvpwb-_CYCaGTRADQFSIS_Z3Un5ooWLGqXSMbkN3RO6CoVF_KpnPf8U0sxfpT3vAASvNDxlv20goJMkG2hHTnyIBJP0RC0FFribx2zyL9nAEeEq1I1VIMpqjTrTh33g6bWJHFquVETiABkMfdWmH2sPBBKtEn9C04D_B-Mdzf-/s600/Writes_%20QPS%20relative%20to%20PG%2014.10.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEj5y9kAm-nt-anVMh0sRyPvpwb-_CYCaGTRADQFSIS_Z3Un5ooWLGqXSMbkN3RO6CoVF_KpnPf8U0sxfpT3vAASvNDxlv20goJMkG2hHTnyIBJP0RC0FFribx2zyL9nAEeEq1I1VIMpqjTrTh33g6bWJHFquVETiABkMfdWmH2sPBBKtEn9C04D_B-Mdzf-/w640-h396/Writes_%20QPS%20relative%20to%20PG%2014.10.png" width="640" /></a></div></div><div class="separator" style="clear: both; font-family: courier; text-align: center;"><br /></div><div class="separator" style="clear: both; font-family: courier; text-align: center;"><br /></div></span><div class="separator" style="clear: both; text-align: center;"><br /></div><br /></div><div class="separator" style="clear: both; text-align: center;"><br /></div><br />Mark Callaghanhttp://www.blogger.com/profile/09590445221922043181noreply@blogger.com2tag:blogger.com,1999:blog-9149523927864751087.post-21472518584142610882024-02-16T19:52:00.000-08:002024-02-19T12:37:11.955-08:00Perf regressions in MySQL from 5.6.21 to 8.0.36 using sysbench and a small server<p>This has results for sysbench vs upstream MySQL on a small server. I have results for some 5.6, 5.7 and 8.0 releases up to 8.0.36. My <a href="https://smalldatum.blogspot.com/2023/09/perf-regressions-in-mysql-from-5621-to_18.html">last report</a> stopped at 8.0.34. The goal is to document where things get faster or slower over time for a low-concurrency and CPU-bound workload. The focus is on CPU regressions. </p><p>My results here aren't universal. 
</p><p></p><ul><li>The microbenchmarks here mostly measure CPU overheads</li><li>Things won't look the same with an IO-bound workload. If nothing else, that will make many of the CPU regressions less significant.</li><li>Things won't look the same with a workload that has more concurrency. While MySQL tends to get slower over time from more CPU overhead, it also gets faster over time on concurrent workloads from improvements to synchronization code. Results from a few months ago on a larger server <a href="http://smalldatum.blogspot.com/2023/04/perf-regressions-in-mysqlinnodb-big.html">are here</a> and the regressions are much smaller.</li><li>Things won't look the same with a workload that has complex queries. Most of the queries used by sysbench are simple and short-running. This amplifies the impact of perf regressions in parsing, semantic analysis and query optimization.</li></ul><p></p><p>tl;dr</p><p></p><ul><li>Upstream MySQL would benefit from changepoint detection as provided <a href="https://nyrk.io/">by Nyrkiö</a>.</li><li>MySQL 8.0 is the worst for perf regressions, while 5.7 and 5.6 are better at avoiding them. Also, there tend to be large regressions between the last point release in one major version and the first point release in the following major version, for instance from 5.6.51 to 5.7.10.</li><li>The scan_range=100 microbenchmark that does a full table scan has a large regression from 8.0.28 to 8.0.36 and <a href="https://bugs.mysql.com/bug.php?id=111538">bug 111538</a> is open for this.</li></ul><div>Comparing 8.0.36 with 5.6.21<br /><ul><li>For point queries, 8.0.36 gets 19% to 39% less QPS than 5.6.21</li><li>For range queries that don't do aggregation (part 1), 8.0.36 gets 29% to 39% less QPS than 5.6.21</li><li>For range queries that do aggregation, 8.0.36 gets 3% to 45% less QPS than 5.6.21.
The difference depends on the length of the range scan, where shorter scan == larger regression.</li><li>Full scan (scan_range=100) has the largest regression (5.6.21 is ~2X faster than 8.0.36)</li><li>For most writes (ignoring the <i>update-index</i> microbenchmark), 8.0.36 gets about half of the throughput compared to 5.6.21</li></ul><div><b>Builds</b></div></div><div><div><br /></div><div>It isn't easy to build older code on newer systems, compilers, etc. Notes on that are here <a href="https://twitter.com/MarkCallaghanDB/status/1700551813861449859">for 5.6</a>, <a href="http://smalldatum.blogspot.com/2022/11/compiling-mysql-56-57-on-ubuntu-2204.html">for 5.6 and 5.7</a>, <a href="http://smalldatum.blogspot.com/2023/08/compiling-all-mysql-57-versions-on.html">for 5.7</a> and <a href="http://smalldatum.blogspot.com/2023/05/compiling-all-releases-of-mysql-80.html">for 8.0</a>. A note on using cmake is <a href="http://smalldatum.blogspot.com/2023/02/cmakebuildtype-relwithdebinfo-vs.html">here</a>. The <i>rel</i> builds were used -- everything was compiled using CMAKE_BUILD_TYPE=Release.</div><div><br /></div><div>Tests were done for:</div><div><ul><li>5.6 - 5.6.21, 5.6.31, 5.6.41, 5.6.51</li><li>5.7 - 5.7.10, 5.7.20, 5.7.30, 5.7.44</li><li>8.0 - 8.0.13, 8.0.14, 8.0.20, 8.0.28, 8.0.35, 8.0.36</li></ul><div>I used the cz10a_bee config and it is here for <a href="https://github.com/mdcallag/mytools/blob/master/bench/conf/nuc8i7.ub1804/my56/my.cnf.cz10a_bee">5.6</a>, <a href="https://github.com/mdcallag/mytools/blob/master/bench/conf/nuc8i7.ub1804/my57/my.cnf.cz10a_bee.8.to.44">5.7</a> and 8.0 (<a href="https://github.com/mdcallag/mytools/blob/master/bench/conf/nuc8i7.ub1804/my80/etc/my.cnf.cz10a_bee.11.to.18">here</a> and <a href="https://github.com/mdcallag/mytools/blob/master/bench/conf/nuc8i7.ub1804/my80/etc/my.cnf.cz10a_bee.19.to.34">here</a>). 
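One reason a single config file can be shared across many releases is MySQL's <span style="font-family: courier;">loose_</span> option prefix, which downgrades an unrecognized startup option from a fatal error to a warning. A hypothetical my.cnf fragment (the value shown is illustrative):

```ini
[mysqld]
# The loose_ prefix tells the server to warn, rather than fail to
# start, when it does not recognize the option that follows. That
# lets one config file work across releases that add or remove
# a variable such as innodb_idle_flush_pct.
loose_innodb_idle_flush_pct=1
```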
For 8.0 releases older than 8.0.19 I changed innodb_idle_flush_pct=1 to loose_innodb_idle_flush_pct=1.</div><div><br /></div><div><b>Benchmarks</b></div><div><br />I used sysbench and my usage is <a href="http://smalldatum.blogspot.com/2017/02/using-modern-sysbench-to-compare.html">explained here</a>. There are 42 microbenchmarks; each tests roughly one type of SQL statement and runs for 1200 seconds.</div><div><br /></div><div>Tests were run on a small server I have at home (<a href="http://smalldatum.blogspot.com/2022/10/small-servers-for-performance-testing-v4.html">see here</a>). The server is an SER4 from Beelink with 8 cores, 16G of RAM and one m.2 storage device with XFS and Ubuntu 22.04. The test tables are cached by InnoDB.<br /><br />The benchmark is run with:<br /><ul style="text-align: left;"><li>one connection</li><li>30M rows and a database cached by InnoDB</li><li>each microbenchmark runs for 1200 seconds</li><li>prepared statements were enabled</li></ul></div><div>The command line was: <span style="font-family: courier;">bash r.sh 1 30000000 1200 1200 nvme0n1 1 1 1</span></div><div><br /></div><div><b>Results</b></div><div><br /></div><div>For the results below I split the microbenchmarks into 5 groups -- 2 for point queries, 2 for range queries, 1 for writes. For the range query microbenchmarks, part 1 has queries that don't do aggregation while part 2 has queries that do aggregation. Unfortunately, I included the full scan microbenchmark (scan_range=100) in part 2 even though it doesn't do aggregation. The spreadsheet with all data and charts <a href="https://docs.google.com/spreadsheets/d/1N4XNuoScXElivMeNiSGvcmY3cM0Y6MDfKgD9Vdv8lmY/edit?usp=sharing">is here</a> and is easier to read.</div><div><br />All of the charts have relative throughput on the y-axis where that is (QPS for $me) / (QPS for $base); $me is a version (for example 5.7.20) and $base is the base version.
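That relative throughput metric, and the min/max/avg/median/stdev summary tables below, can be sketched as follows (the QPS numbers here are invented for illustration, not measured results):

```python
import statistics

# Invented QPS for one microbenchmark, by MySQL version.
qps = {"5.6.21": 4840, "5.7.10": 4100, "8.0.13": 3650, "8.0.36": 3390}
base = "5.6.21"

# Relative throughput as plotted: (QPS for $me) / (QPS for $base).
# Values below 1.0 mean $me is slower than the base version.
relative = {v: round(q / qps[base], 2) for v, q in qps.items()}

# Summary statistics over the non-base versions, matching the
# columns of the tables below.
vals = [r for v, r in relative.items() if v != base]
summary = {
    "min": min(vals),
    "max": max(vals),
    "avg": round(statistics.mean(vals), 2),
    "median": round(statistics.median(vals), 2),
    "stdev": round(statistics.stdev(vals), 2),
}
```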
The base version is specified below and is one of 5.6.21, 5.7.10 and 8.0.13 depending on what I am comparing. The y-axis doesn't start at 0 to improve readability.</div><div><br /></div><div>The legend under the x-axis truncates the names I use for the microbenchmarks and I don't know how to fix that other than <a href="https://docs.google.com/spreadsheets/d/1N4XNuoScXElivMeNiSGvcmY3cM0Y6MDfKgD9Vdv8lmY/edit?usp=sharing">sharing the link</a> to the Google Sheet I used. Files I used to create the spreadsheets <a href="https://github.com/mdcallag/mytools/tree/master/bench/arc/feb24.bee.sysbench.my">are here</a>.</div><p></p><p><b>From 5.6.21 to 8.0.36</b></p><p>This section uses 5.6.21 as the base version and then compares it with 5.6.51, 5.7.10, 5.7.44, 8.0.13, 8.0.14, 8.0.20, 8.0.28, 8.0.35 and 8.0.36 to show how performance has changed from the oldest tested release (5.6.21) to the newest (8.0.36).</p><p></p><ul><li>The largest regressions might occur between the last point release in one major version and the first point release in the next major version.</li><li>For point queries, 8.0.36 gets 19% to 39% less QPS vs 5.6.21</li><li>For range queries that don't do aggregation (part 1), 8.0.36 gets 29% to 39% less QPS vs 5.6.21</li><li>For range queries that do aggregation, 8.0.36 gets 3% to 45% less QPS vs 5.6.21. The difference depends on the length of the range scan -- shorter scan == larger regression.
And full scan (scan_range=100) has the largest regression.</li><li>For most writes (ignoring the <i>update-index</i> microbenchmark), 8.0.36 gets about half of the throughput compared to 5.6.21</li></ul><div>Summary statistics for each of the benchmark groupings:</div><div><br /><google-sheets-html-origin><table border="1" cellpadding="0" cellspacing="0" data-sheets-root="1" dir="ltr" style="border-collapse: collapse; border: none; font-family: Arial; font-size: 10pt; table-layout: fixed; width: 0px;" xmlns="http://www.w3.org/1999/xhtml"><colgroup><col width="100"></col><col width="100"></col><col width="100"></col><col width="100"></col><col width="100"></col><col width="100"></col></colgroup><tbody><tr style="height: 21px;"><td style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;"></td><td data-sheets-value="{"1":2,"2":"min"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">min</td><td data-sheets-value="{"1":2,"2":"max"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">max</td><td data-sheets-value="{"1":2,"2":"avg"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">avg</td><td data-sheets-value="{"1":2,"2":"median"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">median</td><td data-sheets-value="{"1":2,"2":"stdev"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">stdev</td></tr><tr style="height: 21px;"><td data-sheets-value="{"1":2,"2":"point-1"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">point-1</td><td data-sheets-formula="=min(R[-49]C[6]:R[-39]C[6])" data-sheets-numberformat="{"1":2,"2":"0.00","3":1}" data-sheets-value="{"1":3,"3":0.63}" style="border: 1px 
solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; text-align: right; vertical-align: bottom;">0.63</td><td data-sheets-formula="=max(R[-49]C[5]:R[-39]C[5])" data-sheets-numberformat="{"1":2,"2":"0.00","3":1}" data-sheets-value="{"1":3,"3":0.78}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; text-align: right; vertical-align: bottom;">0.78</td><td data-sheets-formula="=average(R[-49]C[4]:R[-39]C[4])" data-sheets-numberformat="{"1":2,"2":"0.00","3":1}" data-sheets-value="{"1":3,"3":0.7090909090909091}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; text-align: right; vertical-align: bottom;">0.71</td><td data-sheets-formula="=median(R[-49]C[3]:R[-39]C[3])" data-sheets-numberformat="{"1":2,"2":"0.00","3":1}" data-sheets-value="{"1":3,"3":0.69}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; text-align: right; vertical-align: bottom;">0.69</td><td data-sheets-formula="=stdev(R[-49]C[2]:R[-39]C[2])" data-sheets-numberformat="{"1":2,"2":"0.00","3":1}" data-sheets-value="{"1":3,"3":0.04504543161177291}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; text-align: right; vertical-align: bottom;">0.05</td></tr><tr style="height: 21px;"><td data-sheets-value="{"1":2,"2":"point-2"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">point-2</td><td data-sheets-formula="=min(R[-38]C[6]:R[-33]C[6])" data-sheets-numberformat="{"1":2,"2":"0.00","3":1}" data-sheets-value="{"1":3,"3":0.61}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; text-align: right; vertical-align: bottom;">0.61</td><td data-sheets-formula="=max(R[-38]C[5]:R[-33]C[5])" data-sheets-numberformat="{"1":2,"2":"0.00","3":1}" data-sheets-value="{"1":3,"3":0.81}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; text-align: right; vertical-align: 
bottom;">0.81</td><td data-sheets-formula="=average(R[-38]C[4]:R[-33]C[4])" data-sheets-numberformat="{"1":2,"2":"0.00","3":1}" data-sheets-value="{"1":3,"3":0.7050000000000001}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; text-align: right; vertical-align: bottom;">0.71</td><td data-sheets-formula="=median(R[-38]C[3]:R[-33]C[3])" data-sheets-numberformat="{"1":2,"2":"0.00","3":1}" data-sheets-value="{"1":3,"3":0.695}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; text-align: right; vertical-align: bottom;">0.70</td><td data-sheets-formula="=stdev(R[-38]C[2]:R[-33]C[2])" data-sheets-numberformat="{"1":2,"2":"0.00","3":1}" data-sheets-value="{"1":3,"3":0.08983317872590285}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; text-align: right; vertical-align: bottom;">0.09</td></tr><tr style="height: 21px;"><td data-sheets-value="{"1":2,"2":"range-1"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">range-1</td><td data-sheets-formula="=min(R[-32]C[6]:R[-25]C[6])" data-sheets-numberformat="{"1":2,"2":"0.00","3":1}" data-sheets-value="{"1":3,"3":0.61}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; text-align: right; vertical-align: bottom;">0.61</td><td data-sheets-formula="=max(R[-32]C[5]:R[-25]C[5])" data-sheets-numberformat="{"1":2,"2":"0.00","3":1}" data-sheets-value="{"1":3,"3":0.71}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; text-align: right; vertical-align: bottom;">0.71</td><td data-sheets-formula="=average(R[-32]C[4]:R[-25]C[4])" data-sheets-numberformat="{"1":2,"2":"0.00","3":1}" data-sheets-value="{"1":3,"3":0.6399999999999999}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; text-align: right; vertical-align: bottom;">0.64</td><td data-sheets-formula="=median(R[-32]C[3]:R[-25]C[3])" 
data-sheets-numberformat="{"1":2,"2":"0.00","3":1}" data-sheets-value="{"1":3,"3":0.62}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; text-align: right; vertical-align: bottom;">0.62</td><td data-sheets-formula="=stdev(R[-32]C[2]:R[-25]C[2])" data-sheets-numberformat="{"1":2,"2":"0.00","3":1}" data-sheets-value="{"1":3,"3":0.04070801956792857}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; text-align: right; vertical-align: bottom;">0.04</td></tr><tr style="height: 21px;"><td data-sheets-value="{"1":2,"2":"range-2"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">range-2</td><td data-sheets-formula="=min(R[-24]C[6]:R[-18]C[6])" data-sheets-numberformat="{"1":2,"2":"0.00","3":1}" data-sheets-value="{"1":3,"3":0.55}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; text-align: right; vertical-align: bottom;">0.55</td><td data-sheets-formula="=max(R[-24]C[5]:R[-18]C[5])" data-sheets-numberformat="{"1":2,"2":"0.00","3":1}" data-sheets-value="{"1":3,"3":0.97}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; text-align: right; vertical-align: bottom;">0.97</td><td data-sheets-formula="=average(R[-24]C[4]:R[-18]C[4])" data-sheets-numberformat="{"1":2,"2":"0.00","3":1}" data-sheets-value="{"1":3,"3":0.7557142857142857}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; text-align: right; vertical-align: bottom;">0.76</td><td data-sheets-formula="=median(R[-24]C[3]:R[-18]C[3])" data-sheets-numberformat="{"1":2,"2":"0.00","3":1}" data-sheets-value="{"1":3,"3":0.74}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; text-align: right; vertical-align: bottom;">0.74</td><td data-sheets-formula="=stdev(R[-24]C[2]:R[-18]C[2])" data-sheets-numberformat="{"1":2,"2":"0.00","3":1}" 
data-sheets-value="{"1":3,"3":0.16060896019599308}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; text-align: right; vertical-align: bottom;">0.16</td></tr><tr style="height: 21px;"><td data-sheets-value="{"1":2,"2":"writes"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">writes</td><td data-sheets-formula="=min(R[-17]C[6]:R[-8]C[6])" data-sheets-numberformat="{"1":2,"2":"0.00","3":1}" data-sheets-value="{"1":3,"3":0.44}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; text-align: right; vertical-align: bottom;">0.44</td><td data-sheets-formula="=max(R[-17]C[5]:R[-8]C[5])" data-sheets-numberformat="{"1":2,"2":"0.00","3":1}" data-sheets-value="{"1":3,"3":1.08}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; text-align: right; vertical-align: bottom;">1.08</td><td data-sheets-formula="=average(R[-17]C[4]:R[-8]C[4])" data-sheets-numberformat="{"1":2,"2":"0.00","3":1}" data-sheets-value="{"1":3,"3":0.625}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; text-align: right; vertical-align: bottom;">0.63</td><td data-sheets-formula="=median(R[-17]C[3]:R[-8]C[3])" data-sheets-numberformat="{"1":2,"2":"0.00","3":1}" data-sheets-value="{"1":3,"3":0.5549999999999999}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; text-align: right; vertical-align: bottom;">0.56</td><td data-sheets-formula="=stdev(R[-17]C[2]:R[-8]C[2])" data-sheets-numberformat="{"1":2,"2":"0.00","3":1}" data-sheets-value="{"1":3,"3":0.18951692976266438}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; text-align: right; vertical-align: bottom;">0.19</td></tr></tbody></table></google-sheets-html-origin></div><div class="separator" style="clear: both; text-align: center;"><a 
href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiTm_9W-Hd4rcZktlcIK6CvRS1b7t7f-TzlLpBrO4nnyKDFdGmWdUBaQSq0qi2vQLJ6EhTnXcmAC4bhh4T5U8Rfdv3qejMzk_zsNlEMz8ySOd6EOuaV6eM6PtWZdkOh6ZujP13WHGYovvQHv7k_RJORpSb-4oDTKks23aw65mznWjew1s0pfis-H1OX8csN/s600/Point%20query,%20part%201_%20MySQL%205.6,%205.7,%208.0%20relative%20to%205.6.21.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiTm_9W-Hd4rcZktlcIK6CvRS1b7t7f-TzlLpBrO4nnyKDFdGmWdUBaQSq0qi2vQLJ6EhTnXcmAC4bhh4T5U8Rfdv3qejMzk_zsNlEMz8ySOd6EOuaV6eM6PtWZdkOh6ZujP13WHGYovvQHv7k_RJORpSb-4oDTKks23aw65mznWjew1s0pfis-H1OX8csN/w640-h396/Point%20query,%20part%201_%20MySQL%205.6,%205.7,%208.0%20relative%20to%205.6.21.png" width="640" /></a></div><div class="separator" style="clear: both; text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhwwUEP1ZvDAGd-xFrSksG0wZX1iIXMYntga7vgVEmVRSU8OFVwiur0hSBA1DyWEYIm5MncEstvnV1Izk38gl3XybDz_rL7616O2O7bRCKifD0zcutoqtpikY7ocft3h9rkTq32T4PHmjdnQF-Sr7fEXofUjPku3IHuVqYy-MvfAeSyy3lDo_aBQzJOG10F/s600/Point%20query,%20part%202_%20MySQL%205.6,%205.7,%208.0%20relative%20to%205.6.21.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhwwUEP1ZvDAGd-xFrSksG0wZX1iIXMYntga7vgVEmVRSU8OFVwiur0hSBA1DyWEYIm5MncEstvnV1Izk38gl3XybDz_rL7616O2O7bRCKifD0zcutoqtpikY7ocft3h9rkTq32T4PHmjdnQF-Sr7fEXofUjPku3IHuVqYy-MvfAeSyy3lDo_aBQzJOG10F/w640-h396/Point%20query,%20part%202_%20MySQL%205.6,%205.7,%208.0%20relative%20to%205.6.21.png" width="640" /></a></div><div class="separator" style="clear: both; text-align: center;"><a 
href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgO03iU7uMESMF4V1Wx6eRUFw1DeSvSYOuc4Ga6AcOkh94_eP92wGxYsj17hHkMQurfKYn-JNspMMowzqoY_iKCBirBDwLIOd5yJ-gKhr8HRmhde1MCQnso9XlEGjzkfvK7H71M-lMnGD4t3zNlpJ6ZoD7Jt8_6w2VyjntSV5afNIr7s4Bc2B7uvl7LrVCc/s600/Range%20query,%20part%201_%20MySQL%205.6,%205.7,%208.0%20relative%20to%205.6.21.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgO03iU7uMESMF4V1Wx6eRUFw1DeSvSYOuc4Ga6AcOkh94_eP92wGxYsj17hHkMQurfKYn-JNspMMowzqoY_iKCBirBDwLIOd5yJ-gKhr8HRmhde1MCQnso9XlEGjzkfvK7H71M-lMnGD4t3zNlpJ6ZoD7Jt8_6w2VyjntSV5afNIr7s4Bc2B7uvl7LrVCc/w640-h396/Range%20query,%20part%201_%20MySQL%205.6,%205.7,%208.0%20relative%20to%205.6.21.png" width="640" /></a></div><div class="separator" style="clear: both; text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjdw33LOtjV1c2nr10Yy3mNuPYwMYsBW5koBU8i9HNrR84udY3piPIPMv_jmqPjCAo9CFvVIKprmeduFIvq6z-zY_G8AmpqQPIU-ikQNVZ8lNC_VbZnYT169OK_qZfUHwsM-bYrQVSlVoK1MxEJhMsw7iXZQtmGVdgYenuaoJOgaH_AUcUoZmI6d6GBleKh/s600/Range%20query,%20part%202_%20MySQL%205.6,%205.7,%208.0%20relative%20to%205.6.21.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjdw33LOtjV1c2nr10Yy3mNuPYwMYsBW5koBU8i9HNrR84udY3piPIPMv_jmqPjCAo9CFvVIKprmeduFIvq6z-zY_G8AmpqQPIU-ikQNVZ8lNC_VbZnYT169OK_qZfUHwsM-bYrQVSlVoK1MxEJhMsw7iXZQtmGVdgYenuaoJOgaH_AUcUoZmI6d6GBleKh/w640-h396/Range%20query,%20part%202_%20MySQL%205.6,%205.7,%208.0%20relative%20to%205.6.21.png" width="640" /></a></div><div class="separator" style="clear: both; text-align: center;"><a 
href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjL6LuoBvinKJ8QMD4uEIkAXrmYXSC13fbA1oh6aTu71YywmZjfGBJdgHYFbDq2YjPClPfYdLbzamgBGa0OR9r-51ZZOZSzc0wdxASnDwztZp82YqRDJU4qKcXtw0izqy2lWppWiTumhdO4jBiDty4WMgk5H19idbuXZW1LGYWNwJ4brkP37VzQ-KesyBkz/s600/Writes_%20%20MySQL%205.6,%205.7,%208.0%20relative%20to%205.6.21.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjL6LuoBvinKJ8QMD4uEIkAXrmYXSC13fbA1oh6aTu71YywmZjfGBJdgHYFbDq2YjPClPfYdLbzamgBGa0OR9r-51ZZOZSzc0wdxASnDwztZp82YqRDJU4qKcXtw0izqy2lWppWiTumhdO4jBiDty4WMgk5H19idbuXZW1LGYWNwJ4brkP37VzQ-KesyBkz/w640-h396/Writes_%20%20MySQL%205.6,%205.7,%208.0%20relative%20to%205.6.21.png" width="640" /></a></div><div><b>MySQL 8.0: some point releases</b></div><div><br /></div><div>This section uses 8.0.13 as the base version and then compares that with 8.0.14, 8.0.20, 8.0.28, 8.0.35 and 8.0.36 to show how performance has changed from 8.0.13 to 8.0.36.</div><div><br /></div><div>There was a perf bug in 8.0.28 (<a href="https://bugs.mysql.com/bug.php?id=102037">bug 102037</a>) from the optimizer for queries with large in-lists that explains the two results below in <i>Point query, part 2</i> that are close to 0.40.</div><div><br /></div><div>From MySQL 8.0.13 to 8.0.36</div><div><ul style="text-align: left;"><li>Point queries are ~5% slower in 8.0.36</li><li>Range queries without aggregation are between 6% and 15% slower in 8.0.36 and for a few microbenchmarks there is a big regression after 8.0.28 (possibly <a href="https://bugs.mysql.com/bug.php?id=111538">bug 111538</a>)</li><li>Range queries with aggregation are mostly ~15% slower in 8.0.36</li><li>Full scan is ~32% slower in 8.0.36 with a big regression after 8.0.28 (possibly <a href="https://bugs.mysql.com/bug.php?id=111538">bug 111538</a>)</li><li>Writes are ~20% slower in 8.0.36 with a big regression after 
8.0.20</li></ul></div><div><div class="separator" style="clear: both; text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhCNX9H9V6sdFnjmUbSY_9nwnTKWrr-nd0_06A0uQ-Y9w2yRhZw1F5Cw7aXX_uxYfiqX_9YNn1kNKsZb6ngdA5lNo7HNsU5x7IzaClQmiTpQcUlxLafH03FeslcKK6qMnTQeQy4XtcThBaF4QfrehKDzkFZJhhx5Z2xOUFNIFjp9_tJhS3-xcqYFBdhWWJl/s600/Point%20query,%20part%201_%20MySQL%208.0,%20relative%20to%208.0.13.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhCNX9H9V6sdFnjmUbSY_9nwnTKWrr-nd0_06A0uQ-Y9w2yRhZw1F5Cw7aXX_uxYfiqX_9YNn1kNKsZb6ngdA5lNo7HNsU5x7IzaClQmiTpQcUlxLafH03FeslcKK6qMnTQeQy4XtcThBaF4QfrehKDzkFZJhhx5Z2xOUFNIFjp9_tJhS3-xcqYFBdhWWJl/w640-h396/Point%20query,%20part%201_%20MySQL%208.0,%20relative%20to%208.0.13.png" width="640" /></a></div><div class="separator" style="clear: both; text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhANYV_sqK4AvcBFzNfTMWdlUVRllJe2ADPlAvndZ9sXT4LQaFAVxh4IsxDGPbuaeF4B3PlqaRhcjUkDHCJm6Ui740XzcBQFj4VIxTXJ0IdT25bw3bu6eP0kXL83dm_HA_B4vpvTUqNmOPzEwXLyQibotI4NuSrrTIeHF1xhMWb7Wyd2Fppb2RF1Xgrd3AY/s600/Point%20query,%20part%202_%20MySQL%208.0,%20relative%20to%208.0.13.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhANYV_sqK4AvcBFzNfTMWdlUVRllJe2ADPlAvndZ9sXT4LQaFAVxh4IsxDGPbuaeF4B3PlqaRhcjUkDHCJm6Ui740XzcBQFj4VIxTXJ0IdT25bw3bu6eP0kXL83dm_HA_B4vpvTUqNmOPzEwXLyQibotI4NuSrrTIeHF1xhMWb7Wyd2Fppb2RF1Xgrd3AY/w640-h396/Point%20query,%20part%202_%20MySQL%208.0,%20relative%20to%208.0.13.png" width="640" /></a></div><div class="separator" style="clear: both; text-align: center;"><a 
href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEigBXzm_-TTJpcBxjg5Dd5W7komXlyfBDYBknWHaLHXbnX7MXQ3dnFPVXjtDqCbwwfv9vLw5StnkAeQQUlymev9s39D_JneKs6eY0WEn0U8thpiuvSsDQVnyxGu4ehDZRlKhgSblxWzrlAGrdvrn9iaktikg1Sun58EnNT9cmwI9goJA4TDJpmRT5yqI3I4/s600/Range%20query,%20part%201_%20MySQL%208.0,%20relative%20to%208.0.13.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEigBXzm_-TTJpcBxjg5Dd5W7komXlyfBDYBknWHaLHXbnX7MXQ3dnFPVXjtDqCbwwfv9vLw5StnkAeQQUlymev9s39D_JneKs6eY0WEn0U8thpiuvSsDQVnyxGu4ehDZRlKhgSblxWzrlAGrdvrn9iaktikg1Sun58EnNT9cmwI9goJA4TDJpmRT5yqI3I4/w640-h396/Range%20query,%20part%201_%20MySQL%208.0,%20relative%20to%208.0.13.png" width="640" /></a></div><div class="separator" style="clear: both; text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEj7T-BAzrXuSAYg_wJbxIk46pofTqhOYXwrAwEwof8MtyW95i5E03qLrwB5bcpZ76b5fE8WAIgKN0xpNwgl5zOnEjdr8CNcBQsR4SfBME4aDtTZY9N7HmqbuW18aJQbYj9ES0kIEll-7pKnLkfFomp3eYhwDd_PTucGIab-ZNFii5GdJOn_NiM1uPaAmn00/s600/Range%20query,%20part%202_%20MySQL%208.0,%20relative%20to%208.0.13.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEj7T-BAzrXuSAYg_wJbxIk46pofTqhOYXwrAwEwof8MtyW95i5E03qLrwB5bcpZ76b5fE8WAIgKN0xpNwgl5zOnEjdr8CNcBQsR4SfBME4aDtTZY9N7HmqbuW18aJQbYj9ES0kIEll-7pKnLkfFomp3eYhwDd_PTucGIab-ZNFii5GdJOn_NiM1uPaAmn00/w640-h396/Range%20query,%20part%202_%20MySQL%208.0,%20relative%20to%208.0.13.png" width="640" /></a></div><div class="separator" style="clear: both; text-align: center;"><a 
href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhysqQKkv6cx0mF5TT0J8eOkm0DJLh4dGSq8nTNnZKzZ3Bkl3ARv-Y4Kcul88bz0ZtAXX4SdFWfGaJfYowciDmC5YRVbgz1cIpfqqVWP3tVWbtPNtxRheSbYUT_hz5CDQgy2vbd22URz2TvjUkr4D01OB7rdXsggHEZU9XTSXUX1RMy0bi5NHH_54ain3cv/s600/Writes_%20MySQL%208.0,%20relative%20to%208.0.13.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhysqQKkv6cx0mF5TT0J8eOkm0DJLh4dGSq8nTNnZKzZ3Bkl3ARv-Y4Kcul88bz0ZtAXX4SdFWfGaJfYowciDmC5YRVbgz1cIpfqqVWP3tVWbtPNtxRheSbYUT_hz5CDQgy2vbd22URz2TvjUkr4D01OB7rdXsggHEZU9XTSXUX1RMy0bi5NHH_54ain3cv/w640-h396/Writes_%20MySQL%208.0,%20relative%20to%208.0.13.png" width="640" /></a></div><div><b>MySQL 5.7: some point releases</b></div></div><div><br /></div><div>This section uses 5.7.10 as the base version and then compares that with 5.7.20, 5.7.30 and 5.7.44 to show how performance has changed from 5.7.10 to 5.7.44.</div><div><br /></div><div>For most microbenchmarks the throughput in 5.7.44 is no more than 5% less than in 5.7.10. 
For two microbenchmarks (<i>update-index</i> and <i>update-inlist</i>) the throughput in 5.7.44 is larger than in 5.7.10.</div><div><div class="separator" style="clear: both; text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhs0joYUb9ZIWzQSosWdp-raCXmcUPSSeYRI1AT35200_iI-_fl1QL_A99_bAfXHAzsPlegGkMeRkMwVH-7DCjxSJ7i_Jqq7YzwYCPRpbfA9Ytj_OkuKKq9XK7l8qAxZYOkUDGO-SEj-5ty21pmOxO4knQpukjPPNwbRkJ7tKfUV21x9_eNelYpvlkUUUfV/s600/Point%20query,%20part%201_%20MySQL%205.7,%20relative%20to%205.7.10.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhs0joYUb9ZIWzQSosWdp-raCXmcUPSSeYRI1AT35200_iI-_fl1QL_A99_bAfXHAzsPlegGkMeRkMwVH-7DCjxSJ7i_Jqq7YzwYCPRpbfA9Ytj_OkuKKq9XK7l8qAxZYOkUDGO-SEj-5ty21pmOxO4knQpukjPPNwbRkJ7tKfUV21x9_eNelYpvlkUUUfV/w640-h396/Point%20query,%20part%201_%20MySQL%205.7,%20relative%20to%205.7.10.png" width="640" /></a></div><div class="separator" style="clear: both; text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjVyCf73qdujfWISB6_H65d7Zzi2OTmUHiBH_iht21yvmjXih826_0ijupsXulD95VvU4pUqYihXzFL7o5a_pOMk4uh0bqn2mTb42voiDt8ou8Il2Gato6-AdoX27LtNek_xHZrMXimFUzW3GrUseavlVuzhTcpwG3fq9u2NxMGsD61GItLgr-dZ8aPwDDb/s600/Point%20query,%20part%202_%20MySQL%205.7,%20relative%20to%205.7.10.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjVyCf73qdujfWISB6_H65d7Zzi2OTmUHiBH_iht21yvmjXih826_0ijupsXulD95VvU4pUqYihXzFL7o5a_pOMk4uh0bqn2mTb42voiDt8ou8Il2Gato6-AdoX27LtNek_xHZrMXimFUzW3GrUseavlVuzhTcpwG3fq9u2NxMGsD61GItLgr-dZ8aPwDDb/w640-h396/Point%20query,%20part%202_%20MySQL%205.7,%20relative%20to%205.7.10.png" width="640" /></a></div><div class="separator" style="clear: both; text-align: center;"><a 
href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgfjSP3zl_gjjvVwzc2SRcX4E-VPEFPJqB1ZRVA6EONwl7rq432TjUXDL6EiugJ8zAb3lN7YA6Rj_YEFRpHXoJmiFcxXPpZc0A8RP8ZdIIdq-6JEJOOBHS9Azdld0ODFCkzdOYAmuRa_96XWwBerWRdqoUL0FAP-cxKHiIaM11ygfZed_8NhzWipH2sqeQ3/s600/Range%20query,%20part%201_%20MySQL%205.7,%20relative%20to%205.7.10.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgfjSP3zl_gjjvVwzc2SRcX4E-VPEFPJqB1ZRVA6EONwl7rq432TjUXDL6EiugJ8zAb3lN7YA6Rj_YEFRpHXoJmiFcxXPpZc0A8RP8ZdIIdq-6JEJOOBHS9Azdld0ODFCkzdOYAmuRa_96XWwBerWRdqoUL0FAP-cxKHiIaM11ygfZed_8NhzWipH2sqeQ3/w640-h396/Range%20query,%20part%201_%20MySQL%205.7,%20relative%20to%205.7.10.png" width="640" /></a></div><div class="separator" style="clear: both; text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEi-IXAZvWrN8EOFWQgSgIuUwHGNcxtTLTlFLXAUttiB9bm968AlDC0QKHTKwLjok1lyuWVMfTgH0fCmReINL5y2dz33EFQtk61l6GZ3YQoS-kANkid35qx8q-69Yml5WIzTYwRHJbyLj8QRE_gVW49igpsGUXCHDUX_v8LR2LDXAYGXXJEl62kKBXAkVRkP/s600/Range%20query,%20part%202_%20MySQL%205.7,%20relative%20to%205.7.10.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEi-IXAZvWrN8EOFWQgSgIuUwHGNcxtTLTlFLXAUttiB9bm968AlDC0QKHTKwLjok1lyuWVMfTgH0fCmReINL5y2dz33EFQtk61l6GZ3YQoS-kANkid35qx8q-69Yml5WIzTYwRHJbyLj8QRE_gVW49igpsGUXCHDUX_v8LR2LDXAYGXXJEl62kKBXAkVRkP/w640-h396/Range%20query,%20part%202_%20MySQL%205.7,%20relative%20to%205.7.10.png" width="640" /></a></div><div class="separator" style="clear: both; text-align: center;"><a 
href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhppNmh6-SZv4oai2z-TaLXtQM2qYSsP6QRAARzIc7MmOegVhpFbJm6KVUu5RyyhDT25uMOdE5Nj-n9xIdsYQsR00J2omE7QpgPKLPIvJXUq22bhuNCXqk67k-c4j25sk8z6zcN-r_cfcfJJZm3EwkBQYbEK2vnXhWwmj4on6u1EoVTPyy4sH3y9Oh9jDtA/s600/Writes_%20MySQL%205.7,%20relative%20to%205.7.10.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhppNmh6-SZv4oai2z-TaLXtQM2qYSsP6QRAARzIc7MmOegVhpFbJm6KVUu5RyyhDT25uMOdE5Nj-n9xIdsYQsR00J2omE7QpgPKLPIvJXUq22bhuNCXqk67k-c4j25sk8z6zcN-r_cfcfJJZm3EwkBQYbEK2vnXhWwmj4on6u1EoVTPyy4sH3y9Oh9jDtA/w640-h396/Writes_%20MySQL%205.7,%20relative%20to%205.7.10.png" width="640" /></a></div><div><b>MySQL 5.6: some point releases</b></div></div><div><br /></div><div>This section uses 5.6.21 as the base version and then compares that with 5.6.31, 5.6.41 and 5.6.51 to show how performance has changed from 5.6.21 to 5.6.51.</div><div><br /></div><div>For most microbenchmarks the throughput in 5.6.51 is no more than 5% less than in 5.6.21. 
The largest regression is ~10% from full scan (<i>scan_range=100</i>) and 5.6.51 is faster than 5.6.21 for the <i>update-inlist</i> microbenchmark.</div><div class="separator" style="clear: both; text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiNLT9GGh1hKPXMS_9Mf6akWV15ZF5h__RCLVmkPYRNPRXBrXOrZnjCtKVG_Z9bw6AQNfm9ommM7g_khLIkag86ExzhvD3484RMbiX8mjqip3bdeYfVo4vv8WXLzT5uehCoeYNe1sYnTfS7XtkW2CBrTh1KRIXm_5H6Z5fbnDfZn04EpIet-X6AKC3eTQwB/s600/Point%20query,%20part%201_%20MySQL%205.6,%20relative%20to%205.6.21.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiNLT9GGh1hKPXMS_9Mf6akWV15ZF5h__RCLVmkPYRNPRXBrXOrZnjCtKVG_Z9bw6AQNfm9ommM7g_khLIkag86ExzhvD3484RMbiX8mjqip3bdeYfVo4vv8WXLzT5uehCoeYNe1sYnTfS7XtkW2CBrTh1KRIXm_5H6Z5fbnDfZn04EpIet-X6AKC3eTQwB/w640-h396/Point%20query,%20part%201_%20MySQL%205.6,%20relative%20to%205.6.21.png" width="640" /></a></div><div class="separator" style="clear: both; text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEh2btSHTPCzpovs7IdtWSgdDG4w_UDkhBQasBNM_z0Aj3_hIokUot8EoWUWRdvHtnbT7vUrSz8ovBwpx3KLQ6t7DSyst0dVSvUyz3l5Xo7aydvSuqoHyOEpPMhH7XbC7KTsl6ohNW3wwnxgkr9evfSP8zhyphenhyphenlhI46sMujAS3vq8a-e9J8CZxMulezj0kr_4C/s600/Point%20query,%20part%202_%20MySQL%205.6,%20relative%20to%205.6.21.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEh2btSHTPCzpovs7IdtWSgdDG4w_UDkhBQasBNM_z0Aj3_hIokUot8EoWUWRdvHtnbT7vUrSz8ovBwpx3KLQ6t7DSyst0dVSvUyz3l5Xo7aydvSuqoHyOEpPMhH7XbC7KTsl6ohNW3wwnxgkr9evfSP8zhyphenhyphenlhI46sMujAS3vq8a-e9J8CZxMulezj0kr_4C/w640-h396/Point%20query,%20part%202_%20MySQL%205.6,%20relative%20to%205.6.21.png" width="640" /></a></div><div class="separator" style="clear: both; 
text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgWEQATZU7l6pVlKkLEfwaWO4wm1kHXmduUfrB4MNhDqQ-GcOQtNk4jMQlXDoH4jgyLb2m1E0DGV7BchvyLrmv7XOg3EdYJlxIAOyvDtWVZZ96snYMXaoHCJf4NUinFXxQXAjGR4YyQml33XJ1Kdb48JNhLMieDRz4JLEKTN-Dj12QoqgGQ8Zd4egpRlArM/s600/Range%20query,%20part%201_%20MySQL%205.6,%20relative%20to%205.6.21.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgWEQATZU7l6pVlKkLEfwaWO4wm1kHXmduUfrB4MNhDqQ-GcOQtNk4jMQlXDoH4jgyLb2m1E0DGV7BchvyLrmv7XOg3EdYJlxIAOyvDtWVZZ96snYMXaoHCJf4NUinFXxQXAjGR4YyQml33XJ1Kdb48JNhLMieDRz4JLEKTN-Dj12QoqgGQ8Zd4egpRlArM/w640-h396/Range%20query,%20part%201_%20MySQL%205.6,%20relative%20to%205.6.21.png" width="640" /></a></div><div class="separator" style="clear: both; text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEitiTx0gyOQ9mfuSjV7Dq-9kBU1WAE_Llyh7vUd4rhg_ehz_kby6SkmkwA4gX5c6uz8B3XxwFoxlbKtPziwGL1Ye2JgG_MRY6Fe5t3qMv_oVLmhDvJy08CDsJVHyaRghKUox6RP4b2ojDbuMSDzFd6i185L5y41kGIGvaMimaA5UDmrAUzjFilCys52E-LC/s600/Range%20query,%20part%202_%20MySQL%205.6,%20relative%20to%205.6.21.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEitiTx0gyOQ9mfuSjV7Dq-9kBU1WAE_Llyh7vUd4rhg_ehz_kby6SkmkwA4gX5c6uz8B3XxwFoxlbKtPziwGL1Ye2JgG_MRY6Fe5t3qMv_oVLmhDvJy08CDsJVHyaRghKUox6RP4b2ojDbuMSDzFd6i185L5y41kGIGvaMimaA5UDmrAUzjFilCys52E-LC/w640-h396/Range%20query,%20part%202_%20MySQL%205.6,%20relative%20to%205.6.21.png" width="640" /></a></div><div class="separator" style="clear: both; text-align: center;"><a 
href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjHdkINgpUWghiH2tWcv9UvvLfkDtnF5lKh8-HL71-QCmZGvY3CX0vgzQ0Eleh1S7XC6671JN3seRH_ybNIsnkEovo4IQuoTYL5j7yrS5tBEBjXd3eNaiOVvg1SmoO53PJgmLjnq5zJ4hi5IOvR_DBKJ_fsiw5x4-vwMbKUdyLExWkGRP9t4oe1xtN1kdCo/s600/Writes_%20MySQL%205.6,%20relative%20to%205.6.21.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjHdkINgpUWghiH2tWcv9UvvLfkDtnF5lKh8-HL71-QCmZGvY3CX0vgzQ0Eleh1S7XC6671JN3seRH_ybNIsnkEovo4IQuoTYL5j7yrS5tBEBjXd3eNaiOVvg1SmoO53PJgmLjnq5zJ4hi5IOvR_DBKJ_fsiw5x4-vwMbKUdyLExWkGRP9t4oe1xtN1kdCo/w640-h396/Writes_%20MySQL%205.6,%20relative%20to%205.6.21.png" width="640" /></a></div></div>Mark Callaghanhttp://www.blogger.com/profile/09590445221922043181noreply@blogger.com0tag:blogger.com,1999:blog-9149523927864751087.post-1259289143519057952024-02-12T14:16:00.000-08:002024-02-17T09:36:13.301-08:00It wasn't a performance regression in Postgres 14<p>With help from a Postgres expert (Peter Geoghegan) I was able to confirm there wasn't a performance regression for Postgres 14 in a few of the benchmark steps with the Insert Benchmark as I started to report on in a <a href="https://smalldatum.blogspot.com/2024/01/explaining-performance-regression-in.html">previous blog post</a>. The results here are from a small server for both cached and IO-bound workloads and replace my previous blog posts (<a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-postgres-9x-to_24.html">cached</a>, <a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-postgres-9x-to_27.html">IO-bound</a>).</p><p>The reason for the false alarm is that index cleanup was skipped during vacuum starting with Postgres 14 and the impact is that the optimizer had more work to do (more not-visible index entries to skip) in the get_actual_variable_range function. 
Output like this from the vacuum command makes that obvious:</p><p><i>table "pi1": index scan bypassed: 48976 pages from table (0.62% of total) have 5000000 dead item identifiers</i></p><p>The problem is solved by adding <i>INDEX_CLEANUP ON</i> to the <a href="https://www.postgresql.org/docs/current/sql-vacuum.html">vacuum</a> command.</p><p>tl;dr</p><p></p><ul style="text-align: left;"><li>All results here are from a low-concurrency workload on a small server (1 client, <= 3 connections, 8 cores). Results from a bigger server are pending. </li><li>For cached workloads throughput improves a lot on all benchmark steps except point query (qp*) where from Postgres 9.0 to 16 it is slightly better on the SER4 server and then stable to slightly slower on the SER7 server.</li><li>For IO-bound workloads on the SER4 server from Postgres 9.0 through 16 the results are similar to the cached workload -- things improve a lot for all benchmark steps except point query (qp*).</li><li>For IO-bound workloads on the SER7 server from Postgres 12 through 16 the throughput for range and point queries (qr*, qp*) is stable while for write-heavy there are some regressions. Also there is a ~5% increase in CPU/operation from Postgres 12 through 16 on the random-write benchmark steps (l.i1, l.i2).</li><li>For IO-bound workloads with the SER4 server the benchmark steps were unable to sustain the target write rates during the qr1000 and qp1000 benchmark steps (1000 inserts/s + 1000 deletes/s) for many of the Postgres versions. This was not an issue on the SER7 server, which has a faster CPU and a better RAM / data ratio.</li><li>The delete/s rate is between 5X and 20X larger for the l.i1 benchmark step vs the l.i2 step. 
The issue is that l.i1 deletes 10X more rows/statement and the impact of the optimizer CPU overhead is much worse during l.i2 -- see my comments below about the weird workload.</li></ul><div><b>Update</b> - I have claimed that InnoDB and MyRocks don't have this problem and that is more truthy than true because they had a problem from MVCC GC getting behind, but the problem shows up on a SELECT statement while Postgres has the problem with a DELETE statement. <a href="https://smalldatum.blogspot.com/2023/07/myrocks-innodb-and-postgres-as-queue.html">See here</a> for details.</div><p></p><p><b>Editorial</b></p><p>At a high-level there are several issues:</p><p></p><ol style="text-align: left;"><li>The workload that triggers the issue is weird (uncommon)</li><li>With MVCC GC in Postgres, garbage can remain in indexes for a while</li><li>The Postgres optimizer can use too much CPU time in get_actual_variable_range to help figure out index selectivity even when there is only one good index for a SQL statement</li></ol><div>First, the workload that triggers the issue is weird. I hope to rewrite the Insert Benchmark later this year to be less weird. The weirdness is that last year I enhanced the Insert Benchmark to optionally delete from tables at the same rate as inserts to keep the tables from growing too big. A too big table might make my tests fail when a disk is full. And a too big table means I can't run the cached (in-memory) variant of the test for longer periods of time. But the problem is that the enhancements meant I added statements like <i>DELETE FROM foo WHERE pk_column > $a AND pk_column < $b</i>. 
The constants $a and $b are usually in or even less than the histogram bucket with the smallest value for the column which means that get_actual_variable_range then tries to read from the index to determine the current minimum value for that index.</div><div><br /></div><div>Second, with MVCC GC in Postgres, garbage can remain in indexes for longer than I want it to.</div><div><ul style="text-align: left;"><li>MVCC GC in InnoDB is called purge and is running all of the time -- although it can be slowed by read IO latency and temporarily halted when there is a long-open snapshot. But most of the time InnoDB will cleanup (remove non-visible index entries) soon after transaction commit.</li><li>Cleanup with Postgres has more lag. It can be done by vacuum with a lag of many hours. It can also be done by simple index deletion but only on page splits and my workload doesn't trigger page splits on DELETE. Perhaps the existing code can be updated to also trigger simple index deletion when an index leaf page is mostly or all non-visible entries. I risk writing nonsense about Postgres in this regard, and for better information see <a href="https://www.youtube.com/watch?v=JDG4bMHxCH8&t=1823s">this video</a> from Peter and this <a href="https://www.postgresql.org/docs/current/btree-implementation.html#BTREE-DELETION">document page</a>.</li></ul></div><p></p><p>Third, the Postgres optimizer can use too much CPU time in get_actual_variable_range. I don't mind that get_actual_variable_range exists because it is useful for cases where index statistics are not current. But the problem is that for the problematic SQL statement (see the DELETE above and <a href="https://smalldatum.blogspot.com/2024/01/explaining-performance-regression-in.html">this blog post</a>) there is only one good index for the statement. So I prefer the optimizer not do too much work in that case. I have experienced this problem a few times with MySQL. 
One of the fixes from upstream MySQL was to change the optimizer to do less work when there was a FORCE INDEX hint. And with some OLTP workloads where the same statements are so frequent I really don't want the optimizer to use extra CPU time. For the same reason, I get much better throughput from Postgres when prepared statements are enabled and now I always enable them for the range and point queries with Postgres during the insert benchmark, but not for MySQL (because they don't help much with MySQL).</p><p><b>Build + Configuration</b></p><div><div>See the <a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-postgres-9x-to.html">previous report</a> for more details. In all but one case (IO-bound on SER7) I tested these versions: 9.0.23, 9.1.24, 9.2.24, 9.3.25, 9.4.26, 9.5.25, 9.6.24, 10.23, 11.22, 12.17, 13.13, 14.10, 15.5, 16.1. For IO-bound on SER7 tests are from Postgres 12 through 16.</div><div><br />The configuration files are in subdirectories <a href="https://github.com/mdcallag/mytools/tree/master/bench/conf/nuc8i7.ub1804">from here</a>. Search for files named <i>conf.diff.cx9a2_bee</i> and <i>conf.diff.cx9a2_ser7</i> which exist for each major version of Postgres<i>.</i></div><div><br /></div><div><b>The Benchmark</b></div><div><br /></div><div>The benchmark is run with one client.</div><div><br /></div><div>There are two test servers and the SER7 has a faster CPU. More info on the servers <a href="https://smalldatum.blogspot.com/2022/10/small-servers-for-performance-testing-v4.html">is here</a>:</div><div><ul style="text-align: left;"><li>Beelink SER4 with 8 AMD cores, 16G RAM, Ubuntu 22.04 and XFS using 1 m.2 device</li><li>Beelink SER7 with 8 AMD cores, 32G RAM, Ubuntu 22.04 and XFS using 1 m.2 device</li></ul></div><div>The benchmark steps are:<div><p></p><div><ul><li>l.i0</li><ul><li>insert X million rows per table in PK order. The table has a PK index but no secondary indexes. There is one connection per client. 
With a cached workload the value of X is 30M for SER4 and 60M for SER7. With an IO-bound workload X is 800M for both because I forgot to make it larger for SER7.</li></ul><li>l.x</li><ul><li>create 3 secondary indexes per table. There is one connection per client.</li></ul><li>l.i1</li><ul><li>use 2 connections/client. One inserts XM rows and the other does deletes at the same rate as the inserts. Each transaction modifies 50 rows (big transactions). This step is run for a fixed number of inserts, so the run time varies depending on the insert rate. With a cached workload the value of X is 40M. With an IO-bound workload X is 4M.</li></ul><li>l.i2</li><ul><li>like l.i1 but each transaction modifies 5 rows (small transactions) and inserts XM rows total. With a cached workload the value of X is 10M. With an IO-bound workload X is 1M.</li><li>Vacuum the test table, do a checkpoint and wait ~Y seconds to reduce variance during the read-write benchmark steps that follow. The value of Y is based on the size of the table.</li></ul><li>qr100</li><ul><li>use 3 connections/client. One does range queries for Z seconds and performance is reported for this. The second does 100 inserts/s and the third does 100 deletes/s. The second and third are less busy than the first. The range queries use covering secondary indexes. This step is run for a fixed amount of time. If the target insert rate is not sustained then that is considered to be an SLA failure. If the target insert rate is sustained then the step does the same number of inserts for all systems tested. For cached workloads the value of Z was 1800. 
For IO-bound on SER4 it was 1800 and on SER7 it was 7200.</li></ul><li>qp100</li><ul><li>like qr100 except uses point queries on the PK index</li></ul><li>qr500</li><ul><li>like qr100 but the insert and delete rates are increased from 100/s to 500/s</li></ul><li>qp500</li><ul><li>like qp100 but the insert and delete rates are increased from 100/s to 500/s</li></ul><li>qr1000</li><ul><li>like qr100 but the insert and delete rates are increased from 100/s to 1000/s</li></ul><li>qp1000</li><ul><li>like qp100 but the insert and delete rates are increased from 100/s to 1000/s</li></ul></ul></div></div></div></div><div><b>Results</b></div><div><br /></div><div>The performance reports are here for:</div><div><ul style="text-align: left;"><li>Cached: <a href="https://mdcallag.github.io/reports/24_02_12.1u.1tno.mem.bee.pg/all.html">SER4</a> and <a href="https://mdcallag.github.io/reports/24_02_12.1u.1tno.mem.ser7.pg/all.html">SER7</a></li><li>IO-bound: <a href="https://mdcallag.github.io/reports/24_02_12.1u.1tno.io.bee.pg/all.html">SER4</a> and <a href="https://mdcallag.github.io/reports/24_02_12.1u.1tno.io.ser7.pg/all.html">SER7</a></li></ul></div><div>The summary has 3 tables. The first shows absolute throughput for each DBMS tested and each benchmark step. The second has throughput relative to the version on the first row of the table. The third shows the background insert rate for benchmark steps with background inserts; all systems sustained the target rates. The second table makes it easy to see how performance changes over time.</div><div><br /></div><div>Below I use relative QPS to explain how performance changes. It is: (QPS for $me / QPS for $base) where $me is my version and $base is the version of the base case. When relative QPS is > 1.0 then performance improved over time. When it is < 1.0 then there are regressions. 
The Q in relative QPS measures: </div><div><ul><li>insert/s for l.i0, l.i1, l.i2</li><li>indexed rows/s for l.x</li><li>range queries/s for qr100, qr500, qr1000</li><li>point queries/s for qp100, qp500, qp1000</li></ul><div>Below I use colors to highlight the relative QPS values with <span style="background-color: #f4cccc;">red</span> for <= 0.95, <span style="background-color: #d9ead3;">green</span> for >= 1.05 and <span style="background-color: #eeeeee;">grey</span> for values between 0.95 and 1.05.</div><div><br /></div><div>From <a href="https://mdcallag.github.io/reports/24_02_12.1u.1tno.mem.bee.pg/all.html#summary">the summary</a> for SER4 with a cached workload</div></div><div><ul><li>The base case is <span style="text-align: right;">pg9023_def which means Postgres 9.0.23</span></li><li><span style="text-align: right;">For the benchmark steps</span></li><ul><li><span style="text-align: right;">l.i0 - improves in Postgres 9.4 and 11.22 and then is stable</span></li><li><span style="text-align: right;">l.x - improves in Postgres 9.6 and then is stable</span></li><li><span style="text-align: right;">l.i1, l.i2 - </span>improves in Postgres 12 through 14</li><li>qr100, qr500, qr1000 - slow but steady improvements from Postgres 9.2 through 16</li><li>qp100, qp500 - slow but steady improvements from Postgres 9.2 through 16</li><li>qp1000 - stable from Postgres 9 through 16. 
Perhaps this is most affected by the CPU overhead from get_actual_variable_range.</li></ul><li>Comparing throughput in Postgres 16.2 to 9.0.23</li><ul><li>Write-heavy</li><ul><li>l.i0, l.x, l.i1, l.i2 - relative QPS is <span style="background-color: #d9ead3;">1.22</span><span style="background-color: white;">, </span><span style="background-color: #d9ead3;">1.79</span><span style="background-color: white;">, </span><span style="background-color: #d9ead3;">3.38</span><span style="background-color: white;">, </span><span style="background-color: #d9ead3;">2.36</span></li></ul><li><span style="background-color: white;">Range queries</span></li><ul><li>qr100, qr500, qr1000 - relative QPS is<span style="background-color: white;"> </span><span style="background-color: #d9ead3;">1.20</span><span style="background-color: white;">, </span><span style="background-color: #d9ead3;">1.19</span><span style="background-color: white;">, </span><span style="background-color: #d9ead3;">1.23</span></li></ul><li><span style="background-color: white;">Point queries</span></li><ul><li>qp100, qp500, qp1000 - relative QPS is <span style="background-color: #d9ead3;">1.08</span>, <span style="background-color: #d9ead3;">1.10</span>, <span style="background-color: #eeeeee;">1.01</span></li></ul></ul></ul><div>From <a href="https://mdcallag.github.io/reports/24_02_12.1u.1tno.mem.ser7.pg/all.html#summary">the summary</a> for SER7 with a cached workload</div><div><ul style="text-align: left;"><li>The base case is <span style="text-align: right;">pg9023_def which means Postgres 9.0.23</span></li><li><span style="text-align: right;">For the benchmark steps</span></li><ul><li><span style="text-align: right;">l.i0 - improves in Postgres 9.4 and 11.22 and then is stable</span></li><li><span style="text-align: right;">l.x - improves in Postgres 9.6 and 10 and then is stable</span></li><li><span style="text-align: right;">l.i1, l.i2 - </span>improves in Postgres 12 through 14</li><li>qr100, qr500, 
qr1000 - improves in Postgres 9.2 and then is stable or slowly improving</li><li>qp100, qp500, qp1000 - improves in Postgres 9.2 through 9.5 and then slowly gets worse</li></ul><li>Comparing throughput in Postgres 16.2 to 9.0.23</li><ul><li>Write-heavy</li><ul><li>l.i0, l.x, l.i1, l.i2 - relative QPS is <span style="background-color: #d9ead3;">1.34</span><span style="background-color: white;">, </span><span style="background-color: #d9ead3;">1.65</span><span style="background-color: white;">, </span><span style="background-color: #d9ead3;">3.12</span><span style="background-color: white;">, </span><span style="background-color: #d9ead3;">2.42</span></li></ul><li><span style="background-color: white;">Range queries</span></li><ul><li>qr100, qr500, qr1000 - relative QPS is<span style="background-color: white;"> </span><span style="background-color: #d9ead3;">1.44</span><span style="background-color: white;">, </span><span style="background-color: #d9ead3;">1.62</span><span style="background-color: white;">, </span><span style="background-color: #d9ead3;">1.62</span></li></ul><li><span style="background-color: white;">Point queries</span></li><ul><li>qp100, qp500, qp1000 - relative QPS is <span style="background-color: #eeeeee;">1.01</span>, <span style="background-color: #eeeeee;">0.92</span>, <span style="background-color: #eeeeee;">0.99</span></li></ul></ul></ul></div><div>From <a href="https://mdcallag.github.io/reports/24_02_12.1u.1tno.io.bee.pg/all.html#summary">the summary</a> for SER4 with an IO-bound workload</div><div><ul style="text-align: left;"><li>The base case is <span style="text-align: right;">pg9023_def which means Postgres 9.0.23</span></li><li><span style="text-align: right;">For the benchmark steps</span></li><ul><li><span style="text-align: right;">l.i0 - improves in Postgres 11.22 and then is stable</span></li><li><span style="text-align: right;">l.x - improves in Postgres 9.4 through 10 and then is stable</span></li><li><span style="text-align: 
right;">l.i1, l.i2 - </span>improves in Postgres 12 and then is stable</li><li>qr100, qr500, qr1000 - slowly improves from Postgres 9.2 through 11 and then is stable</li><li>qp100, qp500, qp1000 - same as qr100, qr500, qr1000</li></ul><li>Comparing throughput in Postgres 16.2 to 9.0.23</li><ul><li>Write-heavy</li><ul><li>l.i0, l.x, l.i1, l.i2 - relative QPS is <span style="background-color: #d9ead3;">1.21</span><span style="background-color: white;">, </span><span style="background-color: #d9ead3;">2.29</span><span style="background-color: white;">, </span><span style="background-color: #d9ead3;">1.85</span><span style="background-color: white;">, </span><span style="background-color: #d9ead3;">1.85</span></li></ul><li><span style="background-color: white;">Range queries</span></li><ul><li>qr100, qr500, qr1000 - relative QPS is<span style="background-color: white;"> </span><span style="background-color: #d9ead3;">1.15</span><span style="background-color: white;">, </span><span style="background-color: #d9ead3;">1.23</span><span style="background-color: white;">, </span><span style="background-color: #d9ead3;">1.36</span></li></ul><li><span style="background-color: white;">Point queries</span></li><ul><li>qp100, qp500, qp1000 - relative QPS is <span style="background-color: #eeeeee;">0.99</span>, <span style="background-color: #eeeeee;">1.00</span>, <span style="background-color: #eeeeee;">0.98</span></li></ul></ul></ul></div><div>From <a href="https://mdcallag.github.io/reports/24_02_12.1u.1tno.io.ser7.pg/all.html#summary">the summary</a> for SER7 with an IO-bound workload</div><div><ul style="text-align: left;"><li>I only have results from Postgres 12 through 16</li><li>The read-write benchmark steps were run for 7200s vs 1800s above</li><li>Looking at write rates over time for the l.i2 benchmark step, where writes are inserts/s and deletes/s, the rates are ~195/s at the start of the benchmark step and ~155/s at the end. 
I assume the issue is that there is more garbage (non-visible index entries) in the PK index over time, so there is more CPU overhead from get_actual_variable_range having to read and skip them while figuring out the minimum visible value in the index during DELETE statements.</li><li>The base case is <span style="text-align: right;">pg1217_def which means Postgres 12.17. The improvements shown here don't match the results above for IO-bound on the SER4 server because it uses an older (9.0) base case</span></li><li><span style="text-align: right;">For the benchmark steps</span></li><ul><li><span style="text-align: right;">l.i0 - throughput is stable</span></li><li><span style="text-align: right;">l.x - throughput slowly improves from Postgres 13 through 16</span></li><li><span style="text-align: right;">l.i1, l.i2 - with some variance, throughput gets worse from Postgres 12 through 16. From vmstat results normalized by write rates I see a 4% to 7% increase in CPU/operation <a href="https://mdcallag.github.io/reports/24_02_12.1u.1tno.io.ser7.pg/all.html#l.i1.metrics">on SER7</a>. 
If I limit myself to Postgres 12.17 through 16 then I also see a 5% to 8% increase <a href="https://mdcallag.github.io/reports/24_02_12.1u.1tno.io.bee.pg/all.html#l.i1.metrics">on SER4</a>.</span></li><li>qr100, qr500, qr1000 - throughput is stable</li><li>qp100, qp500, qp1000 - throughput is stable</li></ul><li>Comparing throughput in Postgres 16.2 to 12.17</li><ul><li>Write-heavy</li><ul><li>l.i0, l.x, l.i1, l.i2 - relative QPS is <span style="background-color: #d9ead3;">1.00</span><span style="background-color: white;">, </span><span style="background-color: #d9ead3;">1.14</span><span style="background-color: white;">, </span><span style="background-color: #f4cccc;">0.93</span><span style="background-color: white;">, </span><span style="background-color: #f4cccc;">0.88</span></li></ul><li><span style="background-color: white;">Range queries</span></li><ul><li>qr100, qr500, qr1000 - relative QPS is<span style="background-color: white;"> </span><span style="background-color: #eeeeee;">1.02</span><span style="background-color: white;">, </span><span style="background-color: #eeeeee;">1.00</span><span style="background-color: white;">, </span><span style="background-color: #eeeeee;">1.00</span></li></ul><li><span style="background-color: white;">Point queries</span></li><ul><li>qp100, qp500, qp1000 - relative QPS is <span style="background-color: #eeeeee;">0.98</span>, <span style="background-color: #eeeeee;">0.98</span>, <span style="background-color: #eeeeee;">0.98</span></li></ul></ul></ul></div><div><b>Target write rates</b></div><div><b><br /></b></div><div>The third table in the summaries linked above shows the write rates sustained during the read-write benchmark steps. The target write rates are 100/s for qr100 and qp100, 500/s for qr500 and qp500 and then 1000/s for qr1000 and qp1000. Note that X/s means X inserts/s and X deletes/s. When the value is close enough to the target then I assume the target has been sustained. 
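The "close enough" check amounts to a few lines of Python; the 2% tolerance below is my illustrative assumption, not an exact constant used by the reports:

```python
# Decide whether a read-write benchmark step sustained its target write rate.
# The 2% tolerance is an assumption for illustration; the summary tables do
# not use this exact constant.
def sustained(target_per_sec, measured_per_sec, tolerance=0.02):
    return measured_per_sec >= target_per_sec * (1.0 - tolerance)

print(sustained(1000, 995))  # close enough to the qp1000 target
print(sustained(1000, 900))  # missed the qp1000 target
```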
The table cells in red indicate the cases where the target has not been sustained.</div><div><br /></div><div>For cached workloads all versions sustained the target write rates.</div><div><br /></div><div>For IO-bound workloads</div><div><ul style="text-align: left;"><li> Note that SER4 and SER7 had the same amount of data, but SER7 has twice as much RAM so it was less IO-bound. And SER7 has a faster CPU.</li><li>With SER4</li><ul><li>Postgres 9.x, 10 and 11 did not sustain the target write rates during qr1000 and qp1000</li><li>Postgres 13, 14, 15.5 and 16 did not sustain the target write rates during qp1000 but they were close to the target</li></ul><li>With SER7</li><ul><li>All versions sustained the target write rates with SER7.</li></ul></ul></div><div><div><br /></div></div><div><br /></div><div><br /></div></div><p><br /><br /></p>Mark Callaghanhttp://www.blogger.com/profile/09590445221922043181noreply@blogger.com0tag:blogger.com,1999:blog-9149523927864751087.post-17478525578674828632024-02-01T11:56:00.000-08:002024-02-01T11:56:54.362-08:00Updated Insert benchmark: InnoDB/MySQL 5.6, 5.7 and 8.0, small server, IO-bound database<p>This has results for MySQL with InnoDB vs the <a href="https://smalldatum.blogspot.com/2023/12/updates-for-insert-benchmark-december.html">updated Insert Benchmark</a> with an IO-bound workload and 8-core server with results from MySQL versions 5.6 through 8.0. 
Recent results from a cached workload and the same server <a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-innodbmysql-56.html">are here</a>.</p><p>tl;dr</p><p></p><ul style="text-align: left;"><li>Regressions here with the IO-bound workload are smaller than with a <a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-innodbmysql-56.html">cached workload</a> because the extra IO latency often dominates the extra CPU overhead that arrives in modern MySQL.</li><li>Regressions tend to be large between major versions (5.6 -> 5.7, 5.7 -> 8.0). While they were small within 5.6 and 5.7 (5.6.21 -> 5.6.51, 5.7.10 -> 5.7.44), they were also large within 8.0. </li><li>The perf schema continues to have performance problems. The biggest problems are a soon-to-be-fixed bug for parallel create index (<a href="https://smalldatum.blogspot.com/2023/12/create-innodb-indexes-2x-faster-with.html">see here</a>) and a ~15% drop in range query throughput. The drop is larger for range queries than for point queries in this workload because the point queries are much more IO-bound so the IO latency hides the cost of the perf schema.</li></ul><div>Comparing MySQL 8.0.36 with 5.6.21</div><div><ul style="text-align: left;"><li>Initial load (l.i0) throughput is <span style="background-color: #f4cccc;">~2X larger</span> in 5.6</li><li>Write only (l.i1, l.i2) throughput is <span style="background-color: #d9ead3;">~1.2X larger</span> in 8.0</li><li>Range queries (qr*) throughput is <span style="background-color: #f4cccc;">much smaller</span> in 8.0</li><li>Point queries (qp*) throughput is between <span style="background-color: #f4cccc;">~9% smaller</span> <span style="background-color: #eeeeee;">and similar</span><span style="background-color: white;"> in 8.0</span></li></ul></div><p></p><div><b>Build + Configuration</b></div><div><div><div><br /></div><div>I tested many versions of MySQL 5.6, 5.7 and 8.0. These were compiled from source. 
I used the CMake files <a href="https://github.com/mdcallag/mytools/tree/master/bench/build/dec23.cmk.patch.mysql">from here</a> with the <a href="https://github.com/mdcallag/mytools/tree/master/bench/build/dec23.cmk.patch.mysql">patches here</a> to fix problems that otherwise prevent compiling older MySQL releases on modern Ubuntu. In all cases I use the <i><b>rel</b></i> build that uses CMAKE_BUILD_TYPE =Release.<br /><br />I used the cz10a_bee my.cnf files that are here <a href="https://github.com/mdcallag/mytools/blob/master/bench/conf/nuc8i7.ub1804/my56/my.cnf.cz10a_bee">for 5.6</a>, <a href="https://github.com/mdcallag/mytools/tree/master/bench/conf/nuc8i7.ub1804/my57">for 5.7</a> and <a href="https://github.com/mdcallag/mytools/tree/master/bench/conf/nuc8i7.ub1804/my80/etc">for 8.0</a>. For 5.7 and 8.0 there are many variants of that file to make them work on a range of the point releases.</div></div><div><br /></div><div>The versions I tested are:</div><div><ul><li>5.6</li><ul><li>5.6.21, 5.6.31, 5.6.41, 5.6.51</li></ul><li>5.7</li><ul><li>5.7.10, 5.7.20, 5.7.30, 5.7.44</li></ul><li>8.0</li><ul><li>8.0.13, 8.0.14, 8.0.20, 8.0.28, 8.0.35, 8.0.36</li></ul></ul><div>For 8.0.35 I tested a few variations from what is described above to understand the cost of the performance schema:</div></div><div><ul><li><span style="text-align: right;">my8035_rel.cz10aps0_bee</span></li><ul><li><span style="text-align: right;">this uses my.cnf.cz10aps0_bee which is the same as my.cnf.cz10a_bee except it adds performance_schema =0</span></li></ul><li><span style="text-align: right;">my8035_rel_lessps.cz10a_bee</span></li><ul><li><span style="text-align: right;">the build disables as much as possible of the performance schema. 
The CMake file <a href="https://github.com/mdcallag/mytools/blob/master/bench/build/dec23.cmk.patch.mysql/mysql-8.0.35/cmk.80.rel_lessps">is here</a>.</span></li></ul></ul><div style="text-align: right;"><div style="text-align: left;"><b>The Benchmark</b></div><div style="text-align: left;"><br /></div><div style="text-align: left;">The test server is <a href="https://smalldatum.blogspot.com/2022/10/small-servers-for-performance-testing-v4.html">described here</a>. It is a Beelink SER4 with 8 cores, 16G RAM and Ubuntu 22.04. Storage is an m.2 device with XFS and discard enabled. </div><div style="text-align: left;"><br />The benchmark is <a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-postgres-9x-to.html">explained here</a> and is run with 1 client. The benchmark steps are:<div><p></p><div><ul><li>l.i0</li><ul><li>insert 800 million rows per table in PK order. The table has a PK index but no secondary indexes. There is one connection per client.</li></ul><li>l.x</li><ul><li>create 3 secondary indexes per table. There is one connection per client.</li></ul><li>l.i1</li><ul><li>use 2 connections/client. One inserts 4M rows and the other does deletes at the same rate as the inserts. Each transaction modifies 50 rows (big transactions). This step is run for a fixed number of inserts, so the run time varies depending on the insert rate.</li></ul><li>l.i2</li><ul><li>like l.i1 but each transaction modifies 5 rows (small transactions) and 1M rows total</li><li>Work and waiting are done at the end of this step to reduce write-back debt</li></ul><li>qr100</li><ul><li>use 3 connections/client. One does range queries for 1800 seconds and performance is reported for this. The second does 100 inserts/s and the third does 100 deletes/s. The second and third are less busy than the first. The range queries use covering secondary indexes. This step is run for a fixed amount of time. 
If the target insert rate is not sustained then that is considered to be an SLA failure. If the target insert rate is sustained then the step does the same number of inserts for all systems tested.</li></ul><li>qp100</li><ul><li>like qr100 except uses point queries on the PK index</li></ul><li>qr500</li><ul><li>like qr100 but the insert and delete rates are increased from 100/s to 500/s</li></ul><li>qp500</li><ul><li>like qp100 but the insert and delete rates are increased from 100/s to 500/s</li></ul><li>qr1000</li><ul><li>like qr100 but the insert and delete rates are increased from 100/s to 1000/s</li></ul><li>qp1000</li><ul><li>like qp100 but the insert and delete rates are increased from 100/s to 1000/s</li></ul></ul><div><div><b>Results</b></div><div><br /></div><div>The performance reports are here for <a href="https://mdcallag.github.io/reports/24_o2_01.1u.1tno.io.bee.my.56/all.html">MySQL 5.6</a>, <a href="https://mdcallag.github.io/reports/24_o2_01.1u.1tno.io.bee.my.57/all.html">MySQL 5.7</a>, <a href="https://mdcallag.github.io/reports/24_o2_01.1u.1tno.io.bee.my.80/all.html">MySQL 8.0</a> and <a href="https://mdcallag.github.io/reports/24_o2_01.1u.1tno.io.bee.my.all/all.html">MySQL 5.6 to 8.0</a>.<br /><br /></div><div><div>The summary has 3 tables. The first shows absolute throughput by DBMS tested per benchmark step. The second has throughput relative to the version on the first row of the table. The third shows the background insert rate for benchmark steps with background inserts and all systems sustained the target rates. The second table makes it easy to see how performance changes over time.</div><div><br /></div><div>Below I use relative QPS to explain how performance changes. It is: (QPS for $me / QPS for $base) where $me is my version and $base is the version of the base case. When relative QPS is > 1.0 then performance improved over time. When it is < 1.0 then there are regressions. 
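The relative QPS computation and the color thresholds used in these posts (red for <= 0.95, green for >= 1.05, grey in between) can be sketched in Python; the function names are mine:

```python
# Relative QPS is (QPS for my version / QPS for the base case). The color
# convention in these posts: red for <= 0.95, green for >= 1.05 and grey
# in between. Function names are mine, for illustration only.
def relative_qps(qps_me, qps_base):
    return qps_me / qps_base

def color(rel):
    if rel <= 0.95:
        return "red"    # regression
    if rel >= 1.05:
        return "green"  # improvement
    return "grey"       # roughly unchanged

print(color(relative_qps(920, 1000)))   # a regression
print(color(relative_qps(1210, 1000)))  # an improvement
```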
The Q in relative QPS measures: </div><div><ul><li>insert/s for l.i0, l.i1, l.i2</li><li>indexed rows/s for l.x</li><li>range queries/s for qr100, qr500, qr1000</li><li>point queries/s for qp100, qp500, qp1000</li></ul><div>Below I use colors to highlight the relative QPS values with <span style="background-color: #f4cccc;">red</span> for <= 0.95, <span style="background-color: #d9ead3;">green</span> for >= 1.05 and <span style="background-color: #eeeeee;">grey</span> for values between 0.95 and 1.05.</div></div></div><div><br /></div><div>From the summary <a href="https://mdcallag.github.io/reports/24_o2_01.1u.1tno.io.bee.my.56/all.html#summary">for 5.6</a></div><div><ul style="text-align: left;"><li>The base case is 5.6.21</li><li>Comparing 5.6.51 with 5.6.21</li><ul><li>l.i0 - relative QPS is <span style="background-color: #f4cccc;">0.92</span> in 5.6.51</li><li>l.x - relative QPS is <span style="background-color: #eeeeee;">1.01</span> in 5.6.51</li><li>l.i1, l.i2 - relative QPS is <span style="background-color: #eeeeee;">1.00</span>, <span style="background-color: #eeeeee;">0.99</span> in 5.6.51</li><li>qr100, qr500, qr1000 - relative QPS is <span style="background-color: #eeeeee;">0.98</span>, <span style="background-color: #eeeeee;">0.97</span>, <span style="background-color: #eeeeee;">0.98</span> in 5.6.51</li><li>qp100, qp500, qp1000 - relative QPS is <span style="background-color: #eeeeee;">0.96</span>, <span style="background-color: #eeeeee;">0.96</span>, <span style="background-color: #eeeeee;">0.98</span> in 5.6.51</li></ul></ul></div><div><div>From the summary <a href="https://mdcallag.github.io/reports/24_o2_01.1u.1tno.io.bee.my.57/all.html#summary">for 5.7</a></div><div><ul style="text-align: left;"><li>The base case is 5.7.10</li><li>Comparing 5.7.44 with 5.7.10</li><ul><li>l.i0 - relative QPS is <span style="background-color: #eeeeee;">0.96</span> in 5.7.44</li><li>l.x - relative QPS is <span style="background-color: #eeeeee;">0.96</span> in 
5.7.44</li><li>l.i1, l.i2 - relative QPS is <span style="background-color: #eeeeee;">0.96</span>, <span style="background-color: #eeeeee;">0.98</span> in 5.7.44</li><li>qr100, qr500, qr1000 - relative QPS is <span style="background-color: #eeeeee;">0.95</span>, <span style="background-color: #eeeeee;">0.97</span>, <span style="background-color: #eeeeee;">0.97</span> in 5.7.44</li><li>qp100, qp500, qp1000 - relative QPS is <span style="background-color: #eeeeee;">1.00</span>, <span style="background-color: #eeeeee;">0.99</span>, <span style="background-color: #eeeeee;">0.99</span> in 5.7.44</li></ul></ul></div></div><div><div>From the summary <a href="https://mdcallag.github.io/reports/24_o2_01.1u.1tno.io.bee.my.80/all.html#summary">for 8.0</a></div><div><ul><li>The base case is 8.0.13</li><li>Comparing 8.0.36 with 8.0.13</li><ul><li>l.i0 - <span style="background-color: white;">relative QPS is </span><span style="background-color: #f4cccc;">0.81</span> in 8.0.36</li><li>l.x - relative QPS is <span style="background-color: #eeeeee;">1.00</span> in 8.0.36</li><li>l.i1, l.i2 - <span style="background-color: white;">relative QPS is </span><span style="background-color: #eeeeee;">0.98,</span><span style="background-color: white;"> </span><span style="background-color: #f4cccc;">0.93</span><span style="background-color: white;"> in 8.0.36</span></li><li>qr100, qr500, qr1000 - <span style="background-color: white;">relative QPS is </span><span style="background-color: #eeeeee;">1.01</span><span style="background-color: white;">, </span><span style="background-color: #eeeeee;">0.97,</span><span style="background-color: white;"> </span><span style="background-color: #f4cccc;">0.93</span><span style="background-color: white;"> in 8.0.36</span></li><li>qp100, qp500, qp1000 - relative QPS is <span style="background-color: #eeeeee;">0.98</span><span style="background-color: white;">, </span><span style="background-color: #eeeeee;">1.00</span><span style="background-color: 
white;"> and </span><span style="background-color: #eeeeee;">1.01</span> in 8.0.36</li></ul></ul><div>From the summary <a href="https://mdcallag.github.io/reports/24_o2_01.1u.1tno.io.bee.my.80/all.html#summary">for 8.0</a> but focusing on the 8.0.35 variations that disable the perf schema</div><div></div><div><ul style="text-align: left;"><li>Throughput for write-heavy steps (l.i0, l.i1, l.i2) is ~5% better</li><li>Throughput for parallel index create is ~1.5X better (<a href="https://smalldatum.blogspot.com/2023/12/create-innodb-indexes-2x-faster-with.html">read this</a>)</li><li>For read-write benchmark steps</li><ul><li>Throughput for range queries (qr*) is ~15% better</li><li>Throughput for point queries (qp*) is unchanged</li><li>The point query benchmark steps are a lot more IO-bound than the range query steps which might explain why the perf schema cost is larger for range queries here. See the rpq <a href="https://mdcallag.github.io/reports/24_o2_01.1u.1tno.io.bee.my.80/all.html#qr100.L1.metrics">column here</a> which is iostat reads per query and it is less than 0.2 for qr100 but larger than 9 for qp100. 
From the cpupq <a href="https://mdcallag.github.io/reports/24_o2_01.1u.1tno.io.bee.my.80/all.html#qr100.L1.metrics">column here</a>, which measures CPU per query, the perf schema increases CPU by up to 10% for point queries.</li></ul><li>To reduce perf schema overhead it is better to disable it at compile time than via my.cnf</li></ul></div><div><div>From the summary for <a href="https://mdcallag.github.io/reports/24_o2_01.1u.1tno.io.bee.my.all/all.html#summary">5.6, 5.7, 8.0</a></div><div><ul style="text-align: left;"><li>The base case is 5.6.21</li><li>The regressions here are smaller than for a <a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-innodbmysql-56.html">cached workload</a> because the workload here is frequently IO-bound and the extra IO latency often dominates the extra CPU overhead that arrives in modern MySQL.</li><li>Comparing 5.7.44 and 8.0.36 with 5.6.21</li><ul><li>l.i0</li><ul><li>relative QPS is <span style="background-color: #f4cccc;">0.80</span> in 5.7.44</li><li>relative QPS is <span style="background-color: #f4cccc;">0.54</span> in 8.0.36</li></ul><li>l.x</li><ul><li>relative QPS is <span style="background-color: #d9ead3;">1.36</span> in 5.7.44</li><li>relative QPS is <span style="background-color: #d9ead3;">1.30</span> in 8.0.36</li></ul><li>l.i1, l.i2</li><ul><li>relative QPS is <span style="background-color: #d9ead3;">1.30</span><span style="background-color: white;">, </span><span style="background-color: #d9ead3;">1.25</span> in 5.7.44</li><li>relative QPS is <span style="background-color: #d9ead3;">1.29</span><span style="background-color: white;">, </span><span style="background-color: #d9ead3;">1.16</span> in 8.0.36</li></ul><li>qr100, qr500, qr1000</li><ul><li>relative QPS is <span style="background-color: #f4cccc;">0.73</span>, <span style="background-color: #f4cccc;">0.83</span>, <span style="background-color: #f4cccc;">0.92</span> in 5.7.44</li><li>relative QPS is <span style="background-color: 
#f4cccc;">0.68</span>, <span style="background-color: #f4cccc;">0.77</span>, <span style="background-color: #f4cccc;">0.85</span> in 8.0.36</li></ul><li>qp100, qp500, qp1000</li><ul><li>relative QPS is <span style="background-color: #eeeeee;">0.96</span>, <span style="background-color: #eeeeee;">0.96</span>, <span style="background-color: #eeeeee;">1.02</span> in 5.7.44</li><li>relative QPS is <span style="background-color: #f4cccc;">0.91</span>, <span style="background-color: #f4cccc;">0.93</span>, <span style="background-color: #eeeeee;">1.00</span> in 8.0.36</li></ul></ul></ul></div></div></div></div></div></div></div></div></div></div></div>Mark Callaghanhttp://www.blogger.com/profile/09590445221922043181noreply@blogger.com0tag:blogger.com,1999:blog-9149523927864751087.post-70837857561990203082024-02-01T10:34:00.000-08:002024-02-01T11:11:37.135-08:00Updated Insert benchmark: Postgres 9.x to 16.x, large server, cached database<p>This has results for Postgres vs the <a href="https://smalldatum.blogspot.com/2023/12/updates-for-insert-benchmark-december.html">updated Insert Benchmark</a> with a cached workload and 24-core server with results from Postgres versions 9.0 through 16.</p><p>tl;dr</p><p></p><ul style="text-align: left;"><li>Postgres does a great job at avoiding regressions over time</li><li>Postgres 16.1 is a lot faster than 9.0.23, between ~1.2X and ~10X depending on the workload</li></ul><p></p><p><b>Build + Configuration</b></p><div><div>See the <a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-postgres-9x-to.html">previous report</a> for more details. I used these versions: 9.0.23, 9.1.24, 9.2.24, 9.3.25, 9.4.26, 9.5.25, 9.6.24, 10.23, 11.22, 12.17, 13.13, 14.10, 15.5, 16.1. </div><div><br />The configuration files are in subdirectories <a href="https://github.com/mdcallag/mytools/tree/master/bench/conf/nuc8i7.ub1804">from here</a>. 
Search for files named <i>conf.diff.cx9a2_c24r64</i>, which exist for each major version of Postgres<i>.</i></div></div><div><i><br /></i></div><div><div><b>The Benchmark</b></div><div><br /></div><div>The benchmark is <a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-postgres-9x-to.html">explained here</a> and is run with 16 clients.</div><div><br /></div><div>The test server is <a href="https://smalldatum.blogspot.com/2022/10/small-servers-for-performance-testing-v4.html">described here</a>. It is a SuperMicro SuperWorkstation 7049A-T with 2 sockets, 24 cores/socket, hyperthreading disabled, 64G RAM and an NVMe SSD. It runs Ubuntu 22.04 and the database filesystem uses XFS with discard enabled.</div><div><br />The benchmark steps are:<div><p></p><div><ul><li>l.i0</li><ul><li>insert 20 million rows per table in PK order. The table has a PK index but no secondary indexes. There is one connection per client.</li></ul><li>l.x</li><ul><li>create 3 secondary indexes per table. There is one connection per client.</li></ul><li>l.i1</li><ul><li>use 2 connections/client. One inserts 16M rows and the other does deletes at the same rate as the inserts. Each transaction modifies 50 rows (big transactions). This step is run for a fixed number of inserts, so the run time varies depending on the insert rate.</li></ul><li>l.i2</li><ul><li>like l.i1 but each transaction modifies 5 rows (small transactions) and 4M rows total</li><li>Waiting, vacuum and checkpoint are done at the end of this test step to reduce variance in the steps that follow.</li></ul><li>qr100</li><ul><li>use 3 connections/client. One does range queries for 1800 seconds and performance is reported for this. The second does 100 inserts/s and the third does 100 deletes/s. The second and third are less busy than the first. The range queries use covering secondary indexes. This step is run for a fixed amount of time. 
If the target insert rate is not sustained then that is considered to be an SLA failure. If the target insert rate is sustained then the step does the same number of inserts for all systems tested.</li></ul><li>qp100</li><ul><li>like qr100 except uses point queries on the PK index</li></ul><li>qr500</li><ul><li>like qr100 but the insert and delete rates are increased from 100/s to 500/s</li></ul><li>qp500</li><ul><li>like qp100 but the insert and delete rates are increased from 100/s to 500/s</li></ul><li>qr1000</li><ul><li>like qr100 but the insert and delete rates are increased from 100/s to 1000/s</li></ul><li>qp1000</li><ul><li>like qp100 but the insert and delete rates are increased from 100/s to 1000/s</li></ul></ul><div><div><b>Results</b></div><div><br /></div><div>The performance report <a href="https://mdcallag.github.io/reports/24_02_01.16u.1tno.socket2.mem.pg/all.html">is here</a>.</div><div><br /></div><div>The summary has 3 tables. The first shows absolute throughput by DBMS tested per benchmark step. The second has throughput relative to the version on the first row of the table. The third shows the insert rate for the read-write benchmark steps that have background inserts and deletes and all systems sustained the target rates. The second table makes it easy to see how performance changes over time.</div><div><br /></div><div>Below I use relative QPS to explain how performance changes. It is: (QPS for $me / QPS for $base) where $me is my version and $base is the version of the base case. When relative QPS is > 1.0 then performance improved over time. When it is < 1.0 then there are regressions. 
The Q in relative QPS measures: </div><div><ul><li>insert/s for l.i0, l.i1, l.i2</li><li>indexed rows/s for l.x</li><li>range queries/s for qr100, qr500, qr1000</li><li>point queries/s for qp100, qp500, qp1000</li></ul><div>Below I use colors to highlight the relative QPS values with <span style="background-color: #f4cccc;">red</span> for <= 0.95, <span style="background-color: #d9ead3;">green</span> for >= 1.05 and <span style="background-color: #eeeeee;">grey</span> for values between 0.95 and 1.05.</div><div><br /></div><div>From <a href="https://mdcallag.github.io/reports/24_02_01.16u.1tno.socket2.mem.pg/all.html#summary">the summary</a>:</div></div><div><ul><li>The base case is <span style="text-align: right;">pg9023_def which means Postgres 9.0.23</span></li><li><span style="text-align: right;">For most of the read-write benchmark steps throughput improves a lot from 9.1.24 to 9.2.24 and has been stable since then. The exception is the last step (qp1000) for which throughput is flat. It might be that writeback and/or vacuum hurts query throughput by that point.</span></li><li><span style="text-align: right;">For the write-heavy steps (l.i0, l.x, l.i1, l.i2) throughput improves a lot</span></li><ul><li><span style="text-align: right;">l.i0 - things get a lot better in Postgres 9.4.26</span></li><li><span style="text-align: right;">l.x - things get worse from 9.3.25 through 10.23 and then improve with 11.22</span></li><li><span style="text-align: right;">l.i1 - things get a lot better in Postgres 9.5.25 and then again in 12.17</span></li><li><span style="text-align: right;">l.i2 - things get better in 9.5, worse in 9.6 through 11, better in 12 and then are stable. 
I assume most of the changes are from problems and improvements related to query planner CPU overhead during DELETE statements (<a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-postgres-9x-to_10.html">see the comments</a> about get_actual_variable_range)</span></li></ul><li>Comparing throughput in Postgres 16.1 to 9.0.23</li><ul><li>Write-heavy</li><ul><li>l.i0, l.x, l.i1, l.i2 - relative QPS is <span style="background-color: #d9ead3;">3.12</span><span style="background-color: white;">, </span><span style="background-color: #d9ead3;">1.17</span><span style="background-color: white;">, </span><span style="background-color: #d9ead3;">10.42</span><span style="background-color: white;">, </span><span style="background-color: #d9ead3;">2.14</span></li></ul><li><span style="background-color: white;">Range queries</span></li><ul><li>qr100, qr500, qr1000 - relative QPS is <span style="background-color: #d9ead3;">1.23</span>, <span style="background-color: #d9ead3;">1.34</span>, <span style="background-color: #d9ead3;">1.49</span></li></ul><li><span style="background-color: white;">Point queries</span></li><ul><li>qp100, qp500, qp1000 - relative QPS is <span style="background-color: #d9ead3;">1.25</span>, <span style="background-color: #d9ead3;">1.29</span>, <span style="background-color: #d9ead3;">1.46</span></li></ul></ul></ul><div><br /></div></div></div></div></div></div></div>Mark Callaghanhttp://www.blogger.com/profile/09590445221922043181noreply@blogger.com0tag:blogger.com,1999:blog-9149523927864751087.post-66585183690443484822024-01-28T11:51:00.000-08:002024-01-28T15:52:11.701-08:00Explaining a performance regression in Postgres 14<p>I am trying to explain a performance regression that arrives in Postgres 14 during the <a href="https://smalldatum.blogspot.com/2023/12/updates-for-insert-benchmark-december.html">Insert Benchmark</a>.</p><p>The primary problem appears to be more CPU used by the query planner for DELETE statements when the 
predicates in the WHERE clause have constants that fall into either the max or min histogram bucket for a given column. An example is a DELETE statement like the following, where <i>transactionid</i> is the primary key so there is an index on it.<br /></p><blockquote><span style="font-family: inherit;">delete from t1 where (transactionid>=100 and transactionid<110)</span></blockquote><p></p><p>The table is used like a queue -- inserts are done in increasing order with respect to <i>transactionid</i> and when N rows are inserted, then N more rows are deleted to keep the size of the table constant. The rows to be deleted are the N rows with the smallest value for <i>transactionid</i>.<br /><br />The problem is worse for IO-bound workloads (<a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-postgres-9x-to_27.html">see here</a>) than for cached workloads (<a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-postgres-9x-to_10.html">see here</a>) probably because the extra work done by the query planner involves accessing the index and possibly reading data from storage.<br /><br />It is always possible I am doing something wrong but I suspect there is a fixable performance regression in Postgres 14 for this workload. The workload is <a href="https://smalldatum.blogspot.com/2023/12/updates-for-insert-benchmark-december.html">explained here</a> and note that <i>vacuum (analyze)</i> is done between the write-heavy and read-heavy benchmark steps.</p><p>There are three issues:</p><p></p><ol style="text-align: left;"><li>There is only one good index for the DELETE statement, yet the query planner does (too much) work to figure out the selectivity for that index.</li><li>When the constants used in WHERE clause predicates fall into the largest or smallest histogram bucket for a column, then the query planner reads from the index to figure out the real min or max value in the index. 
The code for this is in the function get_actual_variable_range.</li><li>Extra work is done while reading from the index because there are too many entries that can be removed by vacuum but have not yet been removed. So the index scan encounters and then skips them for a while until it reaches a visible entry.</li></ol><p></p><p>Issue #3 is made worse by the workload. The table is used like a queue. There is a sequence for the PK column and inserts are done in ascending order, getting new values from that sequence. Deletes are done at the other end of the table -- each delete statement deletes the N rows with the smallest value for the PK column. Similar problems can occur with InnoDB and MyRocks -- I know from experience.<br /><br />I suspect the solution in this case is to not try as hard to figure out selectivity when there is only one good index (fix issue #1), although it might help to do something about issue #2 as well.</p><p><b>Request 1</b></p><p>Can the query planner do less work when there is only one index that should be used? 
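A toy Python model (not Postgres code) of issues #2 and #3: when the table is used like a queue, a probe for the actual minimum visible key, which is what get_actual_variable_range does, must first skip every dead index entry left behind by the deletes:

```python
# Toy model of probing for the minimum visible key in a queue-like table:
# rows with the smallest PK values were deleted but their index entries are
# not yet removed by vacuum, so an ascending index scan must skip all of
# them before finding a visible entry. This is an illustration, not the
# actual Postgres implementation.
entries = [(pk, pk >= 5000) for pk in range(10_000)]  # (key, visible)

def probe_min_visible(index):
    skipped = 0
    for key, visible in index:
        if visible:
            return key, skipped
        skipped += 1  # dead entry: CPU (and maybe IO) spent for nothing
    return None, skipped

key, skipped = probe_min_visible(entries)
print(key, skipped)  # the probe skipped 5000 dead entries to find key 5000
```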
The full DDL for the table <a href="https://gist.github.com/mdcallag/bb2b01d4c52a7a25929125f48aa22644">is here</a>.</p><p>An abbreviated version of the DDL is below and the PK is on transactionid which uses a sequence.</p><div style="text-align: left;"><span style="font-family: courier;"> Column | Type |<br />----------------+-----------------------------+<br /> transactionid | bigint |<br /> dateandtime | timestamp without time zone |<br /> cashregisterid | integer |<br /> customerid | integer |<br /> productid | integer |<br /> price | integer |<br /> data | character varying(4000) |<br />Indexes:<br /> "pi1_pkey" PRIMARY KEY, btree (transactionid)<br /> "pi1_marketsegment" btree (productid, customerid, price)<br /> "pi1_pdc" btree (price, dateandtime, customerid)<br /> "pi1_registersegment" btree (cashregisterid, customerid, price)<br />Access method: heap</span></div><p>For a DELETE statement like the following, the only efficient index is pi1_pkey. So I prefer that the query planner do less work to figure that out.</p><p></p><blockquote>delete from t1 where (transactionid>=100 and transactionid<110)</blockquote><p></p><p><b>CPU overhead</b></p><p>When I run the Insert Benchmark there are 6 read-write benchmark steps -- 3 that do range queries as fast as possible, 3 that do point queries as fast as possible. For all of them there are also inserts and deletes done concurrent with the range queries and they are rate limited -- first at 100 inserts/s and 100 deletes/s, then at 500 inserts/s and 500 deletes/s and finally at 1000 inserts/s and 1000 deletes/s. So the work for writes (inserts & deletes) is fixed per benchmark step while the work done by queries is not. Also, for each benchmark step there are three connections -- one for queries, one for inserts, one for deletes. 
<br /><br />Using separate connections makes it easier to spot changes in CPU overhead and below I show the number of CPU seconds for the range query benchmark steps (qr100, qr500, qr1000) where the number indicates the write (insert & delete) rate. Results are provided for Postgres 13.13 and 14.10 from the benchmark I <a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-postgres-9x-to_27.html">described here</a> (small server, IO-bound).</p><p>From below I see two problems. First, the CPU overhead for the delete connection is much larger with Postgres 14.10 for all benchmark steps (qr100, qr500, qr1000). Second, the CPU overhead for the query connection is much larger with Postgres 14.10 for qr1000, the benchmark step with the largest write rate.</p><div style="text-align: left;"><span style="font-family: courier;">Legend<br />* ins = connection that does inserts<br />* del = connection that does deletes<br />* query = connection that does range queries</span></div><div style="text-align: left;"><span style="font-family: courier;"><br />CPU seconds with 100 inserts/s, 100 deletes/s -> qr100<br /> ins del query<br />13.13 5 14 1121<br />14.10 15 187 1148<br /><br /></span></div><div style="text-align: left;"><span style="font-family: courier;">CPU seconds with 500 inserts/s, 500 deletes/s -> qr500<br /> ins del query<br />13.13 71 71 1128<br />14.10 73 1050 1144<br /><br /></span></div><div style="text-align: left;"><span style="font-family: courier;">CPU seconds with 1000 inserts/s, 1000 deletes/s -> qr1000<br /> ins del query<br />13.13 135 1113 1129<br />14.10 151 2912 1906</span></div><p style="text-align: left;"><b>Debugging after the fact: CPU profiling</b></p><p style="text-align: left;">I repeated the benchmark for Postgres 13.13 and 14.10 and after it finished repeated the qr100 benchmark step a few times for each of Postgres 13.13 and 14.10. 
The things that I measure here don't match exactly what happens during the benchmark because the database might be in a better state with respect to write back and vacuum.</p><p style="text-align: left;">While this is far from scientific, I used explain analyze on a few DELETE statements some time after they were used. The results <a href="https://gist.github.com/mdcallag/2fcd0ce762592420e8e49fe1dfd5696f">are here</a>. I repeated the statement twice for each Postgres release and the planning time for the first explain is 49.985ms for Postgres 13.13 vs 100.660ms for Postgres 14.10.<br /><br />So I assume the problem is the CPU overhead from the planner and not from executing the statement.</p><p style="text-align: left;">Then I looked at the CPU seconds used by the connection that does deletes after running for 10 minutes and it was ~50s for Postgres 13.13 vs ~71s for 14.10. So the difference at this point is large, but much smaller than what I report above which means the things I want to spot via CPU profiling might be harder to spot. Also, if the problem is IO latency rather than CPU overhead then CPU profiling won't be as useful.<br /><br /><a href="https://gist.github.com/mdcallag/1ee2b5972732efa6f588db82ae100dd4">This gist</a> has the top-5 call stacks from hierarchical profiling with perf for the connection that does deletes. 
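To put those measurements in perspective, the ratios work out as follows (plain arithmetic on the numbers quoted above):

```python
# Planning time for the first explain analyze of the DELETE statement
plan_ms_1313, plan_ms_1410 = 49.985, 100.660
print(f"planning time: {plan_ms_1410 / plan_ms_1313:.2f}x")  # ~2.01x in 14.10

# CPU seconds for the delete connection after 10 minutes of the repeated qr100 step
cpu_1313, cpu_1410 = 50.0, 71.0
print(f"delete-connection CPU: {cpu_1410 / cpu_1313:.2f}x")  # ~1.42x

# versus the full benchmark's qr100 step (CPU seconds table above: 14 vs 187)
print(f"full benchmark qr100: {187 / 14:.1f}x")  # ~13.4x, so the repeat understates the gap
```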
While there isn't an obvious difference between Postgres 13.13 and 14.10 there is something I don't like -- all stacks are from the query planner and include the function get_actual_variable_range.</p><p style="text-align: left;"><b>IO profiling</b></p><p style="text-align: left;">It looks like the query planner does more read IO for delete statements in Postgres 14.10 than in 13.13.</p><p style="text-align: left;">From the full benchmark I see the following for the range query benchmark steps: there is more read IO (see the rps column) with Postgres 14.10 for the qr100 and qr500 benchmark steps but not for the qr1000 benchmark step. And in all cases the range query rate (see the qps column) is significantly less with Postgres 14.10.</p><div style="text-align: left;"><span style="font-family: courier;">Legend:<br />* qps = range queries/s<br />* rps = read IO requests/s per iostat</span></div><div style="text-align: left;"><span style="font-family: courier;"><br /> qr100<br />version qps rps<br />13.13 8338.2 166.5<br />14.10 5822.6 183.5</span></div><div style="text-align: left;"><span style="font-family: courier;"><br /> qr500<br />version qps rps<br />13.13 8101.7 615.6<br />14.10 5917.9 885.6</span></div><div style="text-align: left;"><span style="font-family: courier;"><br /> qr1000<br />version qps rps<br />13.13 7090.1 1682.9<br />14.10 5139.0 1036.2</span></div><p style="text-align: left;"><br /></p>Mark Callaghanhttp://www.blogger.com/profile/09590445221922043181noreply@blogger.com0tag:blogger.com,1999:blog-9149523927864751087.post-84450785303131904952024-01-27T09:59:00.000-08:002024-01-27T09:59:25.008-08:00Updated Insert benchmark: Postgres 9.x to 16.x, small server, IO-bound database<p>This has results for Postgres vs the <a href="https://smalldatum.blogspot.com/2023/12/updates-for-insert-benchmark-december.html">Insert Benchmark</a> on a small server with an IO-bound workload.
I include results for the latest point release from all major versions from 9.0 to 16.</p><div>tl;dr</div><div><ul style="text-align: left;"><li>While there are no regressions in the CPU-bound (<a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-postgres-9x-to_24.html">cached</a>) workload there are regressions here</li><li>There are two changes related to get_actual_variable_range and get_actual_variable_endpoint (a <a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-postgres-9x-to_10.html">previous post</a> also explained this). Note some parts of this workload are not typical and regressions I find here aren't relevant to many other workloads.</li><ul><li>Starting in Postgres 12 the throughput for l.i1 and l.i2 improves by ~2X because the CPU overhead from the query planner during DELETE statements has been reduced.</li><li>Starting in Postgres 14 the throughput for range queries decreases by ~30% because the CPU overhead for range queries and DELETE statements has grown. I am still debugging this.</li></ul><li>Most versions were unable to sustain the target write rates (1000 inserts/s and 1000 delete/s) during the qr1000 and qp1000 benchmark steps. 
Only Postgres 12.17 and 13.13 were able to sustain it, most others were far from the target and the worst were Postgres 14.10, 15.5 and 16.1.</li><li>Something changed for the worse in Postgres 14 that increases CPU overhead for queries and DELETE statements in this workload.</li></ul>Comparing throughput in Postgres 16.1 to 9.0.23<br /><ul style="text-align: left;"><li>Write-heavy - Postgres 16.1 is between 1.2X and 2.3X faster than 9.0.23</li><li><span style="background-color: white;">Range queries - Postgres 16.1 is up to ~20% slower than 9.0.23</span></li><li><span style="background-color: white;">Point queries - Postgres 16.1 is similar to 9.0.23</span></li></ul><div><ul></ul></div><div><p><b>Build + Configuration</b></p><div><div>See the <a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-postgres-9x-to.html">previous report</a> for more details. I tested these versions: 9.0.23, 9.1.24, 9.2.24, 9.3.25, 9.4.26, 9.5.25, 9.6.24, 10.23, 11.22, 12.17, 13.13, 14.10, 15.5, 16.1. </div><div><br />The configuration files are in subdirectories <a href="https://github.com/mdcallag/mytools/tree/master/bench/conf/nuc8i7.ub1804">from here</a>. Search for files named <i>conf.diff.cx9a2_bee</i> which exist for each major version of Postgres<i>.</i></div><div><br /></div><div><b>The Benchmark</b></div><div><br /></div><div>The test server is a Beelink SER4 with 8 AMD cores, 16G RAM, Ubuntu 22.04 and XFS using 1 m.2 device.</div><div><br />The benchmark steps are:<div><p></p><div><ul><li>l.i0</li><ul><li>insert 800 million rows per table in PK order. The table has a PK index but no secondary indexes. There is one connection per client.</li></ul><li>l.x</li><ul><li>create 3 secondary indexes per table. There is one connection per client.</li></ul><li>l.i1</li><ul><li>use 2 connections/client. One inserts 4M rows and the other does deletes at the same rate as the inserts. Each transaction modifies 50 rows (big transactions). 
This step is run for a fixed number of inserts, so the run time varies depending on the insert rate.</li></ul><li>l.i2</li><ul><li>like l.i1 but each transaction modifies 5 rows (small transactions) and inserts 1M rows total</li><li>Wait for X seconds after the step finishes to reduce variance during the read-write benchmark steps that follow.</li></ul><li>qr100</li><ul><li>use 3 connections/client. One does range queries for 1800 seconds and performance is reported for this. The second does 100 inserts/s and the third does 100 deletes/s. The second and third are less busy than the first. The range queries use covering secondary indexes. This step is run for a fixed amount of time. If the target insert rate is not sustained then that is considered to be an SLA failure. If the target insert rate is sustained then the step does the same number of inserts for all systems tested.</li></ul><li>qp100</li><ul><li>like qr100 except uses point queries on the PK index</li></ul><li>qr500</li><ul><li>like qr100 but the insert and delete rates are increased from 100/s to 500/s</li></ul><li>qp500</li><ul><li>like qp100 but the insert and delete rates are increased from 100/s to 500/s</li></ul><li>qr1000</li><ul><li>like qr100 but the insert and delete rates are increased from 100/s to 1000/s</li></ul><li>qp1000</li><ul><li>like qp100 but the insert and delete rates are increased from 100/s to 1000/s</li></ul></ul><div><div><b>Results</b></div><div><br /></div><div>The performance report <a href="https://mdcallag.github.io/reports/24_01_27.1u.1tno.io.bee.pg/all.html">is here</a>.</div><div><br /></div><div>The summary has 3 tables. The first shows absolute throughput by DBMS tested X benchmark step. The second has throughput relative to the version on the first row of the table. The third shows the background insert rate for benchmark steps with background inserts and all systems sustained the target rates.
The second table makes it easy to see how performance changes over time.</div><div><br /></div><div>Below I use relative QPS to explain how performance changes. It is: (QPS for $me / QPS for $base) where $me is my version and $base is the version of the base case. When relative QPS is > 1.0 then performance improved over time. When it is < 1.0 then there are regressions. The Q in relative QPS measures: </div><div><ul><li>insert/s for l.i0, l.i1, l.i2</li><li>indexed rows/s for l.x</li><li>range queries/s for qr100, qr500, qr1000</li><li>point queries/s for qp100, qp500, qp1000</li></ul><div>Below I use colors to highlight the relative QPS values with <span style="background-color: #f4cccc;">red</span> for <= 0.95, <span style="background-color: #d9ead3;">green</span> for >= 1.05 and <span style="background-color: #eeeeee;">grey</span> for values between 0.95 and 1.05.</div><div><br /></div><div>From <a href="https://mdcallag.github.io/reports/24_01_27.1u.1tno.io.bee.pg/all.html#summary">the summary</a>:</div></div><div><ul><li>The base case is <span style="text-align: right;">pg9023_def which means Postgres 9.0.23</span></li><li><span style="text-align: right;">For the read-heavy benchmark steps that do range queries (qr100, qr500, qr1000) throughput improved between Postgres 9.2 and 13 and then it drops by ~30% in Postgres 14.10 and I confirmed the drop is also in Postgres 14.0. 
I will start to explain this in another post.</span></li><li><span style="text-align: right;">For the read-heavy benchmark steps that do point queries (qp100, qp500, qp1000) throughput is mostly unchanged from 9.0.23 through 16.1.</span></li><li><span style="text-align: right;">For the write-heavy steps (l.i0, l.x, l.i1, l.i2) throughput improves a lot</span></li><ul><li><span style="text-align: right;">l.i0 - things get a lot better in Postgres 11.22</span></li><li><span style="text-align: right;">l.x - things get a lot better between Postgres 9.4.26 and 11.22</span></li><li><span style="text-align: right;">l.i1, l.i2 - things get a lot better in Postgres 12.17 likely because the query planner</span> overhead during DELETE statements has been reduced (<a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-postgres-9x-to_10.html" style="text-align: right;">see the comments</a><span style="text-align: right;"> </span><span style="text-align: right;">about get_actual_variable_range)</span></li></ul><li>Comparing throughput in Postgres 16.1 to 9.0.23</li><ul><li>Write-heavy -- Postgres 16 is faster</li><ul><li>l.i0, l.x, l.i1, l.i2 - relative QPS is <span style="background-color: #d9ead3;">1.22</span><span style="background-color: white;">, </span><span style="background-color: #d9ead3;">2.32</span><span style="background-color: white;">, </span><span style="background-color: #d9ead3;">1.83</span><span style="background-color: white;">, </span><span style="background-color: #d9ead3;">1.87</span></li></ul><li><span style="background-color: white;">Range queries -- Postgres 16 is mostly slower</span></li><ul><li>qr100, qr500, qr1000 - relative QPS is <span style="background-color: #f4cccc;">0.81</span>, <span style="background-color: #f4cccc;">0.89</span>, <span style="background-color: #eeeeee;">1.01</span></li></ul><li><span style="background-color: white;">Point queries -- Postgres 16 is slightly slower</span></li><ul><li>qp100, qp500, qp1000 - 
relative QPS is <span style="background-color: #eeeeee;">0.98</span>, <span style="background-color: #eeeeee;">0.96</span>, <span style="background-color: #eeeeee;">1.00</span></li></ul></ul></ul><div><b>Target write rates</b></div><div><b><br /></b></div><div>The third table in the summary shows the write rates sustained during the read-write benchmark steps. The target write rates are 100/s for qr100 and qp100, 500/s for qr500 and qp500 and then 1000/s for qr1000 and qp1000. Note that X/s means X inserts/s and X deletes/s. When the value is close enough to the target then I assume the target has been sustained. The table cells in red indicate the cases where the target has not been sustained.</div><div><ul style="text-align: left;"><li>For qr100, qp100, qr500, qp500 -- all versions sustained the targets</li><li>For qr1000, qp1000 - only Postgres 12.17 and 13.13 sustained the targets.</li></ul><div>One session is used for INSERT statements and another for DELETE statements. They run at the same rate so if one session runs slow, then both will be slow. I assume the problem here is that DELETE processing is slow and this is related to changes in get_actual_variable_range.</div><div><br /></div><div>The following table shows the number of CPU seconds consumed per connection during the qp1000 benchmark step. 
There is:</div><div><ul style="text-align: left;"><li>a big increase in CPU starting in 12.17 for the query connection</li><li>a big decrease in CPU starting in 12.17 for the delete connection</li><li>a big increase in CPU starting in 14.10 for the query and delete connection</li></ul></div><div><span style="font-family: courier;">CPU seconds per connection during qp1000</span></div><div><div><span style="font-family: courier;">* query = connection that does point queries</span></div><div><span style="font-family: courier;">* ins = connection that does inserts</span></div><div><span style="font-family: courier;">* del = connection that does deletes</span></div><div><span style="font-family: courier;"><br /></span></div><div><span style="font-family: courier;"> query ins del</span></div><div><span style="font-family: courier;">11.22 626 157 3657</span></div><div><span style="font-family: courier;">12.17 311 144 1671</span></div><div><span style="font-family: courier;">13.13 312 145 1758</span></div><div><span style="font-family: courier;">14.10 595 158 3596</span></div><div><span style="font-family: courier;">15.5 609 156 3714</span></div><div><span style="font-family: courier;">16.1 612 158 3716</span></div></div><div><br /></div></div><div><br /></div><div><br /></div><div><br /></div></div></div></div></div></div></div></div></div>Mark Callaghanhttp://www.blogger.com/profile/09590445221922043181noreply@blogger.com2tag:blogger.com,1999:blog-9149523927864751087.post-56975297866485725402024-01-25T11:37:00.000-08:002024-01-25T11:37:41.628-08:00Updated Insert benchmark: InnoDB/MySQL 5.6, 5.7 and 8.0, small server, cached database<p>I now have 4 server types at home (8 cores + 16G RAM, 8 cores + 32G RAM, 24 cores, 32 cores) and am trying to finish a round of the <a href="https://smalldatum.blogspot.com/2023/12/updates-for-insert-benchmark-december.html">Insert Benchmark</a> for each. 
This has results for the smallest (8 cores + 16G RAM) using a cached workload and MySQL 5.6, 5.7, 8.0.</p><p>tl;dr</p><p></p><ul style="text-align: left;"><li>For this setup MySQL has large regressions over time while Postgres <a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-postgres-9x-to_24.html">does not</a></li><li>The regressions in MySQL are large here, but smaller on workloads with more concurrency</li><li>There are few regressions within the 5.6 and 5.7 release cycles</li><li>There are large regressions within the 8.0 release cycle</li><li>There are large regressions at the start of the 5.7 and 8.0 release cycles</li><li>Enabling the perf schema reduces throughput by ~4% for most write heavy benchmark steps, by ~10% for read heavy benchmark steps and a lot more for index create</li></ul><p></p><div><b>Build + Configuration</b></div><div><div><div><br /></div><div>I tested many versions of MySQL 5.6, 5.7 and 8.0. These were compiled from source. I used the CMake files <a href="https://github.com/mdcallag/mytools/tree/master/bench/build/dec23.cmk.patch.mysql">from here</a> with the <a href="https://github.com/mdcallag/mytools/tree/master/bench/build/dec23.cmk.patch.mysql">patches here</a> to fix problems that otherwise prevent compiling older MySQL releases on modern Ubuntu. In all cases I use the <i><b>rel</b></i> build that uses CMAKE_BUILD_TYPE=Release.<br /><br />I used the cz10a_bee my.cnf files that are here <a href="https://github.com/mdcallag/mytools/blob/master/bench/conf/nuc8i7.ub1804/my56/my.cnf.cz10a_bee">for 5.6</a>, <a href="https://github.com/mdcallag/mytools/tree/master/bench/conf/nuc8i7.ub1804/my57">for 5.7</a> and <a href="https://github.com/mdcallag/mytools/tree/master/bench/conf/nuc8i7.ub1804/my80/etc">for 8.0</a>. 
For 5.7 and 8.0 there are many variants of that file to make them work on a range of the point releases.</div></div><div><br /></div><div>The versions I tested are:</div><div><ul><li>5.6</li><ul><li>5.6.21, 5.6.31, 5.6.41, 5.6.51</li></ul><li>5.7</li><ul><li>5.7.10, 5.7.20, 5.7.30, 5.7.44</li></ul><li>8.0</li><ul><li>8.0.13, 8.0.14, 8.0.20, 8.0.28, 8.0.35, 8.0.36</li></ul></ul><div>For 8.0.35 I tested a few variations from what is described above to understand the cost of the performance schema:</div></div><div><ul><li><span style="text-align: right;">my8035_rel.cz10aps0_bee</span></li><ul><li><span style="text-align: right;">this uses my.cnf.cz10aps0_bee which is the same as my.cnf.cz10a_bee except it adds performance_schema=0</span></li></ul><li><span style="text-align: right;">my8035_rel_lessps.cz10a_bee</span></li><ul><li><span style="text-align: right;">the build disables as much as possible of the performance schema. The CMake file <a href="https://github.com/mdcallag/mytools/blob/master/bench/build/dec23.cmk.patch.mysql/mysql-8.0.35/cmk.80.rel_lessps">is here</a>.</span></li></ul></ul><div style="text-align: right;"><div style="text-align: left;"><b>Benchmark</b></div><div style="text-align: left;"><div><br /></div><div>The test server is a Beelink SER4 with 8 cores, 16G RAM, Ubuntu 22.04 and XFS using 1 m.2 device. The benchmark is run with one client.</div><div><br /></div><div>I used the updated Insert Benchmark so there are more benchmark steps described below. In order, the benchmark steps are:</div><p></p><div><ul><li>l.i0</li><ul><li>insert 30 million rows per table in PK order. The table has a PK index but no secondary indexes. There is one connection per client.</li></ul><li>l.x</li><ul><li>create 3 secondary indexes per table. There is one connection per client.</li></ul><li>l.i1</li><ul><li>use 2 connections/client. One inserts 50M rows and the other does deletes at the same rate as the inserts. 
Each transaction modifies 50 rows (big transactions). This step is run for a fixed number of inserts, so the run time varies depending on the insert rate.</li></ul><li>l.i2</li><ul><li>like l.i1 but each transaction modifies 5 rows (small transactions).</li></ul><li>qr100</li><ul><li>use 3 connections/client. One does range queries for 1800 seconds and performance is reported for this. The second does 100 inserts/s and the third does 100 deletes/s. The second and third are less busy than the first. The range queries use covering secondary indexes. This step is run for a fixed amount of time. If the target insert rate is not sustained then that is considered to be an SLA failure. If the target insert rate is sustained then the step does the same number of inserts for all systems tested.</li></ul><li>qp100</li><ul><li>like qr100 except uses point queries on the PK index</li></ul><li>qr500</li><ul><li>like qr100 but the insert and delete rates are increased from 100/s to 500/s</li></ul><li>qp500</li><ul><li>like qp100 but the insert and delete rates are increased from 100/s to 500/s</li></ul><li>qr1000</li><ul><li>like qr100 but the insert and delete rates are increased from 100/s to 1000/s</li></ul><li>qp1000</li><ul><li>like qp100 but the insert and delete rates are increased from 100/s to 1000/s</li></ul></ul><div><div><b>Results</b></div><div><br /></div><div>The performance reports are here for <a href="https://mdcallag.github.io/reports/24_01_25.8u.1tno.mem.bee.my.56/all.html">MySQL 5.6</a>, <a href="https://mdcallag.github.io/reports/24_01_25.8u.1tno.mem.bee.my.57/all.html">MySQL 5.7</a>, <a href="https://mdcallag.github.io/reports/24_01_25.8u.1tno.mem.bee.my.80/all.html">MySQL 8.0</a> and <a href="https://mdcallag.github.io/reports/24_01_25.8u.1tno.mem.bee.my.all/all.html">MySQL 5.6 to 8.0</a>.<br /><br /></div><div><div>The summary has 3 tables. The first shows absolute throughput by DBMS tested X benchmark step. 
The second has throughput relative to the version on the first row of the table. The third shows the background insert rate for benchmark steps with background inserts and all systems sustained the target rates. The second table makes it easy to see how performance changes over time.</div><div><br /></div><div>Below I use relative QPS to explain how performance changes. It is: (QPS for $me / QPS for $base) where $me is my version and $base is the version of the base case. When relative QPS is > 1.0 then performance improved over time. When it is < 1.0 then there are regressions. The Q in relative QPS measures: </div><div><ul><li>insert/s for l.i0, l.i1, l.i2</li><li>indexed rows/s for l.x</li><li>range queries/s for qr100, qr500, qr1000</li><li>point queries/s for qp100, qp500, qp1000</li></ul><div>Below I use colors to highlight the relative QPS values with <span style="background-color: #f4cccc;">red</span> for <= 0.95, <span style="background-color: #d9ead3;">green</span> for >= 1.05 and <span style="background-color: #eeeeee;">grey</span> for values between 0.95 and 1.05.</div></div></div><div><br /></div><div>From the summary <a href="https://mdcallag.github.io/reports/24_01_25.8u.1tno.mem.bee.my.56/all.html#summary">for 5.6</a></div><div><ul><li>The base case is 5.6.21</li><li>Throughput in 5.6.51 is ~2% less than 5.6.21</li></ul></div><div><div>From the summary <a href="https://mdcallag.github.io/reports/24_01_25.8u.1tno.mem.bee.my.57/all.html#summary">for 5.7</a></div><div><ul><li>The base case is 5.7.10</li><li>Throughput in 5.7.44 is ~3% less than 5.7.10</li></ul></div></div><div><div>From the summary <a href="https://mdcallag.github.io/reports/24_01_25.8u.1tno.mem.bee.my.80/all.html#summary">for 8.0</a></div><div><ul style="text-align: left;"><li>The base case is 8.0.13</li><li>I ignore the 8.0.35 variations (cz10aps0_bee config, rel_lessps build) for now</li><li>Unlike MySQL 5.6 and 5.7 above, there are larger regressions during the 8.0 cycle. 
Comparing 8.0.36 with 8.0.13</li><ul><li>l.i0 - <span style="background-color: white;">relative QPS is </span><span style="background-color: #f4cccc;">0.81</span> in 8.0.36</li><li>l.x (create index) - I ignore this for now but <a href="https://smalldatum.blogspot.com/2023/12/create-innodb-indexes-2x-faster-with.html">read this</a></li><li>l.i1, l.i2 - <span style="background-color: white;">relative QPS is </span><span style="background-color: #f4cccc;">0.91</span><span style="background-color: white;"> and </span><span style="background-color: #f4cccc;">0.80</span><span style="background-color: white;"> in 8.0.36</span></li><li>qr100, qr500, qr1000 - <span style="background-color: white;">relative QPS is </span><span style="background-color: #eeeeee;">0.97</span><span style="background-color: white;">, </span><span style="background-color: #eeeeee;">0.96</span><span style="background-color: white;"> and </span><span style="background-color: #f4cccc;">0.94</span><span style="background-color: white;"> in 8.0.36</span></li><li>qp100, qp500, qp1000 - relative QPS is <span style="background-color: #f4cccc;">0.86</span><span style="background-color: white;">, </span><span style="background-color: #f4cccc;">0.88</span><span style="background-color: white;"> and </span><span style="background-color: #f4cccc;">0.84</span> in 8.0.36</li></ul></ul><div>From the summary <a href="https://mdcallag.github.io/reports/24_01_25.8u.1tno.mem.bee.my.80/all.html#summary">for 8.0</a> focusing on the 8.0.35 variations that disable the perf schema</div><div><ul><li>Throughput for write-heavy steps (l.i0, l.i1, l.i2) is up to 4% better</li><li>Throughput for read-heavy steps (qr*, qp*) is ~11% better</li><li>Throughput for parallel index create is ~1.5X better (<a href="https://smalldatum.blogspot.com/2023/12/create-innodb-indexes-2x-faster-with.html">read this</a>)</li></ul></div><div><div>From the summary for <a 
href="https://mdcallag.github.io/reports/24_01_25.8u.1tno.mem.bee.my.all/all.html#summary">5.6, 5.7, 8.0</a></div><div><ul style="text-align: left;"><li>The base case is 5.6.21</li><li>Comparing 5.7.44 and 8.0.36 with 5.6.21 shows the large regressions</li><ul><li>l.i0</li><ul><li>relative QPS is <span style="background-color: #f4cccc;">0.81</span> in 5.7.44</li><li>relative QPS is <span style="background-color: #f4cccc;">0.55</span> in 8.0.36</li></ul><li>l.x - I ignore this for now</li><li>l.i1, l.i2</li><ul><li>relative QPS is <span style="background-color: #d9ead3;">1.10</span><span style="background-color: white;">, </span><span style="background-color: #f4cccc;">0.86</span> in 5.7.44</li><li>relative QPS is <span style="background-color: #f4cccc;">0.91</span><span style="background-color: white;">, </span><span style="background-color: #f4cccc;">0.71</span> in 8.0.36</li></ul><li>qr100, qr500, qr1000</li><ul><li>relative QPS is <span style="background-color: #f4cccc;">0.73</span>, <span style="background-color: #f4cccc;">0.72</span>, <span style="background-color: #f4cccc;">0.72</span> in 5.7.44</li><li>relative QPS is <span style="background-color: #f4cccc;">0.62</span>, <span style="background-color: #f4cccc;">0.63</span>, <span style="background-color: #f4cccc;">0.62</span> in 8.0.36</li></ul><li>qp100, qp500, qp1000</li><ul><li>relative QPS is <span style="background-color: #f4cccc;">0.81</span>, <span style="background-color: #f4cccc;">0.80</span>, <span style="background-color: #f4cccc;">0.80</span> in 5.7.44</li><li>relative QPS is <span style="background-color: #f4cccc;">0.60</span>, <span style="background-color: #f4cccc;">0.61</span>, <span style="background-color: #f4cccc;">0.61</span> in 8.0.36</li></ul></ul></ul></div></div></div></div></div></div></div></div></div></div>Mark 
Callaghanhttp://www.blogger.com/profile/09590445221922043181noreply@blogger.com0tag:blogger.com,1999:blog-9149523927864751087.post-70868477042922285582024-01-24T12:07:00.000-08:002024-03-17T18:30:45.332-07:00Updated Insert benchmark: Postgres 9.x to 16.x, small server, cached database, v3<p>I now have 4 server types at home (8 cores + 16G RAM, 8 cores + 32G RAM, 24 cores, 32 cores) and am trying to finish a round of the <a href="https://smalldatum.blogspot.com/2023/12/updates-for-insert-benchmark-december.html">Insert Benchmark</a> for each. This has results for the smallest (8 cores + 16G RAM) using a cached workload and Postgres.<br /><br />In <a href="https://smalldatum.blogspot.com/2023/12/perf-regressions-in-mysql-from-56-to-80.html">previous blog posts</a> I claimed that there are large regressions from old to new MySQL but not from old to new Postgres. And I shared results for MySQL 5.6, 5.7 and 8.0 along with Postgres versions 10 through 16. A comment about these results is the comparison was unfair because the first GA MySQL 5.6 release is 5.6.10 from 2013 while the first Postgres 10 GA release is 10.0 from 2017.<br /><br />Here I have results going back to Postgres 9.0.23 and the first 9.0 release is 9.0.0 from 2010.<br /><br />tl;dr</p><p></p><ul style="text-align: left;"><li>the song remains the same: MySQL has large regressions over time while Postgres avoids them</li><li>comparing Postgres 16.1 with Postgres 9.0.23</li><ul><li>for write-heavy benchmark steps PG 16.1 gets between 1.2X and 2.8X more throughput</li><li>for range queries PG 16.1 gets ~1.2X more throughput</li><li>for point queries PG 16.1 gets ~1.1X more throughput</li></ul></ul><p></p><p><b>Build + Configuration</b></p><div><div>See the <a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-postgres-9x-to.html">previous report</a> for more details. 
I used these versions: 9.0.23, 9.1.24, 9.2.24, 9.3.25, 9.4.26, 9.5.25, 9.6.24, 10.23, 11.22, 12.17, 13.13, 14.10, 15.5, 16.1. </div><div><br />The configuration files are in subdirectories <a href="https://github.com/mdcallag/mytools/tree/master/bench/conf/nuc8i7.ub1804">from here</a>. Search for files named <i>conf.diff.cx9a2_bee</i> which exist for each major version of Postgres<i>.</i></div><div><br /></div><div><b>The Benchmark</b></div><div><br /></div><div>The benchmark is <a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-postgres-9x-to.html">explained here</a> except the first benchmark step, l.i0, loads 30M rows/table here while previously it only loaded 20M. The database still fits in memory as the test server has 16G of RAM and the database tables are ~8G. The benchmark is run with 1 client.</div><div><br /></div><div>The test server was named SER4 in the previous report. It has 8 cores, 16G RAM, Ubuntu 22.04 and XFS using 1 m.2 device.</div><div><br />The benchmark steps are:<div><p></p><div><ul><li>l.i0</li><ul><li>insert 30 million rows per table in PK order. The table has a PK index but no secondary indexes. There is one connection per client.</li></ul><li>l.x</li><ul><li>create 3 secondary indexes per table. There is one connection per client.</li></ul><li>l.i1</li><ul><li>use 2 connections/client. One inserts 40M rows and the other does deletes at the same rate as the inserts. Each transaction modifies 50 rows (big transactions). This step is run for a fixed number of inserts, so the run time varies depending on the insert rate.</li></ul><li>l.i2</li><ul><li>like l.i1 but each transaction modifies 5 rows (small transactions) and 10M rows total</li><li>Wait for X seconds after the step finishes to reduce variance during the read-write benchmark steps that follow.</li></ul><li>qr100</li><ul><li>use 3 connections/client. One does range queries for 1800 seconds and performance is reported for this. 
The second does 100 inserts/s and the third does 100 deletes/s. The second and third are less busy than the first. The range queries use covering secondary indexes. This step is run for a fixed amount of time. If the target insert rate is not sustained then that is considered to be an SLA failure. If the target insert rate is sustained then the step does the same number of inserts for all systems tested.</li></ul><li>qp100</li><ul><li>like qr100 except uses point queries on the PK index</li></ul><li>qr500</li><ul><li>like qr100 but the insert and delete rates are increased from 100/s to 500/s</li></ul><li>qp500</li><ul><li>like qp100 but the insert and delete rates are increased from 100/s to 500/s</li></ul><li>qr1000</li><ul><li>like qr100 but the insert and delete rates are increased from 100/s to 1000/s</li></ul><li>qp1000</li><ul><li>like qp100 but the insert and delete rates are increased from 100/s to 1000/s</li></ul></ul><div><div><b>Results</b></div><div><br /></div><div>The performance report <a href="https://mdcallag.github.io/reports/24_01_24.8u.1tno.mem.bee.pg/all.html">is here</a>.</div><div><br /></div><div>The summary has 3 tables. The first shows absolute throughput by DBMS tested X benchmark step. The second has throughput relative to the version on the first row of the table. The third shows the background insert rate for benchmark steps with background inserts and all systems sustained the target rates. The second table makes it easy to see how performance changes over time.</div><div><br /></div><div>Below I use relative QPS to explain how performance changes. It is: (QPS for $me / QPS for $base) where $me is my version and $base is the version of the base case. When relative QPS is > 1.0 then performance improved over time. When it is < 1.0 then there are regressions. 
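As a sketch, the relative QPS metric and the color buckets used in these summaries can be written as follows; the 0.95 and 1.05 thresholds are the ones from this post, while the absolute rates in the example are hypothetical:

```python
# Sketch of the relative-QPS metric and the red/green/grey buckets used in
# these summaries. Thresholds (0.95, 1.05) are from the post; everything
# else is illustrative.
def relative_qps(qps_me: float, qps_base: float) -> float:
    # (QPS for $me / QPS for $base)
    return qps_me / qps_base

def bucket(rel: float) -> str:
    if rel <= 0.95:
        return "red"    # regression
    if rel >= 1.05:
        return "green"  # improvement
    return "grey"       # within the noise band
```

For example, with hypothetical absolute rates, relative_qps(2820, 1000) is 2.82 and bucket(2.82) is "green", which is how the l.i1 comparison of Postgres 16.1 to 9.0.23 is shown below.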
The Q in relative QPS measures: </div><div><ul><li>insert/s for l.i0, l.i1, l.i2</li><li>indexed rows/s for l.x</li><li>range queries/s for qr100, qr500, qr1000</li><li>point queries/s for qp100, qp500, qp1000</li></ul><div>Below I use colors to highlight the relative QPS values with <span style="background-color: #f4cccc;">red</span> for <= 0.95, <span style="background-color: #d9ead3;">green</span> for >= 1.05 and <span style="background-color: #eeeeee;">grey</span> for values between 0.95 and 1.05.</div><div><br /></div><div>From <a href="https://mdcallag.github.io/reports/24_01_24.8u.1tno.mem.bee.pg/all.html#summary">the summary</a>:</div></div><div><ul style="text-align: left;"><li>The base case is <span style="text-align: right;">pg9023_def which means Postgres 9.0.23</span></li><li><span style="text-align: right;">For most of the read-write benchmark steps throughput improves a lot from 9.1.24 to 9.2.24 and has been stable since then. The exception is the last step (qp1000) for which throughput is flat. 
It might be that writeback and/or vacuum hurts query throughput by that point.</span></li><li><span style="text-align: right;">For the write-heavy steps (l.i0, l.x, l.i1, l.i2) throughput improves a lot</span></li><ul><li><span style="text-align: right;">l.i0 - things get a lot better in Postgres 11.22</span></li><li><span style="text-align: right;">l.x - things get a lot better in Postgres 9.6.24</span></li><li><span style="text-align: right;">l.i1 - things get a lot better in Postgres 9.5.25 and then again in 12.17</span></li><li><span style="text-align: right;">l.i2 - improvements are similar to l.i1 but not as good because of the query planner overhead during DELETE statements (<a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-postgres-9x-to_10.html">see the comments</a> about get_actual_variable_range)</span></li></ul><li>Comparing throughput in Postgres 16.1 to 9.0.23</li><ul><li>Write-heavy</li><ul><li>l.i0, l.x, l.i1, l.i2 - relative QPS is <span style="background-color: #d9ead3;">1.23</span><span style="background-color: white;">, </span><span style="background-color: #d9ead3;">1.81</span><span style="background-color: white;">, </span><span style="background-color: #d9ead3;">2.82</span><span style="background-color: white;">, </span><span style="background-color: #d9ead3;">2.69</span></li></ul><li><span style="background-color: white;">Range queries</span></li><ul><li>qr100, qr500, qr1000 - relative QPS is <span style="background-color: #d9ead3;">1.20</span>, <span style="background-color: #d9ead3;">1.24</span>, <span style="background-color: #d9ead3;">1.25</span></li></ul><li><span style="background-color: white;">Point queries</span></li><ul><li>qp100, qp500, qp1000 - relative QPS is <span style="background-color: #d9ead3;">1.10</span>, <span style="background-color: #d9ead3;">1.09</span>, <span style="background-color: #eeeeee;">1.00</span></li></ul></ul></ul><div><br /></div></div></div></div></div></div></div>Mark 
Callaghanhttp://www.blogger.com/profile/09590445221922043181noreply@blogger.com0tag:blogger.com,1999:blog-9149523927864751087.post-53666541411128457222024-01-17T13:03:00.000-08:002024-01-17T13:03:46.779-08:00Updated Insert benchmark: MyRocks 5.6 and 8.0, medium server, IO-bound database, v2<p>This has results for the <a href="https://smalldatum.blogspot.com/2023/12/updates-for-insert-benchmark-december.html">Insert Benchmark</a> using MyRocks 5.6 and 8.0, a medium server and an IO-bound workload with a working set that isn't cached.</p><p>tl;dr</p><p></p><ul style="text-align: left;"><li>The cost from enabling the perf schema was insignificant for the write-heavy and point-query benchmark steps. It was significant for the range-query benchmark steps.</li></ul><div>Comparing latest MyRocks 5.6.35 to older MyRocks 5.6.35</div><div><ul style="text-align: left;"><li>Write-heavy perf mostly improves, especially on the initial load step (l.i0)</li><li>Point-query perf is stable</li><li>Range-query perf shows a big regression between the fbmy5635_rel_202210112144 and fbmy5635_rel_202302162102 builds</li></ul><div>Comparing latest MyRocks 8.0.32 to older MyRocks 5.6.35</div></div><div><ul style="text-align: left;"><li>The cost of the perf schema is large for range queries and otherwise not large</li><li>Write-heavy perf mostly improves, especially on the initial load step (l.i0)</li><li>Point-query perf is stable</li><li>Range-query perf shows a big regression between the fbmy5635_rel_202210112144 and fbmy5635_rel_202302162102 builds and doesn't recover in the 8.0 builds</li></ul></div><div>Comparing latest MyRocks 8.0.32 to latest MyRocks 5.6.35</div><div><ul style="text-align: left;"><li>Write-heavy perf is similar except for the initial load step (l.i0) in which 8.0 is almost 20% slower</li><li>Point-query perf is similar</li><li>Range-query perf is ~5% worse in 8.0</li></ul><div>Comparing latest MyRocks 8.0.32 to latest MyRocks 8.0.28</div></div><div><ul 
style="text-align: left;"><li>Results are similar</li></ul></div><p></p><p><b>Build + Configuration</b></p><div><p>See the <a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-myrocks-56-and.html">previous report</a>.</p><p><b>Benchmark</b></p><p>See the <a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-myrocks-56-and.html">previous report</a>. </p><div><b>Benchmark steps</b></div><div><br /></div><div>The benchmark is run with 8 clients and a client per table.</div><div><br /></div><div>The benchmark is a sequence of steps that are run in order:</div><div><ul><li>l.i0</li><ul><li>insert 500M rows per table in PK order. The table has a PK index but no secondary indexes. There is one connection per client.</li></ul><li>l.x</li><ul><li>create 3 secondary indexes per table. There is one connection per client.</li></ul><li>l.i1</li><ul><li>use 2 connections/client. One does inserts as fast as possible and the other does deletes at the same rate as the inserts to avoid changing the number of rows in the table. Each transaction modifies 50 rows (big transactions). This step is run for a fixed number of inserts, so the run time varies depending on the insert rate.</li></ul><li>l.i2</li><ul><li>like l.i1 but each transaction modifies 5 rows (small transactions).</li><li>Wait for X seconds after the step finishes to reduce variance during the read-write benchmark steps that follow where X is max(1200, 60 + #nrows/1M). While waiting do things to reduce writeback debt where the things are:</li><ul><li>MyRocks (<a href="https://github.com/mdcallag/mytools/blob/dd901e3ef42ae8f0104830b1cac3fd778980508b/bench/ibench/iq.sh#L132">see here</a>) - set rocksdb_force_flush_memtable_now to flush the memtable, wait 20 seconds and then set rocksdb_compact_lzero_now to flush L0. Note that rocksdb_compact_lzero_now wasn't supported until mid-2023.</li></ul></ul><li>qr100</li><ul><li>use 3 connections/client. 
One does range queries as fast as possible and performance is reported for this. The second does 100 inserts/s and the third does 100 deletes/s. The second and third are less busy than the first. The range queries use covering secondary indexes. This step is run for 1800 seconds. If the target insert rate is not sustained then that is considered to be an SLA failure. If the target insert rate is sustained then the step does the same number of inserts for all systems tested.</li></ul><li>qp100</li><ul><li>like qr100 except uses point queries on the PK index</li></ul><li>qr500</li><ul><li>like qr100 but the insert and delete rates are increased from 100/s to 500/s</li></ul><li>qp500</li><ul><li>like qp100 but the insert and delete rates are increased from 100/s to 500/s</li></ul><li>qr1000</li><ul><li>like qr100 but the insert and delete rates are increased from 100/s to 1000/s</li></ul><li>qp1000</li><ul><li>like qp100 but the insert and delete rates are increased from 100/s to 1000/s</li></ul></ul></div><div><b>Results</b></div><div><br /></div><div>The performance reports are here for</div><div><ul><li><a href="https://mdcallag.github.io/reports/24_01_17.8u.1tno.io.c2.fbmy.56/all.html">MyRocks 5.6</a> </li><li><a href="https://mdcallag.github.io/reports/24_01_17.8u.1tno.io.c2.fbmy.80/all.html">MyRocks 8.0</a></li><li><a href="https://mdcallag.github.io/reports/24_01_17.8u.1tno.io.c2.fbmy.all/all.html">MyRocks 5.6 & 8.0</a> with many 5.6 versions</li><li><a href="https://mdcallag.github.io/reports/24_01_17.8u.1tno.io.c2.fbmy.latest/all.html">MyRocks 5.6 & 8.0</a> with the latest versions</li></ul></div><div>The summary has 3 tables. The first shows absolute throughput by DBMS tested X benchmark step. The second has throughput relative to the version on the first row of the table. The third shows the background insert rate for benchmark steps with background inserts and all systems sustained the target rates. 
The second table makes it easy to see how performance changes over time.</div><div><br /></div><div>Below I use relative QPS to explain how performance changes. It is: (QPS for $me / QPS for $base) where $me is my version and $base is the version of the base case. When relative QPS is > 1.0 then performance improved over time. When it is < 1.0 then there are regressions. The Q in relative QPS measures: </div><div><ul><li>insert/s for l.i0, l.i1, l.i2</li><li>indexed rows/s for l.x</li><li>range queries/s for qr100, qr500, qr1000</li><li>point queries/s for qp100, qp500, qp1000</li></ul><div>Below I use colors to highlight the relative QPS values with <span style="background-color: #f4cccc;">red</span> for <= 0.95, <span style="background-color: #d9ead3;">green</span> for >= 1.05 and <span style="background-color: #eeeeee;">grey</span> for values between 0.95 and 1.05.</div><div><br /></div><div>From <a href="https://mdcallag.github.io/reports/24_01_17.8u.1tno.io.c2.fbmy.56/all.html#summary">the summary</a> for 5.6</div></div><div><ul><li>The base case is fbmy5635_rel_202104072149</li><li>Comparing throughput in fbmy5635_rel_221222 to the base case</li><ul><li>Write-heavy</li><ul><li>l.i0, l.x, l.i1, l.i2 - relative QPS is <span style="background-color: #d9ead3;">1.11</span><span style="background-color: white;">, </span><span style="background-color: #f4cccc;">0.92</span><span style="background-color: white;">, </span><span style="background-color: #eeeeee;">1.00</span><span style="background-color: white;">, </span><span style="background-color: #eeeeee;">1.00</span></li></ul><li><span style="background-color: white;">Range queries</span></li><ul><li>qr100, qr500, qr1000 - relative QPS is <span style="background-color: #f4cccc;">0.62</span>, <span style="background-color: #f4cccc;">0.79</span>, <span style="background-color: #f4cccc;">0.77</span></li></ul><li><span style="background-color: white;">Point queries</span></li><ul><li>qp100, qp500, qp1000 - relative 
QPS is <span style="background-color: #eeeeee;">0.97</span>, <span style="background-color: #eeeeee;">1.00</span>, <span style="background-color: #eeeeee;">0.99</span></li></ul></ul></ul><div>From <a href="https://mdcallag.github.io/reports/24_01_17.8u.1tno.io.c2.fbmy.80/all.html#summary">the summary</a> for 8.0</div></div><div><ul><li>The base case is fbmy8028_rel_221222</li><li>The cost of the perf schema is <= 2% for write-heavy, <= 19% for range queries and <= 1% for point queries. I am not certain that the impact on range queries is all from the perf schema. I still need to explain why the range query benchmark steps have too much noise.</li><li>Comparing throughput in fbmy8032_rel_221222 to the base case</li><ul><li>Write-heavy</li><ul><li>l.i0, l.x, l.i1, l.i2 - relative QPS is <span style="background-color: #eeeeee;">0.96</span><span style="background-color: white;">, </span><span style="background-color: #eeeeee;">0.98</span><span style="background-color: white;">, </span><span style="background-color: #eeeeee;">0.99</span><span style="background-color: white;">, </span><span style="background-color: #eeeeee;">0.97</span></li></ul><li><span style="background-color: white;">Range queries</span></li><ul><li>qr100, qr500, qr1000 - relative QPS is <span style="background-color: #eeeeee;">1.02</span>, <span style="background-color: #eeeeee;">1.04</span>, <span style="background-color: #eeeeee;">1.04</span></li></ul><li><span style="background-color: white;">Point queries</span></li><ul><li>qp100, qp500, qp1000 - relative QPS is <span style="background-color: #eeeeee;">0.99</span>, <span style="background-color: #eeeeee;">1.00</span>, <span style="background-color: #eeeeee;">0.98</span></li></ul></ul></ul><div>From <a href="https://mdcallag.github.io/reports/24_01_17.8u.1tno.io.c2.fbmy.all/all.html#summary">the summary</a> for 5.6, 8.0 with many versions</div></div><div><ul><li>The base case is fbmy5635_rel_202104072149</li><li>Comparing throughput in 
fbmy8032_rel_221222 to the base case</li><ul><li>Write-heavy</li><ul><li>l.i0, l.x, l.i1, l.i2 - relative QPS is <span style="background-color: #f4cccc;">0.91</span><span style="background-color: white;">, </span><span style="background-color: #f4cccc;">0.87</span><span style="background-color: white;">, </span><span style="background-color: #eeeeee;">0.99</span><span style="background-color: white;">, </span><span style="background-color: #eeeeee;">0.97</span></li></ul><li><span style="background-color: white;">Range queries</span></li><ul><li>qr100, qr500, qr1000 - relative QPS is <span style="background-color: #f4cccc;">0.58</span>, <span style="background-color: #f4cccc;">0.76</span>, <span style="background-color: #f4cccc;">0.74</span></li></ul><li><span style="background-color: white;">Point queries</span></li><ul><li>qp100, qp500, qp1000 - relative QPS is <span style="background-color: #eeeeee;">0.98</span>, <span style="background-color: #eeeeee;">1.03</span>, <span style="background-color: #eeeeee;">1.01</span></li></ul></ul></ul><div>From <a href="https://mdcallag.github.io/reports/24_01_17.8u.1tno.io.c2.fbmy.latest/all.html#summary">the summary</a> for 5.6, 8.0 with latest versions</div></div><div><ul><li>The base case is fbmy5635_rel_221222</li><li>Comparing throughput in fbmy8032_rel_221222 to the base case</li><ul><li>Write-heavy</li><ul><li>l.i0, l.x, l.i1, l.i2 - relative QPS is <span style="background-color: #f4cccc;">0.82</span><span style="background-color: white;">, </span><span style="background-color: #f4cccc;">0.95</span><span style="background-color: white;">, </span><span style="background-color: #eeeeee;">0.98</span><span style="background-color: white;">, </span><span style="background-color: #eeeeee;">0.97</span></li></ul><li><span style="background-color: white;">Range queries</span></li><ul><li>qr100, qr500, qr1000 - relative QPS is <span style="background-color: #f4cccc;">0.95</span>, <span style="background-color: 
#eeeeee;">0.96</span>, <span style="background-color: #eeeeee;">0.96</span></li></ul><li><span style="background-color: white;">Point queries</span></li><ul><li>qp100, qp500, qp1000 - relative QPS is <span style="background-color: #eeeeee;">1.01</span>, <span style="background-color: #eeeeee;">1.02</span>, <span style="background-color: #eeeeee;">1.02</span></li></ul></ul></ul></div></div><div><br /></div>Mark Callaghanhttp://www.blogger.com/profile/09590445221922043181noreply@blogger.com0tag:blogger.com,1999:blog-9149523927864751087.post-83243326124576238022024-01-12T11:39:00.000-08:002024-01-24T11:34:02.307-08:00Updated Insert benchmark: MyRocks 5.6 and 8.0, small(est) server, cached database, v2<p>This has results for the <a href="https://smalldatum.blogspot.com/2023/12/updates-for-insert-benchmark-december.html">Insert Benchmark</a> using MyRocks 5.6 and 8.0, a small server and a cached workload. I have two versions of small servers -- Beelink SER4 with 16G of RAM, Beelink SER7 with 32G of RAM. This report uses the SER4. This report replaces a <a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-myrocks-56-and_2.html">January 2</a> report for the Beelink SER4. The difference is that I improved the benchmark scripts to reduce compaction debt prior to the read-write benchmark steps. My intention was to reduce noise in the throughput results. 
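What the scripts do to reduce compaction debt is detailed in the benchmark steps below; a sketch of my reading of that description (not the actual script) looks like this, where run_sql is a stand-in for whatever client executes statements against the server:

```python
# Sketch of the debt-reduction step that runs after l.i2, per the
# description in this post. `run_sql` is a hypothetical helper; the
# variable names are the MyRocks system variables named in the post.
import time

def reduce_compaction_debt(run_sql, nrows, supports_compact_lzero=True):
    # Wait for X seconds where X = max(1200, 60 + #nrows/1M).
    wait_secs = max(1200, 60 + nrows // 1_000_000)
    # While waiting, reduce writeback debt:
    run_sql("SET GLOBAL rocksdb_force_flush_memtable_now = 1")
    time.sleep(20)  # give the memtable flush time to finish
    if supports_compact_lzero:  # only in MyRocks builds from mid-2023 on
        run_sql("SET GLOBAL rocksdb_compact_lzero_now = 1")
    return wait_secs
```

For the 30M-row tables used here the wait works out to max(1200, 60 + 30) = 1200 seconds.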
Alas, I have more work to do.</p><p>tl;dr</p><p></p><ul style="text-align: left;"><li>Enabling the perf schema reduces throughput by up to 10% for write-heavy and up to 5% for read-heavy.</li><li>The range query benchmark steps (qr*) have too much noise that I have yet to explain</li><li>Comparing latest MyRocks 8.0.32 to 5.6.35 shows</li><ul><li>8.0.32 gets 20% to 30% less throughput for write-heavy</li><li>8.0.32 gets ~10% less throughput for point queries</li><li>There is too much noise on the range query benchmark steps</li></ul></ul><p></p><ul style="text-align: left;"></ul><p></p><div><b>Build + Configuration</b></div><div><b><br /></b></div><div>See the <a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-myrocks-56-and_2.html">previous report</a>.</div><div><b><br /></b></div><div><b>Benchmark</b></div><p></p><div><div>The server is a Beelink SER4 <a href="https://smalldatum.blogspot.com/2022/10/small-servers-for-performance-testing-v4.html">described here</a> with 8 cores, 16G RAM, Ubuntu 22.04 and XFS on a fast m.2 NVMe device. The benchmark is run with 1 client.</div><div><br /></div><div>The benchmark is a sequence of steps that are run in order:</div><div><ul><li>l.i0</li><ul><li>insert 30M rows per table in PK order. The table has a PK index but no secondary indexes. There is one connection per client.</li></ul><li>l.x</li><ul><li>create 3 secondary indexes per table. There is one connection per client.</li></ul><li>l.i1</li><ul><li>use 2 connections/client. One does inserts as fast as possible and the other does deletes at the same rate as the inserts to avoid changing the number of rows in the table. Each transaction modifies 50 rows (big transactions). 
This step is run for a fixed number of inserts, so the run time varies depending on the insert rate.</li></ul><li>l.i2</li><ul><li>like l.i1 but each transaction modifies 5 rows (small transactions).</li><li>Wait for X seconds after the step finishes to reduce variance during the read-write benchmark steps that follow where X is max(1200, 60 + #nrows/1M). While waiting do things to reduce writeback debt where the things are:</li><ul><li>MyRocks (<a href="https://github.com/mdcallag/mytools/blob/dd901e3ef42ae8f0104830b1cac3fd778980508b/bench/ibench/iq.sh#L132">see here</a>) - set rocksdb_force_flush_memtable_now to flush the memtable, wait 20 seconds and then set rocksdb_compact_lzero_now to flush L0. Note that rocksdb_compact_lzero_now wasn't supported until mid-2023.</li></ul></ul><li>qr100</li><ul><li>use 3 connections/client. One does range queries as fast as possible and performance is reported for this. The second does 100 inserts/s and the third does 100 deletes/s. The second and third are less busy than the first. The range queries use covering secondary indexes. This step is run for 1800 seconds. If the target insert rate is not sustained then that is considered to be an SLA failure. 
If the target insert rate is sustained then the step does the same number of inserts for all systems tested.</li></ul><li>qp100</li><ul><li>like qr100 except uses point queries on the PK index</li></ul><li>qr500</li><ul><li>like qr100 but the insert and delete rates are increased from 100/s to 500/s</li></ul><li>qp500</li><ul><li>like qp100 but the insert and delete rates are increased from 100/s to 500/s</li></ul><li>qr1000</li><ul><li>like qr100 but the insert and delete rates are increased from 100/s to 1000/s</li></ul><li>qp1000</li><ul><li>like qp100 but the insert and delete rates are increased from 100/s to 1000/s</li></ul></ul></div><div><b>Results</b></div><div><br /></div><div>The performance reports are here for</div><div><ul><li><a href="https://mdcallag.github.io/reports/24_01_12.1u.1tno.cached.bee.30m.fbmy.56/all.html">MyRocks 5.6</a> </li><li><a href="https://mdcallag.github.io/reports/24_01_12.1u.1tno.cached.bee.30m.fbmy.80/all.html">MyRocks 8.0</a></li><li><a href="https://mdcallag.github.io/reports/24_01_12.1u.1tno.cached.bee.30m.fbmy.all/all.html">MyRocks 5.6 & 8.0</a> with many 5.6 versions</li><li><a href="https://mdcallag.github.io/reports/24_01_12.1u.1tno.cached.bee.30m.fbmy.latest/all.html">MyRocks 5.6 & 8.0</a> with the latest versions</li></ul></div><div>The summary has 3 tables. The first shows absolute throughput by DBMS tested X benchmark step. The second has throughput relative to the version on the first row of the table. The third shows the background insert rate for benchmark steps with background inserts and all systems sustained the target rates. The second table makes it easy to see how performance changes over time.</div><div><br /></div><div>Below I use relative QPS to explain how performance changes. It is: (QPS for $me / QPS for $base) where $me is my version and $base is the version of the base case. When relative QPS is > 1.0 then performance improved over time. When it is < 1.0 then there are regressions. 
The Q in relative QPS measures: </div><div><ul><li>insert/s for l.i0, l.i1, l.i2</li><li>indexed rows/s for l.x</li><li>range queries/s for qr100, qr500, qr1000</li><li>point queries/s for qp100, qp500, qp1000</li></ul><div>Below I use colors to highlight the relative QPS values with <span style="background-color: #f4cccc;">red</span> for <= 0.95, <span style="background-color: #d9ead3;">green</span> for >= 1.05 and <span style="background-color: #eeeeee;">grey</span> for values between 0.95 and 1.05.</div><div><br /></div><div>The range query benchmark steps suffer from too much noise that I have yet to explain.</div><div><br /></div><div>From <a href="https://mdcallag.github.io/reports/24_01_12.1u.1tno.cached.bee.30m.fbmy.56/all.html#summary">the summary</a> for 5.6</div></div><div><ul><li>The base case is fbmy5635_rel_202104072149</li><li>The results with the builds that use clang are similar to gcc except for the l.i0 and l.ix benchmark steps. I <a href="https://github.com/llvm/llvm-project/issues/55153">opened a bug</a> against LLVM for code generation related to crc32 functions.</li><li>Comparing throughput in fbmy5635_rel_221222 to the base case</li><ul><li>Write-heavy</li><ul><li>l.i0, l.x, l.i1, l.i2 - relative QPS is <span style="background-color: #eeeeee;">0.96</span><span style="background-color: white;">, </span><span style="background-color: #eeeeee;">0.98</span><span style="background-color: white;">, </span><span style="background-color: #eeeeee;">0.99</span><span style="background-color: white;">, </span><span style="background-color: #eeeeee;">1.00</span></li></ul><li><span style="background-color: white;">Range queries</span></li><ul><li>qr100, qr500, qr1000 - relative QPS is <span style="background-color: #f4cccc;">0.80</span>, <span style="background-color: #f4cccc;">0.86</span>, <span style="background-color: #d9ead3;">1.63</span></li></ul><li><span style="background-color: white;">Point queries</span></li><ul><li>qp100, qp500, qp1000 - 
relative QPS is <span style="background-color: #eeeeee;">0.97</span>, <span style="background-color: #eeeeee;">1.00</span>, <span style="background-color: #eeeeee;">1.03</span></li></ul></ul></ul><div>From <a href="https://mdcallag.github.io/reports/24_01_12.1u.1tno.cached.bee.30m.fbmy.80/all.html#summary">the summary</a> for 8.0</div></div><div><ul><li>The base case is fbmy8028_rel_20220829_752</li><li>The results with clang are worse than gcc. See the previous section for details.</li><li>Comparing throughput in fbmy8032_rel_221222 to the base case</li><ul><li>Write-heavy</li><ul><li>l.i0, l.x, l.i1, l.i2 - relative QPS is <span style="background-color: #eeeeee;">0.98</span><span style="background-color: white;">, </span><span style="background-color: #eeeeee;">1.02</span><span style="background-color: white;">, </span><span style="background-color: #eeeeee;">1.01</span><span style="background-color: white;">, </span><span style="background-color: #eeeeee;">1.03</span></li></ul><li><span style="background-color: white;">Range queries</span></li><ul><li>qr100, qr500, qr1000 - relative QPS is <span style="background-color: #d9ead3;">1.33</span>, <span style="background-color: #f4cccc;">0.95</span>, <span style="background-color: #f4cccc;">0.94</span></li></ul><li><span style="background-color: white;">Point queries</span></li><ul><li>qp100, qp500, qp1000 - relative QPS is <span style="background-color: #eeeeee;">0.97</span>, <span style="background-color: #eeeeee;">0.98</span>, <span style="background-color: #eeeeee;">0.97</span></li></ul></ul></ul><div>From <a href="https://mdcallag.github.io/reports/24_01_12.1u.1tno.cached.bee.30m.fbmy.all/all.html#summary">the summary</a> for 5.6, 8.0 with many versions</div></div><div><ul><li>The base case is fbmy5635_rel_202104072149</li><li>Enabling the perf schema costs up to 10% of throughput for write-heavy and up to 5% for read-heavy.</li><li>Comparing throughput in fbmy8032_rel_221222 to the base 
case</li><ul><li>Write-heavy</li><ul><li>l.i0, l.x, l.i1, l.i2 - relative QPS is <span style="background-color: #f4cccc;">0.69</span><span style="background-color: white;">, </span><span style="background-color: #f4cccc;">0.88</span><span style="background-color: white;">, </span><span style="background-color: #f4cccc;">0.83</span><span style="background-color: white;">, </span><span style="background-color: #f4cccc;">0.84</span></li></ul><li><span style="background-color: white;">Range queries</span></li><ul><li>qr100, qr500, qr1000 - relative QPS is <span style="background-color: #f4cccc;">0.92</span>, <span style="background-color: #d9ead3;">1.03</span>, <span style="background-color: #d9ead3;">1.55</span></li></ul><li><span style="background-color: white;">Point queries</span></li><ul><li>qp100, qp500, qp1000 - relative QPS is <span style="background-color: #f4cccc;">0.88</span>, <span style="background-color: #f4cccc;">0.89</span>, <span style="background-color: #f4cccc;">0.92</span></li></ul></ul></ul><div>From <a href="https://mdcallag.github.io/reports/24_01_12.1u.1tno.cached.bee.30m.fbmy.latest/all.html#summary">the summary</a> for 5.6, 8.0 with latest versions</div></div><div><ul><li>The base case is fbmy5635_rel_221222</li><li>Comparing throughput in fbmy8032_rel_221222 to the base case</li><ul><li>Write-heavy</li><ul><li>l.i0, l.x, l.i1, l.i2 - relative QPS is <span style="background-color: #f4cccc;">0.68</span><span style="background-color: white;">, </span><span style="background-color: #f4cccc;">0.88</span><span style="background-color: white;">, </span><span style="background-color: #f4cccc;">0.81</span><span style="background-color: white;">, </span><span style="background-color: #f4cccc;">0.79</span></li></ul><li><span style="background-color: white;">Range queries</span></li><ul><li>qr100, qr500, qr1000 - relative QPS is <span style="background-color: #eeeeee;">1.02</span>, <span style="background-color: #d9ead3;">1.30</span>, <span 
style="background-color: #f4cccc;">0.90</span></li></ul><li><span style="background-color: white;">Point queries</span></li><ul><li>qp100, qp500, qp1000 - relative QPS is <span style="background-color: #f4cccc;">0.90</span>, <span style="background-color: #f4cccc;">0.89</span>, <span style="background-color: #f4cccc;">0.89</span></li></ul></ul></ul><div><br /></div><div><br /></div><div><br /></div></div></div>Mark Callaghanhttp://www.blogger.com/profile/09590445221922043181noreply@blogger.com0tag:blogger.com,1999:blog-9149523927864751087.post-67657769179907035672024-01-12T09:46:00.000-08:002024-01-12T11:24:32.187-08:00Updated Insert benchmark: MyRocks 5.6 and 8.0, small server, cached database, v2<p>This has results for the <a href="https://smalldatum.blogspot.com/2023/12/updates-for-insert-benchmark-december.html">Insert Benchmark</a> using MyRocks 5.6 and 8.0, a small server and a cached workload. I have two versions of small servers -- Beelink SER4 with 16G of RAM, Beelink SER7 with 32G of RAM. This report uses the SER7. A recent report from the Beelink SER4 <a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-myrocks-56-and_2.html">is here</a> but that report will be replaced in a few days.</p><p>tl;dr</p><p></p><ul style="text-align: left;"><li>Some of the regressions between MyRocks 5.6 and 8.0 come from upstream. Here that shows up on the l.i0, qp100, qp500 and qr1000 benchmark steps.</li><li>There is too much noise in the range query benchmark steps (qr*) that I have yet to explain</li></ul><p></p><p><b>Noise</b></p><p>I recently improved the benchmark scripts to remove writeback and compaction debt after the l.i2 benchmark step to reduce noise in the read-write steps that follow. At least for MyRocks, the range query benchmark steps (qr100, qr500, qr1000) have more noise. The worst case for noise with MyRocks is the qr100 step, and this is more obvious on a small server. 
</p><p>For MyRocks, the benchmark script now does the following after l.i2:</p><p></p><ul style="text-align: left;"><li>wait for X seconds where X = max(1200, 60 + #rows / 1M)</li><li>while waiting: flush memtable, wait 20 seconds, compact L0 into L1. But compacting L0 into L1 is only done for MyRocks builds from mid-2023 or newer because the feature I used for that was buggy prior to mid-2023.</li></ul><div>When the qr100 benchmark step starts the memtable is empty and the L0 might be empty. On small servers when I run the benchmark step for less than one hour the memtable never gets full and there are no memtable flushes. On larger servers the memtable is likely to be flushed many times.</div><div><br /></div><div>Regardless, I have yet to figure out why there is more noise with MyRocks on the range query benchmark steps. Until then, with MyRocks I focus on qr500 and qr1000 or on the results from larger servers in my search for regressions in range queries. What I see now is that the CPU/query overhead changes significantly, but I need to explain why that happens.</div><p></p><p></p><p><b>Build + Configuration</b></p><div>I tested MyRocks 5.6.35, 8.0.28 and 8.0.32 using the latest code as of December 2023. I also repeated tests for older builds for MyRocks 5.6.35 and 8.0.28. These were compiled from source. 
All builds use CMAKE_BUILD_TYPE =Release.</div><div><br />MyRocks 5.6.35 builds:</div><div><ul style="text-align: left;"><li>fbmy5635_rel_202104072149</li><ul><li>from code as of 2021-04-07 at git hash f896415f with RocksDB 6.19.0</li></ul><li>fbmy5635_rel_202203072101</li><ul><li>from code as of 2022-03-07 at git hash e7d976ee with RocksDB 6.28.2</li></ul><li>fbmy5635_rel_202205192101</li><ul><li>from code as of 2022-05-19 at git hash d503bd77 with RocksDB 7.2.2</li></ul><li>fbmy5635_rel_202208092101</li><ul><li>from code as of 2022-08-09 at git hash 877a0e58 with RocksDB 7.3.1</li></ul><li>fbmy5635_rel_202210112144</li><ul><li>from code as of 2022-10-11 at git hash c691c716 with RocksDB 7.3.1</li></ul><li>fbmy5635_rel_202302162102</li><ul><li>from code as of 2023-02-16 at git hash 21a2b0aa with RocksDB 7.10.0</li></ul><li>fbmy5635_rel_202304122154</li><ul><li>from code as of 2023-04-12 at git hash 205c31dd with RocksDB 7.10.2</li></ul><li>fbmy5635_rel_202305292102</li><ul><li>from code as of 2023-05-29 at git hash b739eac1 with RocksDB 8.2.1</li></ul><li>fbmy5635_rel_20230529_832</li><ul><li>from code as of 2023-05-29 at git hash b739eac1 with RocksDB 8.3.2</li></ul><li>fbmy5635_rel_20230529_843</li><ul><li>from code as of 2023-05-29 at git hash b739eac1 with RocksDB 8.4.3</li></ul><li>fbmy5635_rel_20230529_850</li><ul><li>from code as of 2023-05-29 at git hash b739eac1 with RocksDB 8.5.0</li></ul><li>fbmy5635_rel_221222</li><ul><li>from code as of 2023-12-22 at git hash 4f3a57a1, RocksDB 8.7.0 at git hash 29005f0b</li></ul></ul><div>MyRocks 8.0.28 builds:</div><div><ul style="text-align: left;"><li>fbmy8028_rel_20220829_752</li><ul><li>from code as of 2022-08-29 at git hash a35c8dfeab, RocksDB 7.5.2</li></ul><li>fbmy8028_rel_20230129_754</li><ul><li>from code as of 2023-01-29 at git hash 4d3d44a0459, RocksDB 7.5.4</li></ul><li>fbmy8028_rel_20230502_810</li><ul><li>from code as of 2023-05-02 at git hash d1ca8b276d, RocksDB 
8.1.0</li></ul><li>fbmy8028_rel_20230523_821</li><ul><li>from code as of 2023-05-23 at git hash b08cc536f1, RocksDB 8.2.1</li></ul><li>fbmy8028_rel_20230619_831</li><ul><li>from code as of 2023-06-19 at git hash 6164cf0274, RocksDB 8.3.1</li></ul><li>fbmy8028_rel_20230629_831</li><ul><li>from code as of 2023-06-29 at git hash ab522f6df7c, RocksDB 8.3.1</li></ul><li>fbmy8028_rel_221222</li><ul><li>from code as of 2023-12-22 at git hash 2ad105fc, RocksDB 8.7.0 at git hash 29005f0b</li></ul></ul><div>MyRocks 8.0.32 builds:</div><ul style="text-align: left;"><li>fbmy8032_rel_221222</li><ul><li>from code as of 2023-12-22 at git hash 76707b44, RocksDB 8.7.0 at git hash 29005f0b</li></ul></ul></div></div><p><b>Benchmark</b></p><div>The server is a Beelink SER7 <a href="https://smalldatum.blogspot.com/2022/10/small-servers-for-performance-testing-v4.html">described here</a> with 8 cores, 32G RAM, Ubuntu 22.04 and XFS on a fast m.2 NVMe device. The benchmark is run with 1 client.</div><div><br /></div><div>The benchmark is a sequence of steps that are run in order:</div><div><ul><li>l.i0</li><ul><li>insert 60M rows per table in PK order. The table has a PK index but no secondary indexes. There is one connection per client.</li></ul><li>l.x</li><ul><li>create 3 secondary indexes per table. There is one connection per client.</li></ul><li>l.i1</li><ul><li>use 2 connections/client. One does inserts as fast as possible and the other does deletes at the same rate as the inserts to avoid changing the number of rows in the table. Each transaction modifies 50 rows (big transactions). This step is run for a fixed number of inserts, so the run time varies depending on the insert rate.</li></ul><li>l.i2</li><ul><li>like l.i1 but each transaction modifies 5 rows (small transactions).</li><li>Wait for X seconds after the step finishes to reduce variance during the read-write benchmark steps that follow where X is max(1200, 60 + #nrows/1M). 
While waiting do things to reduce writeback debt where the things are:</li><ul><li>MyRocks (<a href="https://github.com/mdcallag/mytools/blob/dd901e3ef42ae8f0104830b1cac3fd778980508b/bench/ibench/iq.sh#L132">see here</a>) - set rocksdb_force_flush_memtable_now to flush the memtable, wait 20 seconds and then set rocksdb_compact_lzero_now to flush L0. Note that rocksdb_compact_lzero_now wasn't supported until mid-2023.</li></ul></ul><li>qr100</li><ul><li>use 3 connections/client. One does range queries as fast as possible and performance is reported for this. The second does 100 inserts/s and the third does 100 deletes/s. The second and third are less busy than the first. The range queries use covering secondary indexes. This step is run for 1800 seconds. If the target insert rate is not sustained then that is considered to be an SLA failure. If the target insert rate is sustained then the step does the same number of inserts for all systems tested.</li></ul><li>qp100</li><ul><li>like qr100 except uses point queries on the PK index</li></ul><li>qr500</li><ul><li>like qr100 but the insert and delete rates are increased from 100/s to 500/s</li></ul><li>qp500</li><ul><li>like qp100 but the insert and delete rates are increased from 100/s to 500/s</li></ul><li>qr1000</li><ul><li>like qr100 but the insert and delete rates are increased from 100/s to 1000/s</li></ul><li>qp1000</li><ul><li>like qp100 but the insert and delete rates are increased from 100/s to 1000/s</li></ul></ul></div><div><b>Results</b></div><div><br /></div><div>The performance reports are here for</div><div><ul><li><a href="https://mdcallag.github.io/reports/24_01_11.1u.1tno.cached.ser7.fbmy.56/all.html">MyRocks 5.6</a> </li><li><a href="https://mdcallag.github.io/reports/24_01_11.1u.1tno.cached.ser7.fbmy.80/all.html">MyRocks 8.0</a></li><li><a href="https://mdcallag.github.io/reports/24_01_11.1u.1tno.cached.ser7.fbmy.all/all.html">MyRocks 5.6 & 8.0</a> with many 5.6 versions</li><li><a
href="https://mdcallag.github.io/reports/24_01_11.1u.1tno.cached.ser7.fbmy.latest/all.html">MyRocks 5.6 & 8.0</a> with the latest versions</li></ul></div><div>The summary has 3 tables. The first shows absolute throughput by DBMS tested X benchmark step. The second has throughput relative to the version on the first row of the table. The third shows the background insert rate for benchmark steps with background inserts and all systems sustained the target rates. The second table makes it easy to see how performance changes over time.</div><div><br /></div><div>Below I use relative QPS to explain how performance changes. It is: (QPS for $me / QPS for $base) where $me is my version and $base is the version of the base case. When relative QPS is > 1.0 then performance improved over time. When it is < 1.0 then there are regressions. The Q in relative QPS measures: </div><div><ul><li>insert/s for l.i0, l.i1, l.i2</li><li>indexed rows/s for l.x</li><li>range queries/s for qr100, qr500, qr1000</li><li>point queries/s for qp100, qp500, qp1000</li></ul><div>Below I use colors to highlight the relative QPS values with <span style="background-color: #f4cccc;">red</span> for <= 0.95, <span style="background-color: #d9ead3;">green</span> for >= 1.05 and <span style="background-color: #eeeeee;">grey</span> for values between 0.95 and 1.05.</div><div><br /></div><div>The range query benchmark steps suffer from too much noise that I have yet to explain.</div><div><br /></div><div>From <a href="https://mdcallag.github.io/reports/24_01_11.1u.1tno.cached.ser7.fbmy.56/all.html#summary">the summary</a> for 5.6</div></div><div><ul><li>The base case is fbmy5635_rel_202104072149</li><li>Comparing throughput in fbmy5635_rel_221222 to the base case</li><ul><li>Write-heavy</li><ul><li>l.i0, l.x, l.i1, l.i2 - relative QPS is <span style="background-color: #f4cccc;">0.95</span><span style="background-color: white;">, </span><span style="background-color: #eeeeee;">0.98</span><span 
style="background-color: white;">, </span><span style="background-color: #eeeeee;">0.97</span><span style="background-color: white;">, </span><span style="background-color: #f4cccc;">0.95</span></li></ul><li><span style="background-color: white;">Range queries</span></li><ul><li>qr100, qr500, qr1000 - relative QPS is <span style="background-color: #f4cccc;">0.65</span>, <span style="background-color: #d9ead3;">1.11</span>, <span style="background-color: #f4cccc;">0.70</span></li></ul><li><span style="background-color: white;">Point queries</span></li><ul><li>qp100, qp500, qp1000 - relative QPS is <span style="background-color: #eeeeee;">1.00</span>, <span style="background-color: #eeeeee;">0.99</span>, <span style="background-color: #eeeeee;">0.99</span></li></ul></ul></ul><div>From <a href="https://mdcallag.github.io/reports/24_01_11.1u.1tno.cached.ser7.fbmy.80/all.html#summary">the summary</a> for 8.0</div></div><div><ul><li>The base case is fbmy8028_rel_20220829_752</li><li>Comparing throughput in fbmy8032_rel_221222 to the base case</li><ul><li>Write-heavy</li><ul><li>l.i0, l.x, l.i1, l.i2 - relative QPS is <span style="background-color: #f4cccc;">0.95</span><span style="background-color: white;">, </span><span style="background-color: #eeeeee;">1.01</span><span style="background-color: white;">, </span><span style="background-color: #eeeeee;">1.00</span><span style="background-color: white;">, </span><span style="background-color: #eeeeee;">0.97</span></li></ul><li><span style="background-color: white;">Range queries</span></li><ul><li>qr100, qr500, qr1000 - relative QPS is <span style="background-color: #eeeeee;">0.98</span>, <span style="background-color: #f4cccc;">0.72</span>, <span style="background-color: #eeeeee;">1.04</span></li></ul><li><span style="background-color: white;">Point queries</span></li><ul><li>qp100, qp500, qp1000 - relative QPS is <span style="background-color: #eeeeee;">0.99</span>, <span style="background-color: #eeeeee;">1.00</span>, 
<span style="background-color: #eeeeee;">0.99</span></li></ul></ul></ul><div>From <a href="https://mdcallag.github.io/reports/24_01_11.1u.1tno.cached.ser7.fbmy.all/all.html#summary">the summary</a> for 5.6, 8.0 with many versions</div></div><div><ul><li>The base case is fbmy5635_rel_202104072149</li><li>Comparing throughput in fbmy8032_rel_221222 to the base case</li><ul><li>Write-heavy</li><ul><li>l.i0, l.x, l.i1, l.i2 - relative QPS is <span style="background-color: #f4cccc;">0.66</span><span style="background-color: white;">, </span><span style="background-color: #f4cccc;">0.89</span><span style="background-color: white;">, </span><span style="background-color: #f4cccc;">0.82</span><span style="background-color: white;">, </span><span style="background-color: #f4cccc;">0.81</span></li></ul><li><span style="background-color: white;">Range queries</span></li><ul><li>qr100, qr500, qr1000 - relative QPS is <span style="background-color: #f4cccc;">0.93</span>, <span style="background-color: #d9ead3;">1.04</span>, <span style="background-color: #f4cccc;">0.69</span></li></ul><li><span style="background-color: white;">Point queries</span></li><ul><li>qp100, qp500, qp1000 - relative QPS is <span style="background-color: #f4cccc;">0.86</span>, <span style="background-color: #f4cccc;">0.86</span>, <span style="background-color: #f4cccc;">0.83</span></li></ul></ul></ul><div>From <a href="https://mdcallag.github.io/reports/24_01_11.1u.1tno.cached.ser7.fbmy.latest/all.html#summary">the summary</a> for 5.6, 8.0 with latest versions</div></div><div><ul><li>The base case is fbmy5635_rel_221222</li><li>Comparing throughput in fbmy8032_rel_221222 to the base case</li><ul><li>Write-heavy</li><ul><li>l.i0, l.x, l.i1, l.i2 - relative QPS is <span style="background-color: #f4cccc;">0.69</span><span style="background-color: white;">, </span><span style="background-color: #f4cccc;">0.91</span><span style="background-color: white;">, </span><span style="background-color: 
#f4cccc;">0.85</span><span style="background-color: white;">, </span><span style="background-color: #f4cccc;">0.84</span></li></ul><li><span style="background-color: white;">Range queries</span></li><ul><li>qr100, qr500, qr1000 - relative QPS is <span style="background-color: #d9ead3;">1.44</span>, <span style="background-color: #f4cccc;">0.93</span>, <span style="background-color: #eeeeee;">0.98</span></li></ul><li><span style="background-color: white;">Point queries</span></li><ul><li>qp100, qp500, qp1000 - relative QPS is <span style="background-color: #f4cccc;">0.86</span>, <span style="background-color: #f4cccc;">0.87</span>, <span style="background-color: #f4cccc;">0.84</span></li></ul></ul></ul><div><br /></div></div><div><br /></div><div><br /></div><div><br /></div><div><br /></div>Mark Callaghanhttp://www.blogger.com/profile/09590445221922043181noreply@blogger.com0tag:blogger.com,1999:blog-9149523927864751087.post-21410653008942627922024-01-11T16:48:00.000-08:002024-01-17T13:00:25.786-08:00Updated Insert benchmark: MyRocks 5.6 and 8.0, medium server, cached database, v2<p>This has results for the <a href="https://smalldatum.blogspot.com/2023/12/updates-for-insert-benchmark-december.html">Insert Benchmark</a> using MyRocks 5.6 and 8.0, a medium server and a cached workload. This replaces a <a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-myrocks-56-and.html">recent report</a>. The difference between this and the recent report is that I changed the benchmark scripts to reduce writeback and compaction debt between the last write-only benchmark step (l.i2) and the first read-write benchmark step (qr100). The intention is to reduce variance and make it easier to spot regressions. 
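The debt-reduction step that the benchmark scripts run between l.i2 and qr100 can be sketched as below. The `execute` callable is a hypothetical stand-in for a MySQL client; the two SET GLOBAL statements use the rocksdb_force_flush_memtable_now and rocksdb_compact_lzero_now server variables named later in this post, and the 20-second pause matches the description there:

```python
import time

def reduce_compaction_debt(execute, sleep=time.sleep, wait_secs=20):
    """Reduce MyRocks writeback/compaction debt after the l.i2 step.

    `execute` is any callable that runs one SQL statement (for example a
    DB-API cursor.execute). This is a sketch of the sequence, not the
    benchmark script itself.
    """
    # flush the memtable first
    execute("SET GLOBAL rocksdb_force_flush_memtable_now = 1")
    # give writeback a moment to settle
    sleep(wait_secs)
    # then compact L0 into L1 (only supported by builds from mid-2023 on)
    execute("SET GLOBAL rocksdb_compact_lzero_now = 1")
```

With a real connection this would be called as `reduce_compaction_debt(cursor.execute)` once the l.i2 step finishes.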
Alas, that is still an unsolved problem, especially on the range query benchmark steps.</p><p>tl;dr - context matters</p><p>The biggest concerns I have are the ~16% slowdown on the initial load (l.i0) benchmark step from MyRocks 5.6.35 to 8.0.32 and the ~5% slowdown for benchmark steps that do point queries (qp*) from MyRocks 8.0.28 to 8.0.32.</p><p></p>Comparing latest MyRocks 8.0.32 relative to latest MyRocks 5.6.35<div><ul style="text-align: left;"><li>Initial load is ~17% slower</li><li>Other write-heavy benchmark steps are ~3% slower</li><li>Range queries are between 6% and 14% faster</li><li>Point queries are ~7% faster</li></ul>Comparing latest MyRocks 8.0.32 to an old build of MyRocks 5.6.35<br /><ul style="text-align: left;"><li>Initial load is ~16% slower</li><li>Other write-heavy benchmark steps are between 2% and 6% slower</li><li>Range queries are between 5% slower and 5% faster</li><li>Point queries are 5% to 11% faster</li></ul>Comparing latest MyRocks 8.0.32 to latest MyRocks 8.0.28<br /><ul style="text-align: left;"><li>Initial load is ~4% slower</li><li>Other write-heavy benchmark steps are between 3% slower and 2% faster</li><li>Range queries are between 1% slower and 6% faster</li><li>Point queries are ~5% slower</li></ul><p><b>Build + Configuration</b></p><p>See the <a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-myrocks-56-and.html">previous report</a>.</p><p><b>Benchmark</b></p><p>See the <a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-myrocks-56-and.html">previous report</a>. </p><div><b>Benchmark steps</b></div><div><br /></div><div>The benchmark is run with 8 clients and a client per table.</div><div><br /></div><div>The benchmark is a sequence of steps that are run in order:</div><div><ul><li>l.i0</li><ul><li>insert 20M rows per table in PK order. The table has a PK index but no secondary indexes.
There is one connection per client.</li></ul><li>l.x</li><ul><li>create 3 secondary indexes per table. There is one connection per client.</li></ul><li>l.i1</li><ul><li>use 2 connections/client. One does inserts as fast as possible and the other does deletes at the same rate as the inserts to avoid changing the number of rows in the table. Each transaction modifies 50 rows (big transactions). This step is run for a fixed number of inserts, so the run time varies depending on the insert rate.</li></ul><li>l.i2</li><ul><li>like l.i1 but each transaction modifies 5 rows (small transactions).</li><li>Wait for X seconds after the step finishes to reduce variance during the read-write benchmark steps that follow where X is max(1200, 60 + #nrows/1M). While waiting do things to reduce writeback debt where the things are:</li><ul><li>MyRocks (<a href="https://github.com/mdcallag/mytools/blob/dd901e3ef42ae8f0104830b1cac3fd778980508b/bench/ibench/iq.sh#L132">see here</a>) - set rocksdb_force_flush_memtable_now to flush the memtable, wait 20 seconds and then set rocksdb_compact_lzero_now to flush L0. Note that rocksdb_compact_lzero_now wasn't supported until mid-2023.</li></ul></ul><li>qr100</li><ul><li>use 3 connections/client. One does range queries as fast as possible and performance is reported for this. The second does 100 inserts/s and the third does 100 deletes/s. The second and third are less busy than the first. The range queries use covering secondary indexes. This step is run for 1200 seconds. If the target insert rate is not sustained then that is considered to be an SLA failure.
If the target insert rate is sustained then the step does the same number of inserts for all systems tested.</li></ul><li>qp100</li><ul><li>like qr100 except uses point queries on the PK index</li></ul><li>qr500</li><ul><li>like qr100 but the insert and delete rates are increased from 100/s to 500/s</li></ul><li>qp500</li><ul><li>like qp100 but the insert and delete rates are increased from 100/s to 500/s</li></ul><li>qr1000</li><ul><li>like qr100 but the insert and delete rates are increased from 100/s to 1000/s</li></ul><li>qp1000</li><ul><li>like qp100 but the insert and delete rates are increased from 100/s to 1000/s</li></ul></ul></div><div><b>Results</b></div><div><br /></div><div>The performance reports are here for</div><div><ul><li><a href="https://mdcallag.github.io/reports/24_01_11.8u.1tno.cached.fbmy56/all.html">MyRocks 5.6</a> </li><li><a href="https://mdcallag.github.io/reports/24_01_11.8u.1tno.cached.fbmy80/all.html">MyRocks 8.0</a></li><li><a href="https://mdcallag.github.io/reports/24_01_11.8u.1tno.cached.fbmy.all/all.html">MyRocks 5.6 & 8.0</a> with many 5.6 versions</li><li><a href="https://mdcallag.github.io/reports/24_01_11.8u.1tno.cached.fbmy.latest/all.html">MyRocks 5.6 & 8.0</a> with the latest versions</li></ul></div><div>The summary has 3 tables. The first shows absolute throughput by DBMS tested X benchmark step. The second has throughput relative to the version on the first row of the table. The third shows the background insert rate for benchmark steps with background inserts and all systems sustained the target rates. The second table makes it easy to see how performance changes over time.</div><div><br /></div><div>Below I use relative QPS to explain how performance changes. It is: (QPS for $me / QPS for $base) where $me is my version and $base is the version of the base case. When relative QPS is > 1.0 then performance improved over time. When it is < 1.0 then there are regressions.
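As a concrete sketch, the relative QPS and the color buckets used in the summaries can be computed as below (the thresholds are the ones stated here: red for <= 0.95, green for >= 1.05, grey in between):

```python
def relative_qps(qps_me: float, qps_base: float) -> float:
    """Relative QPS: (QPS for $me / QPS for $base)."""
    return qps_me / qps_base

def bucket(r: float) -> str:
    """Color used to highlight a relative QPS value."""
    if r <= 0.95:
        return "red"    # regression
    if r >= 1.05:
        return "green"  # improvement
    return "grey"       # within the noise
```

For example, `bucket(relative_qps(89.0, 100.0))` yields "red".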
The Q in relative QPS measures: </div><div><ul><li>insert/s for l.i0, l.i1, l.i2</li><li>indexed rows/s for l.x</li><li>range queries/s for qr100, qr500, qr1000</li><li>point queries/s for qp100, qp500, qp1000</li></ul><div>Below I use colors to highlight the relative QPS values with <span style="background-color: #f4cccc;">red</span> for <= 0.95, <span style="background-color: #d9ead3;">green</span> for >= 1.05 and <span style="background-color: #eeeeee;">grey</span> for values between 0.95 and 1.05.</div><div><br /></div><div>From <a href="https://mdcallag.github.io/reports/24_01_11.8u.1tno.cached.fbmy56/all.html#summary">the summary</a> for 5.6</div></div><div><ul style="text-align: left;"><li>The base case is fbmy5635_rel_202104072149</li><li>Comparing throughput in fbmy5635_rel_221222 to the base case</li><ul><li>Write-heavy</li><ul><li>l.i0, l.x, l.i1, l.i2 - relative QPS is <span style="background-color: #eeeeee;">1.02</span><span style="background-color: white;">, </span><span style="background-color: #eeeeee;">0.97</span><span style="background-color: white;">, </span><span style="background-color: #eeeeee;">0.97</span><span style="background-color: white;">, </span><span style="background-color: #eeeeee;">1.01</span></li></ul><li><span style="background-color: white;">Range queries</span></li><ul><li>qr100, qr500, qr1000 - relative QPS is <span style="background-color: #f4cccc;">0.93</span>, <span style="background-color: #f4cccc;">0.92</span>, <span style="background-color: #eeeeee;">0.99</span></li></ul><li><span style="background-color: white;">Point queries</span></li><ul><li>qp100, qp500, qp1000 - relative QPS is <span style="background-color: #eeeeee;">0.98</span>, <span style="background-color: #eeeeee;">1.03</span>, <span style="background-color: #eeeeee;">1.01</span></li></ul></ul></ul><div>From <a href="https://mdcallag.github.io/reports/24_01_11.8u.1tno.cached.fbmy80/all.html#summary">the summary</a> for 8.0</div></div><div><ul 
style="text-align: left;"><li>The base case is fbmy8028_rel_221222</li><li>The cost of the perf schema is <= 3% for write-heavy, <= 14% for range queries and <= 5% for point queries</li><li>Comparing throughput in fbmy8032_rel_221222 to the base case</li><ul><li>Write-heavy</li><ul><li>l.i0, l.x, l.i1, l.i2 - relative QPS is <span style="background-color: #eeeeee;">0.96</span><span style="background-color: white;">, </span><span style="background-color: #eeeeee;">1.02</span><span style="background-color: white;">, </span><span style="background-color: #eeeeee;">0.98</span><span style="background-color: white;">, </span><span style="background-color: #eeeeee;">0.97</span></li></ul><li><span style="background-color: white;">Range queries</span></li><ul><li>qr100, qr500, qr1000 - relative QPS is <span style="background-color: #d9ead3;">1.01</span>, <span style="background-color: #d9ead3;">1.06</span>, <span style="background-color: #eeeeee;">0.99</span></li></ul><li><span style="background-color: white;">Point queries</span></li><ul><li>qp100, qp500, qp1000 - relative QPS is <span style="background-color: #f4cccc;">0.95</span>, <span style="background-color: #eeeeee;">0.96</span>, <span style="background-color: #f4cccc;">0.95</span></li></ul></ul></ul><div>From <a href="https://mdcallag.github.io/reports/24_01_11.8u.1tno.cached.fbmy.all/all.html">the summary</a> for 5.6, 8.0 with many versions</div></div><div><ul style="text-align: left;"><li>The base case is fbmy5635_rel_202104072149</li><li>Comparing throughput in fbmy8032_rel_221222 to the base case</li><ul><li>Write-heavy</li><ul><li>l.i0, l.x, l.i1, l.i2 - relative QPS is <span style="background-color: #f4cccc;">0.84</span><span style="background-color: white;">, </span><span style="background-color: #f4cccc;">0.94</span><span style="background-color: white;">, </span><span style="background-color: #f4cccc;">0.95</span><span style="background-color: white;">, </span><span style="background-color: 
#eeeeee;">0.98</span></li></ul><li><span style="background-color: white;">Range queries</span></li><ul><li>qr100, qr500, qr1000 - relative QPS is <span style="background-color: #f4cccc;">0.95</span>, <span style="background-color: #d9ead3;">1.05</span>, <span style="background-color: #eeeeee;">1.00</span></li></ul><li><span style="background-color: white;">Point queries</span></li><ul><li>qp100, qp500, qp1000 - relative QPS is <span style="background-color: #d9ead3;">1.05</span>, <span style="background-color: #d9ead3;">1.11</span>, <span style="background-color: #d9ead3;">1.09</span></li></ul></ul></ul><div>From <a href="https://mdcallag.github.io/reports/24_01_11.8u.1tno.cached.fbmy.latest/all.html">the summary</a> for 5.6, 8.0 with latest versions</div></div><div><ul style="text-align: left;"><li>The base case is fbmy5635_rel_221222</li><li>Comparing throughput in fbmy8032_rel_221222 to the base case</li><ul><li>Write-heavy</li><ul><li>l.i0, l.x, l.i1, l.i2 - relative QPS is <span style="background-color: #f4cccc;">0.83</span><span style="background-color: white;">, </span><span style="background-color: #eeeeee;">0.97</span><span style="background-color: white;">, </span><span style="background-color: #eeeeee;">0.97</span><span style="background-color: white;">, </span><span style="background-color: #eeeeee;">0.97</span></li></ul><li><span style="background-color: white;">Range queries</span></li><ul><li>qr100, qr500, qr1000 - relative QPS is <span style="background-color: #d9ead3;">1.06</span>, <span style="background-color: #d9ead3;">1.06</span>, <span style="background-color: #d9ead3;">1.14</span></li></ul><li><span style="background-color: white;">Point queries</span></li><ul><li>qp100, qp500, qp1000 - relative QPS is <span style="background-color: #d9ead3;">1.07</span>, <span style="background-color: #d9ead3;">1.07</span>, <span style="background-color: #d9ead3;">1.07</span></li></ul></ul></ul></div></div>Mark 
Callaghanhttp://www.blogger.com/profile/09590445221922043181noreply@blogger.com0tag:blogger.com,1999:blog-9149523927864751087.post-22186588111423039682024-01-10T09:42:00.000-08:002024-01-10T13:19:26.532-08:00Updated Insert benchmark: Postgres 9.x to 16.x, small server, cached database, v2<p>I recently <a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-postgres-9x-to.html">shared results</a> for the updated <a href="https://smalldatum.blogspot.com/2023/12/updates-for-insert-benchmark-december.html">Insert Benchmark</a> with Postgres versions 9.0 to 16 using a small server and cached database. Here I have results for a slightly larger but still cached database. The reason for using a larger database is to get some of the benchmark steps to run for more time.</p><p>tl;dr</p><p></p><ul style="text-align: left;"><li>Results here are similar to the <a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-postgres-9x-to.html">previous results</a> although a performance problem during the l.i1 and l.i2 benchmark steps is more clear here. 
In some benchmark steps the planner can spend too much CPU time trying to determine the min and/or max value of a column by reading from the index.</li><li>While Postgres performance is mostly getting better from old to new releases, there have been regressions in a few major releases (PG 11 through 13) for benchmark steps where this is an issue.</li><li>The regressions are likely to be larger for the IO-bound benchmark but that will take a few more days to finish.</li></ul><div><b>The Problem</b></div><div><br /></div><div>I <a href="https://twitter.com/MarkCallaghanDB/status/1744385310061109307">shared details</a> about the problem here and as expected a Postgres expert <a href="https://twitter.com/petervgeoghegan/status/1744387728660160803">quickly replied</a> with advice <a href="https://git.postgresql.org/gitweb/?p=postgresql.git;a=commit;h=9c6ad5eaa957bdc2132b900a96e0d2ec9264d39c">pointing me</a> to a few changes that improve the problem.</div><div><br /></div><div>The problem is the pattern of inserts and deletes. Several of the benchmark steps do inserts in ascending PK order (inserts to the head) while doing deletes at the same rate to keep the number of rows fixed. The deletes are done from the other end of the table (deletes to the tail) by removing batches of rows with the smallest value for the PK.<br /><br />The PG planner has code in <a href="https://github.com/postgres/postgres/blob/5b2da240e01ecaef8181b0feebaeb69e6fefdaa0/src/backend/utils/adt/selfuncs.c#L6087">get_actual_variable_range</a> to determine the min or max value of a column when there is a predicate on that column like <i>X < $const</i> or <i>X > $const</i> and $const falls into the largest or smallest histogram bucket. From <a href="https://gist.github.com/mdcallag/f320b859a59770aaf2c12eebe138c946#file-gistfile1-txt-L27">PMP thread stacks</a>, what I see is too much time with that function on the call stack. 
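The insert-to-the-head, delete-from-the-tail pattern described above can be modeled with a toy sketch (an illustration of the workload shape, not the benchmark client):

```python
from collections import deque

def run_step(table: deque, next_pk: int, batch: int) -> int:
    """Insert `batch` rows with ascending PKs and delete the `batch`
    rows with the smallest PKs, keeping the row count constant."""
    for _ in range(batch):
        table.append(next_pk)   # inserts go to the head (largest PKs)
        next_pk += 1
    for _ in range(batch):
        table.popleft()         # deletes remove the smallest PKs
    return next_pk

# The minimum PK keeps rising, so a predicate like "PK < $const" that
# falls into the smallest histogram bucket makes get_actual_variable_range
# probe past ever more deleted index entries to find the actual minimum.
```

After each step the table holds the same number of rows but its PK range has shifted upward, which is what leaves a growing run of dead entries at the low end of the index.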
From <a href="https://gist.github.com/mdcallag/f320b859a59770aaf2c12eebe138c946#file-gistfile1-txt-L8-L9">ps output</a>, the session that does delete statements can use 10X to 100X more CPU than the session that does insert statements. From <i><a href="https://gist.github.com/mdcallag/f320b859a59770aaf2c12eebe138c946#file-gistfile1-txt-L15">explain analyze</a></i> I see that the planner spends ~100 milliseconds per delete statement.</div><div><br /></div><div><b>Build + Configuration</b></div><div><br /></div><div>See the <a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-postgres-9x-to.html">previous report</a> for more details. I used all of the versions described there: 9.0.23, 9.1.24, 9.2.24, 9.3.25, 9.4.26, 9.5.25, 9.6.24, 10.23, 11.22, 12.17, 13.13, 14.10, 15.5, 16.1. And then I also tested 11.19 and 13.10.</div><div><br /></div><div><b>The Benchmark</b></div><div><br /></div><div>The benchmark is <a href="https://smalldatum.blogspot.com/2024/01/updated-insert-benchmark-postgres-9x-to.html">explained here</a> except the first benchmark step, l.i0, loads 30M rows/table here while previously it only loaded 20M. The database still fits in memory as the test server has 16G of RAM and the database tables are ~8G.</div><div><br /></div><div>The test server was named SER4 in the previous report. It has 8 cores, 16G RAM, Ubuntu 22.04 and XFS using 1 m.2 device.</div><div><br />The benchmark steps are:<div><p></p><div><ul style="text-align: left;"><li>l.i0</li><ul><li>insert 30 million rows per table in PK order. The table has a PK index but no secondary indexes. There is one connection per client.</li></ul><li>l.x</li><ul><li>create 3 secondary indexes per table. There is one connection per client.</li></ul><li>l.i1</li><ul><li>use 2 connections/client. One inserts 50M rows and the other does deletes at the same rate as the inserts. Each transaction modifies 50 rows (big transactions). 
This step is run for a fixed number of inserts, so the run time varies depending on the insert rate.</li></ul><li>l.i2</li><ul><li>like l.i1 but each transaction modifies 5 rows (small transactions).</li><li>Wait for X seconds after the step finishes to reduce variance during the read-write benchmark steps that follow.</li></ul><li>qr100</li><ul><li>use 3 connections/client. One does range queries for 1800 seconds and performance is reported for this. The second does 100 inserts/s and the third does 100 deletes/s. The second and third are less busy than the first. The range queries use covering secondary indexes. This step is run for a fixed amount of time. If the target insert rate is not sustained then that is considered to be an SLA failure. If the target insert rate is sustained then the step does the same number of inserts for all systems tested.</li></ul><li>qp100</li><ul><li>like qr100 except uses point queries on the PK index</li></ul><li>qr500</li><ul><li>like qr100 but the insert and delete rates are increased from 100/s to 500/s</li></ul><li>qp500</li><ul><li>like qp100 but the insert and delete rates are increased from 100/s to 500/s</li></ul><li>qr1000</li><ul><li>like qr100 but the insert and delete rates are increased from 100/s to 1000/s</li></ul><li>qp1000</li><ul><li>like qp100 but the insert and delete rates are increased from 100/s to 1000/s</li></ul></ul><div><b>Results</b><br /><br />The benchmark report <a href="https://mdcallag.github.io/reports/24_01_10.1u.1tno.bee.cached.pg/all.html">is here</a>.</div><div><br />I start with the summary for the <a href="https://mdcallag.github.io/reports/24_01_10.1u.1tno.bee.cached.pg/all.html#summary">current round</a> with 30M rows loaded and the <a href="https://mdcallag.github.io/reports/24_01_01.1u.1tno.bee.cached.pg/all.html#summary">previous round</a> with 20M rows loaded.
Here I focus on the benchmark steps where things are slightly different between the current and previous rounds -- the results for the l.i1 and l.i2 benchmark steps where regressions are more obvious in the current round.</div></div></div></div><div><br /></div><div>The summary has 3 tables. The first shows absolute throughput by DBMS tested X benchmark step. The second has throughput relative to the version on the first row of the table. The third shows the background insert rate for benchmark steps with background inserts and all systems sustained the target rates. The second table makes it easy to see how performance changes over time.<br /><br />There are big regressions in 11.19, 11.22 and a small one in 13.13 for the l.i1 and l.i2 benchmark steps which is visible in <a href="https://mdcallag.github.io/reports/24_01_10.1u.1tno.bee.cached.pg/all.html#summary">the summary</a>.</div><div><ul style="text-align: left;"><li>For the l.i1 benchmark step the inserts/s rate drops from ~18k/s in 9.6.24 and 10.23 to ~11k/s in 11.19 and 11.22. It also drops by ~14% from 13.10 to 13.13.</li><li>The regressions for the l.i2 benchmark step occur in the same versions but are larger. The issue is that the delete statements in l.i1 delete more rows per statement, so the planner overhead per deleted row is larger for l.i2.<br /></li></ul><div>From the iostat and vmstat metrics collected per benchmark step with both absolute and normalized values (normalized values are absolute value divided by the insert rate) I see that the CPU overhead (cpupq is CPU usecs per insert) per version is inversely correlated with the insert rate.<br /><br />This table shows the value of cpupq (CPU overhead) per version for the l.i1 and l.i2 benchmark steps. 
All of the numbers for iostat and vmstat are here <a href="https://gist.github.com/mdcallag/0ab64df12ab55a7a324c18c7518b9cce">for l.i1</a> and <a href="https://gist.github.com/mdcallag/f71fe75f09c3450a8c47e3a576d711b9">for l.i2</a>.<br /><br /><table border="1" cellpadding="3" cellspacing="0" style="border-collapse: collapse;"><tbody><tr><td><b>version</b></td><td><b>l.i1</b></td><td><b>l.i2</b></td></tr><tr><td>10.23</td><td>1253</td><td>5157</td></tr><tr><td>11.19</td><td>1619</td><td>8285</td></tr><tr><td>11.22</td><td>1623</td><td>6611</td></tr><tr><td>12.17</td><td>1263</td><td>5815</td></tr><tr><td>13.10</td><td>1222</td><td>3373</td></tr><tr><td>13.13</td><td>1367</td><td>4863</td></tr><tr><td>14.10</td><td>1126</td><td>3449</td></tr></tbody></table></div></div><div><br /></div><div>The table above includes all CPU overhead from everything running on the server (Postgres and the benchmark client). The data below shows the CPU time per session measured by ps near the end of a benchmark step. There is one connection/session that only does delete statements and another that only does insert statements. The output from ps <a href="https://gist.github.com/mdcallag/cf23c0f82fc58771544d0f53a40eb09d">is here</a>. The table below has the CPU seconds per version for both connections -- insert and delete.
There are big changes in CPU overhead for the delete connection.</div><div><br /></div><div><table border="1" cellpadding="3" cellspacing="0" style="border-collapse: collapse;"><tbody><tr><td><b>version</b></td><td><b>insert</b></td><td><b>delete</b></td></tr><tr><td>10.23</td><td>587</td><td>2587</td></tr><tr><td>11.19</td><td>574</td><td>5196</td></tr><tr><td>11.22</td><td>497</td><td>3851</td></tr><tr><td>12.17</td><td>573</td><td>3137</td></tr><tr><td>13.10</td><td>532</td><td>1278</td></tr><tr><td>13.13</td><td>548</td><td>2403</td></tr><tr><td>14.10</td><td>532</td><td>1317</td></tr></tbody></table></div><p></p>Mark Callaghanhttp://www.blogger.com/profile/09590445221922043181noreply@blogger.com0tag:blogger.com,1999:blog-9149523927864751087.post-26089969529187252422024-01-08T14:01:00.000-08:002024-01-08T14:03:16.461-08:00Explaining changes in RocksDB performance for IO-bound workloads<p>I have two recent posts for RocksDB benchmarks (<a href="https://smalldatum.blogspot.com/2023/10/checking-rocksdb-7x-and-8x-for.html">here</a> and <a href="https://smalldatum.blogspot.com/2024/01/rocksdb-8x-benchmarks-large-server-io.html">here</a>) that mention there might be a regression in IO-bound workloads starting in version 8.6 when buffered IO is used. I have one <a href="https://smalldatum.blogspot.com/2023/11/debugging-perf-changes-in-rocksdb-86-on.html">recent post</a> that started to explain the problem.
The root cause is changes to code that does readahead for compaction and the problem is worse when the value for the <a href="https://github.com/facebook/rocksdb/blob/a399bbc0370932910454029fe4d49229212ac6cf/include/rocksdb/options.h#L964">compaction_readahead_size option</a> is larger than the value for <a href="https://www.google.com/search?q=linux+min_sectors_kb">max_sectors_kb</a> of the underlying storage device(s). And this is more complex when RAID is used. Some of my test servers use SW RAID 0 and I don't know whether the value for the underlying devices or for the SW RAID device takes precedence.</p><p>tl;dr</p><p></p><ul style="text-align: left;"><li>With RocksDB 8.6+ you might need to set compaction_readahead_size so that it isn't larger than max_sectors_kb. I opened RocksDB <a href="https://github.com/facebook/rocksdb/issues/12038">issue 12038</a> for this.</li></ul><div><b>Benchmarks</b></div><div><br /></div><div>The benchmark is described in a <a href="https://smalldatum.blogspot.com/2024/01/rocksdb-8x-benchmarks-large-server-io.html">previous post</a>. The test server has 40 cores, 80 HW threads, hyperthreads enabled, 256G of RAM and XFS with SW RAID 0 over 6 devices. The value of max_sectors_kb is 128 for the SW RAID device (md2) and 1280 for the underlying SSDs.</div><div><br /></div><div>Tests were repeated for RocksDB versions 8.4.4, 8.5.4, 8.6.7, 8.7.3, 8.8.1, 8.9.2.<br /><br />I repeated the IO-bound benchmark using buffered IO in 3 setups:</div><div><ul style="text-align: left;"><li>default - this uses the default for compaction_readahead_size which is 0 prior to RocksDB 8.7 and 2MB starting in RocksDB 8.7. </li><li>crs.1MB - explicitly set compaction_readahead_size=1MB</li><li>crs.512KB - explicitly set compaction_readahead_size=512KB</li></ul></div><div>Code for compaction readahead changed in both RocksDB 8.5 and 8.6.
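Per the tl;dr above, it is worth comparing compaction_readahead_size against max_sectors_kb. On Linux the latter is exposed in sysfs for the SW RAID device and its member devices alike; a minimal sketch (device names such as md2 vary by host):

```python
from pathlib import Path

# Print max_sectors_kb for each block device that exposes it. On a SW RAID
# host, compare the md device with its member SSDs, since it is unclear
# which value takes precedence for readahead.
for q in sorted(Path("/sys/block").glob("*/queue/max_sectors_kb")):
    print(q.parts[3], q.read_text().strip())
```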
A side-effect of this change is that using compaction_readahead_size=0 is bad for performance because it means there will be (almost) no readahead.</div><div><div><br /></div></div><div><b>Results</b></div><div><br /></div><div>Below there are three graphs. The first shows throughput, the second shows the average value for read MB/s per iostat and the third shows the average value for read request size (rareq-sz) per iostat. All of these are measured during the overwrite benchmark step which is write-only and suffers when compaction cannot keep up.<br /><br />The performance summaries from the benchmark scripts <a href="https://gist.github.com/mdcallag/6968f2baa8822dff68ba8acd74b57902">are here</a> and the iostat summary <a href="https://gist.github.com/mdcallag/895cd957cccaa690d367ad1b80f2bd48">is here</a>.</div><div><br /></div><div>Summary</div><div><ul style="text-align: left;"><li>Throughput is lousy in 8.6.7 because the benchmark client (db_bench) hardwired the value for compaction_readahead_size to 0 rather than use the default of 2MB.</li><li>Throughput is best with compaction_readahead_size =1MB and worst with it =512KB</li><li>The IO rate (read MB/s) is best with compaction_readahead_size =2MB, but that doesn't translate to better throughput for the application.</li><li>The average read size from storage (rareq-sz) is best with compaction_readahead_size =1MB and worst with it =2MB</li><li>Note that better or worse here depends on context and a big part of the context is the value of max_sectors_kb.
So changing the default for compaction_readahead_size from 2MB to 1MB might be good in some cases but probably not all cases.</li></ul></div><div class="separator" style="clear: both; text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhxq1CVWvsCK7QTf29lV3hYpuQrviOBxEgjLBIZPJ_pqCxEpA0_jLnKbkPXnKNK4I9pa4lHlG8AdE_O0rimnayur2hZUwWIlonGKZC0SvBb1Zmw9PK0-91ddTe735pzNIKe57X5p485BRC_vJymy_BWM78OOoIYluV4g_Mfyc0wluOkm7uUUJefZ25tYWF0/s600/Throughput%20for%20overwrite%20by%20compaction_readahead_size.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhxq1CVWvsCK7QTf29lV3hYpuQrviOBxEgjLBIZPJ_pqCxEpA0_jLnKbkPXnKNK4I9pa4lHlG8AdE_O0rimnayur2hZUwWIlonGKZC0SvBb1Zmw9PK0-91ddTe735pzNIKe57X5p485BRC_vJymy_BWM78OOoIYluV4g_Mfyc0wluOkm7uUUJefZ25tYWF0/w640-h396/Throughput%20for%20overwrite%20by%20compaction_readahead_size.png" width="640" /></a></div><div class="separator" style="clear: both; text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEg2fsUW6QYtjOG5ygmBb0rCAQJb1DoiAWsnO2qcZkQUYSJV2lvRuM2bdsWo4oj3o-39759vuqAv3ZDlWL9t9F3gLvOJzhyDhwcgYf_uWHYonyOO-EauzPlAtxui_wM0lMoym0E_3z7vMbX8LV7fZIu9suAryoe0bloIszec5CVmmGQTQDYOO86Indt3qgwn/s600/iostat%20read%20MB_s%20by%20compaction_readahead_size.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEg2fsUW6QYtjOG5ygmBb0rCAQJb1DoiAWsnO2qcZkQUYSJV2lvRuM2bdsWo4oj3o-39759vuqAv3ZDlWL9t9F3gLvOJzhyDhwcgYf_uWHYonyOO-EauzPlAtxui_wM0lMoym0E_3z7vMbX8LV7fZIu9suAryoe0bloIszec5CVmmGQTQDYOO86Indt3qgwn/w640-h396/iostat%20read%20MB_s%20by%20compaction_readahead_size.png" width="640" /></a></div><div><div class="separator" style="clear: both; text-align: center;"><a 
href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiX0ZuFx1x-ckkheAGn0K5ahchoM4RVB16JfwjvmRO1rdYQa9PxAHSK9kBvndDcye7s8lYXRpDgi9BGSfvdpDSTyOrjY47qkyJpYdBMtzuEEjikPXSSVKvevWwTrrI9MaG6n1T2_nVe61Mrrq_utX2TGobaPKA6hp487TL4gfiYWmA7zxqMyXMaeeTbVlsG/s600/iostat%20average%20read%20size%20(rareq-sz)%20by%20compaction_readahead_size.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiX0ZuFx1x-ckkheAGn0K5ahchoM4RVB16JfwjvmRO1rdYQa9PxAHSK9kBvndDcye7s8lYXRpDgi9BGSfvdpDSTyOrjY47qkyJpYdBMtzuEEjikPXSSVKvevWwTrrI9MaG6n1T2_nVe61Mrrq_utX2TGobaPKA6hp487TL4gfiYWmA7zxqMyXMaeeTbVlsG/w640-h396/iostat%20average%20read%20size%20(rareq-sz)%20by%20compaction_readahead_size.png" width="640" /></a></div><br /><div><br /></div><div><br /></div><p></p></div>Mark Callaghanhttp://www.blogger.com/profile/09590445221922043181noreply@blogger.com0tag:blogger.com,1999:blog-9149523927864751087.post-43437026223259515002024-01-04T12:44:00.000-08:002024-01-08T13:31:46.556-08:00RocksDB 8.x benchmarks: large server, IO-bound<p>This post has results for performance tests in all versions of 8.x from 8.0.0 to 8.9.2 using a large server and IO-bound workload. In <a href="https://smalldatum.blogspot.com/2023/12/rocksdb-8x-benchmarks-large-server.html">a previous post</a> I shared results for the same hardware with a cached database.</p><p>tl;dr</p><p></p><ul><li>There is a small regression that arrives in RocksDB 8.6 for overwriteandwait (write-only, random writes). But only for buffered IO. I think this is caused by changes to compaction readahead. For now I will reuse RocksDB <a href="https://github.com/facebook/rocksdb/issues/12038">issue 12038</a> for this.</li></ul><div>I focus on the benchmark steps that aren't read-only because they suffer less from noise. 
These benchmark steps are fillseq, revrangewhilewriting, fwdrangewhilewriting, readwhilewriting and overwriteandwait. I also focus on leveled more so than universal, in part because there is more noise with universal but also because the workloads I care about most use leveled.</div><div><p><b>Builds</b></p><div>I compiled with gcc RocksDB 8.0.0, 8.1.1, 8.2.1, 8.3.3, 8.4.4, 8.5.4, 8.6.7, 8.7.3, 8.8.1 and 8.9.2, which are the latest patch releases.</div></div><div><br /></div><div><div><b>Benchmark</b></div><div><br />All tests used a server with 40 cores, 80 HW threads, 2 sockets, 256GB of RAM and many TB of fast NVMe SSD with Linux 5.1.2, XFS and SW RAID 0 across 6 devices. For the results here, the workload is IO-bound because the database is larger than memory. The benchmark was repeated for leveled and universal compaction using both buffered IO and O_DIRECT.</div><div><br /></div><div>Everything used the LRU block cache and the default value for compaction_readahead_size. Soon I will switch to using the hyper clock cache once RocksDB 9.0 arrives.<br /><br />I used <a href="https://github.com/mdcallag/mytools/tree/master/bench/rx2">my fork of the RocksDB benchmark scripts</a> that are wrappers to run db_bench. These run db_bench tests in a special sequence -- load in key order, read-only, do some overwrites, read-write and then write-only. The benchmark was run using 24 threads. <span style="background-color: white; color: #222222;">How I do benchmarks for RocksDB is explained </span><a href="https://smalldatum.blogspot.com/2022/08/how-i-do-performance-tests-for-rocksdb.html" style="background-color: white;">here</a><span style="background-color: white; color: #222222;"> and </span><a href="https://smalldatum.blogspot.com/2022/08/how-i-do-rocksdb-performance-tests-part.html" style="background-color: white;">here</a><span style="background-color: white; color: #222222;">.
The command line to run them is: </span></div><div><span style="background-color: white; color: #222222;"><blockquote><span style="font-family: courier; font-size: x-small;">bash x3.sh 24 no 3600 c40r256bc180 40000000 4000000000 iobuf iodir</span></blockquote></span></div><div>A spreadsheet with all results <a href="https://docs.google.com/spreadsheets/d/11WwnwBmB6HvbnW_H1rEpMMFk1s2zu4p_uxtbu8Mbv9c/edit?usp=sharing">is here</a> and performance summaries are here:</div></div><div><ul style="text-align: left;"><li>buffered IO - <a href="https://gist.github.com/mdcallag/39adbf9dae4665ec7c1ca1ecd27e3e6b">for leveled</a> and <a href="https://gist.github.com/mdcallag/85dfd2162ef2a8d37c3196c87d315643">for universal</a></li><li>O_DIRECT - <a href="https://gist.github.com/mdcallag/4c5a3884383e36165a7fd5bb4a4b5237">for leveled</a> and <a href="https://gist.github.com/mdcallag/3578ab18bab559b2cf96f8bcd188090d">for universal</a></li></ul><div><b>Results: leveled</b></div></div><div><br /></div><div>There is one fake regression in overwriteandwait for RocksDB 8.6.7. The issue is that the db_bench benchmark client ignored a new default value for compaction_readahead_size. That has been fixed in 8.7.</div><div><br /></div><div>There is one real regression in overwriteandwait that probably arrived in 8.6 and is definitely in 8.7 through 8.9. The throughput for overwriteandwait drops about 5% from 8.5 to 8.7+. I assume this is from changes to compaction readahead that arrived in 8.6.
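One way to quantify how much compaction threads wait on IO is the ratio of compaction CPU seconds (c_csecs) to compaction wall clock seconds (c_wsecs) from the performance summaries linked above. A small sketch using approximate values from this post:

```python
def cpu_to_wall_ratio(c_csecs, c_wsecs):
    """Fraction of compaction wall clock time spent on CPU; values near
    1.0 mean compaction threads rarely stall on IO."""
    return c_csecs / c_wsecs

# Approximate overwriteandwait values with buffered IO from this post:
print(round(cpu_to_wall_ratio(18000, 18200), 2))  # RocksDB 8.5: ~0.99
print(round(cpu_to_wall_ratio(17200, 18700), 2))  # RocksDB 8.7+: ~0.92
```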
These changes are for readahead done when buffered IO is used, but not when O_DIRECT is used and in the charts below the regression does not repeat with O_DIRECT.</div><div><br /></div><div>From the performance summary for overwriteandwait with buffered IO (<a href="https://gist.github.com/mdcallag/39adbf9dae4665ec7c1ca1ecd27e3e6b#file-summary-tsv-L127-L130">see here</a>)<br /><ul style="text-align: left;"><li>compaction wall clock time (c_wsecs) increases by ~3% from ~18200 in 8.5 to ~18700 in 8.7+</li><li>compaction CPU seconds (c_csecs) decreases by ~5% from ~18000 in 8.5 to ~17200 in 8.7+</li><li>the c_csecs / c_wsecs ratio is ~0.99 for 8.0 thru 8.5 and drops to ~0.92 in 8.7+, so one side effect of the change in 8.6 is that compaction threads see more IO latency</li><li>this issue doesn't repeat with O_DIRECT, <a href="https://gist.github.com/mdcallag/4c5a3884383e36165a7fd5bb4a4b5237#file-summary-tsv-L127-L130">see here</a></li></ul><div>From iostat metrics during overwriteandwait with buffered IO</div><div><ul style="text-align: left;"><li>rawait (r_await) drops from 0.21 in 8.5 to ~0.08 in 8.7+</li><li>rareq-sz (rareqsz) drops from 28.3 in 8.5 to ~9 in 8.7+</li><li>the decrease in rawait was expected given the decrease in rareq-sz; the real problem is the drop in rareq-sz as the only reads during overwriteandwait are from compaction</li><li>this issue doesn't repeat with O_DIRECT</li></ul></div><div><div><span style="font-family: courier; font-size: xx-small;">leveled, buffered IO</span></div><div><span style="font-family: courier; font-size: xx-small;">c rps rmbps rrqmps rawait rareqsz wps wmbps wrqmps wawait wareqsz ver</span></div><div><span style="font-family: courier; font-size: xx-small;">3762 4762 70.0 0.00 0.21 28.3 5648 576.2 0.00 0.06 104.3 8.5.4</span></div><div><span style="font-family: courier; font-size: xx-small;">3879 21308 90.9 0.00 0.05 4.2 4393 447.7 0.00 0.06 104.8 8.6.7</span></div><div><span style="font-family: courier; font-size:
xx-small;">3790 9029 79.9 0.00 0.07 9.0 5229 535.8 0.00 0.06 105.2 8.7.3</span></div><div><span style="font-family: courier; font-size: xx-small;">3790 9678 74.5 0.00 0.08 8.3 5283 539.6 0.00 0.06 104.8 8.8.1</span></div><div><span style="font-family: courier; font-size: xx-small;">3790 9808 75.5 0.00 0.08 8.5 5298 540.1 0.00 0.06 104.7 8.9.2</span></div></div><div><span style="font-family: courier; font-size: xx-small;"><br /></span></div><div><span style="font-family: courier; font-size: xx-small;">leveled, O_DIRECT</span></div><div><span style="font-family: courier; font-size: xx-small;"><div>c rps rmbps rrqmps rawait rareqsz wps wmbps wrqmps wawait wareqsz ver</div><div>3765 5236 619.4 0.00 0.32 120.5 5779 687.7 0.00 0.07 121.1 8.5.4</div><div>4187 37528 340.4 0.00 0.09 9.2 1908 218.5 0.00 0.06 118.0 8.6.7</div><div>3754 5170 612.6 0.00 0.33 121.1 5708 679.1 0.00 0.07 121.4 8.7.3</div><div>3759 5084 602.8 0.00 0.35 121.1 5612 668.0 0.00 0.08 121.5 8.8.1</div><div>3759 5048 598.1 0.00 0.37 121.1 5574 663.3 0.00 0.08 121.4 8.9.2</div></span></div></div><div><br />These charts show relative QPS which is (QPS for my version / QPS for RocksDB 8.0).</div><div><br /></div><div>First is with buffered IO (no O_DIRECT)</div><div class="separator" style="clear: both; text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgLU2GBJo96DlTVErHIf3NFrANfhUupRZPDT2bDPWkVX059dIGpdgEeatNpLxivBOj8sNJZq3x3LOBRj-ZkbzUOIxKlzYKAra6y6_QgS-eXEUxgTnPafHDQnXxt4s7SyXmS19l2LO-9jqIfUvLXRUzq8FaEZEhL3VZdmHuP0ab0QWQwpEYlRE-P7nvk6Cie/s600/QPS%20relative%20to%20RocksDB%208.0.0_%20buffered%20IO,%20leveled.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" 
src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgLU2GBJo96DlTVErHIf3NFrANfhUupRZPDT2bDPWkVX059dIGpdgEeatNpLxivBOj8sNJZq3x3LOBRj-ZkbzUOIxKlzYKAra6y6_QgS-eXEUxgTnPafHDQnXxt4s7SyXmS19l2LO-9jqIfUvLXRUzq8FaEZEhL3VZdmHuP0ab0QWQwpEYlRE-P7nvk6Cie/w640-h396/QPS%20relative%20to%20RocksDB%208.0.0_%20buffered%20IO,%20leveled.png" width="640" /></a></div><div>Next is with O_DIRECT (no OS page cache)</div><div class="separator" style="clear: both; text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhvmm13jzvFHf04bc-XbC7tYtwOV03xo81s1I-pU0GnQ3s9NdYv-_MZKHz36Ynjbpu_Vh1dgiEtDZCLcKB1T6HYLU23f8ZfSIXGNlVfdVxKGchRQOTdWRAEd8hUCLwtvXHphJh9zuOtvWgMvrA4wrkx9Q9QXpe5qOduxZszR-B3rFVSSJ-hl5NAIEZlpioU/s600/QPS%20relative%20to%20RocksDB%208.0.0_%20O_DIRECT,%20leveled.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhvmm13jzvFHf04bc-XbC7tYtwOV03xo81s1I-pU0GnQ3s9NdYv-_MZKHz36Ynjbpu_Vh1dgiEtDZCLcKB1T6HYLU23f8ZfSIXGNlVfdVxKGchRQOTdWRAEd8hUCLwtvXHphJh9zuOtvWgMvrA4wrkx9Q9QXpe5qOduxZszR-B3rFVSSJ-hl5NAIEZlpioU/w640-h396/QPS%20relative%20to%20RocksDB%208.0.0_%20O_DIRECT,%20leveled.png" width="640" /></a></div><div><b>Results: universal</b></div><div><b><br /></b></div><div>Summary</div><div><ul style="text-align: left;"><li>Just like above for leveled, there is a bogus regression for overwriteandwait with RocksDB 8.6</li><li>Results here have more variance than the results for leveled above. While I have yet to prove this, universal compaction benchmarks are likely prone to more variance. 
So I don't think there are regressions here.</li></ul></div><div>These charts show relative QPS which is (QPS for my version / QPS for RocksDB 8.0).</div><div><br /></div><div>First is with buffered IO (no O_DIRECT)</div><div class="separator" style="clear: both; text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgR_756Gl7muxrhmCZIfqbgxLJ5BhsedlFfqD2_GtlTTq6IRBuREt7mK4ffPVPMUXSxuAmjt74vnShQ8aQfR-ZEV60QTuiq_P_mpz5lJhvUiwyYzfVbziVyun-Yy1tehCISgrgMod4lerLIeeH90QYhsWtYICGjgwiKoQ2YtH5tvS_doWwTBb-KpEhD9NJd/s600/QPS%20relative%20to%20RocksDB%208.0.0_%20buffered%20IO,%20universal.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgR_756Gl7muxrhmCZIfqbgxLJ5BhsedlFfqD2_GtlTTq6IRBuREt7mK4ffPVPMUXSxuAmjt74vnShQ8aQfR-ZEV60QTuiq_P_mpz5lJhvUiwyYzfVbziVyun-Yy1tehCISgrgMod4lerLIeeH90QYhsWtYICGjgwiKoQ2YtH5tvS_doWwTBb-KpEhD9NJd/w640-h396/QPS%20relative%20to%20RocksDB%208.0.0_%20buffered%20IO,%20universal.png" width="640" /></a></div><div>Next is with O_DIRECT (no OS page cache)</div><div class="separator" style="clear: both; text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhf6j0J-C0EF-rEgHx96gn-j-ZMoHJ6oH_87jJmDcRlg2zEPho-KssVOcREgHWcPyKcBZ2mW0ilD0DZbw0DIui4sdwhyphenhyphenJpgzKt2jM6DuIOpJKFvbBTXlhBSI85GComMJ9LPVe_rqNBiPHxOlx9mVMxug32GyAbwV46pER-XCsVDpr2FJu37nwYJLU-olqRO/s600/QPS%20relative%20to%20RocksDB%208.0.0_%20O_DIRECT,%20universal.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" 
src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhf6j0J-C0EF-rEgHx96gn-j-ZMoHJ6oH_87jJmDcRlg2zEPho-KssVOcREgHWcPyKcBZ2mW0ilD0DZbw0DIui4sdwhyphenhyphenJpgzKt2jM6DuIOpJKFvbBTXlhBSI85GComMJ9LPVe_rqNBiPHxOlx9mVMxug32GyAbwV46pER-XCsVDpr2FJu37nwYJLU-olqRO/w640-h396/QPS%20relative%20to%20RocksDB%208.0.0_%20O_DIRECT,%20universal.png" width="640" /></a></div><div><br /></div><div><br /></div>Mark Callaghanhttp://www.blogger.com/profile/09590445221922043181noreply@blogger.com0tag:blogger.com,1999:blog-9149523927864751087.post-26716427175201796862024-01-03T17:29:00.000-08:002024-01-03T17:46:38.995-08:00innodb_log_writer_threads and the Insert Benchmark<p>I am wary of innodb_log_writer_threads=ON. It is <a href="https://dev.mysql.com/doc/refman/8.0/en/innodb-parameters.html#sysvar_innodb_log_writer_threads">on by default</a> and has been a problem for me in the past. It would be great to learn from people for whom it is useful. This is a follow-up to a previous post where I mentioned that things looked bad with innodb_log_writer_threads.
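The workaround discussed in this post is a one-line my.cnf change. A minimal fragment (other settings come from the configurations linked in this post):

```ini
# Hypothetical my.cnf fragment: disable dedicated log writer threads,
# the workaround for the regressions described in this post.
[mysqld]
innodb_log_writer_threads=OFF
```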
I opened <a href="https://bugs.mysql.com/bug.php?id=113485">bug 113485</a> to suggest that one of the following should be done: make the default =OFF or at least let innodb_dedicated_server disable it on small servers.</p><p>The <a href="https://dev.mysql.com/doc/refman/8.0/en/optimizing-innodb-logging.html">MySQL docs</a> suggest only using =ON for high-concurrency workloads, alas it is =ON by default.<br /><span style="background-color: white; color: #555555; font-size: 14.256px;"></span></p><blockquote><span style="font-family: inherit;">Dedicated log writer threads can improve performance on high-concurrency systems, but for low-concurrency systems, disabling dedicated log writer threads provides better performance.</span></blockquote><p>tl;dr, v1</p><p></p><ul style="text-align: left;"><li>innodb_log_writer_threads seems to make things worse most of the time</li><li>the workaround is innodb_log_writer_threads=OFF</li><li>sadly, it is =ON by default</li></ul><p></p><p>tl;dr, v2</p><p></p><ul style="text-align: left;"><li>Sometimes innodb_log_writer_threads helps, more often it doesn't in my tests</li><li>innodb_log_writer_threads increases the frequency of fsyncs per commit by a large amount -- between 3X and 200X depending on the setup. The impact from this is less obvious on the 40-core server that has a fast fsync. The impact is really bad on a server that doesn't have a fast fsync.</li><li>innodb_log_writer_threads shouldn't be used on small servers with <= X CPU cores. 
For the servers I tested X=8 but I suspect it is even larger.</li><li>It can be lousy for performance when fsync latency is high (several milliseconds)</li><li>I filed a <a href="https://bugs.mysql.com/bug.php?id=113485">feature request</a> to change the default for innodb_log_writer_threads to =OFF and/or detect when the number of CPU cores is not large and disable it by default.</li></ul><p></p><p style="font-weight: bold;"><b>The bugs</b></p>The redo log code was changed in a big way in MySQL 8.0 and my experience with that has not been great. It was nice to get the ability to disable the new features, but that (<a href="https://dev.mysql.com/doc/refman/8.0/en/innodb-parameters.html#sysvar_innodb_log_writer_threads">innodb_log_writer_threads</a>) didn't arrive until 8.0.22.<br /><ul style="text-align: left;"><li>I reported <a href="https://bugs.mysql.com/bug.php?id=90670">bug 90670</a> for MySQL 8.0.11. This is a crashing bug that was fixed in 8.0.13. It was found via sysbench. I assume I could have found it with the Insert Benchmark.</li><li>I reported <a href="https://bugs.mysql.com/bug.php?id=90890">bug 90890</a> for MySQL 8.0.11. This is a perf bug that was fixed in 8.0.14. It was found via the Insert Benchmark. The perf bug is that the CPU overhead/operation had doubled vs 5.7 releases.</li><li>I reported <a href="https://bugs.mysql.com/bug.php?id=90993">bug 90993</a> for MySQL 8.0.11. This is a crashing bug that was fixed in 8.0.13. It was found via the Insert Benchmark.</li><li>I reported <a href="https://bugs.mysql.com/bug.php?id=102238">bug 102238</a> for MySQL 8.0.22. This is a perf bug that is still open and the workaround is to use innodb_log_writer_threads=OFF. </li></ul><div><b>The tuning options</b></div><div><br />When innodb_log_writer_threads=ON there will be more spinning, which not only means more CPU can be burned, but also that there are 3 new config options for tuning how the spin wait loops happen. By my count that is 3 options too many.
I did not try to tune these. From <a href="https://dev.mysql.com/doc/refman/8.0/en/optimizing-innodb-logging.html">the docs</a>, the config options for this feature are:<br /><ul style="text-align: left;"><li><a href="https://dev.mysql.com/doc/refman/8.0/en/innodb-parameters.html#sysvar_innodb_log_wait_for_flush_spin_hwm">innodb_log_wait_for_flush_spin_hwm</a></li><li><a href="https://dev.mysql.com/doc/refman/8.0/en/innodb-parameters.html#sysvar_innodb_log_spin_cpu_abs_lwm">innodb_log_spin_cpu_abs_lwm</a></li><li><a href="https://dev.mysql.com/doc/refman/8.0/en/innodb-parameters.html#sysvar_innodb_log_spin_cpu_pct_hwm">innodb_log_spin_cpu_pct_hwm</a></li></ul></div><div><b>Benchmark</b></div><div><br />I share results from 3 servers:</div><div><ul style="text-align: left;"><li>8-core</li><ul><li>8 CPU cores, 16G RAM, XFS, 1 m.2 device, Ubuntu 22.04</li><li>benchmark uses 1 client</li></ul><li>32-core</li><ul><li>32 cores, hyperthreads off, 128G RAM, XFS with SW RAID 0 over 2 m.2 devices, Ubuntu 22.04</li><li>fsync latency is not great on this host, maybe ~5 milliseconds</li><li>benchmark uses 12 clients</li></ul><li>40-core</li><ul><li>40 cores, 80 HW threads, hyperthreads on, 256G RAM, XFS with SW RAID 0 over 4 SSDs</li><li>fsync latency is much better than on the 32-core host, maybe <= 200 microseconds</li><li>benchmark uses 16, 24, 32 and 40 clients</li></ul></ul></div><div>I used the <a href="https://smalldatum.blogspot.com/2023/05/updates-to-insert-benchmark.html">Insert Benchmark</a> with a cached database. With X clients there were X tables and a client per table. I focus on the first three benchmark steps, which are write-heavy. The spreadsheet with all results <a href="https://docs.google.com/spreadsheets/d/1eg6VMQ7uZLmQAJdAPGZkkXbEoJVioWgg0dkcbTa6PmM/edit?usp=sharing">is here</a>. The benchmark steps are:</div><div><ul style="text-align: left;"><li>l.i0</li><ul><li>does the initial load in PK order without secondary indexes and 1 connection/client.
This inserts 20M rows/table.</li><li>each commit inserts 100 rows for big transactions or 10 rows for small transactions. Inserts are in key order so this only makes a few pages dirty. And there are no secondary indexes.</li></ul><li>l.x</li><ul><li>creates 3 secondary indexes per table. There is 1 connection/client.</li></ul><li>l.i1</li><ul><li>does random inserts matched by random deletes. There are 2 connections/client -- one for inserts, one for deletes. This step is the most likely to make the CPU oversubscribed.</li><li>each commit inserts 50 rows for big transactions or 5 rows for small transactions. For each row there are also 3 secondary indexes to maintain, which increases the amount of redo per commit. Inserts are in PK order but not in order for any of the secondary indexes so these make more pages dirty compared to l.i0.</li></ul></ul></div><div>The benchmark was repeated in 2 configurations -- for innodb_log_writer_threads =ON and =OFF. There are 2 files per server -- one with innodb_log_writer_threads =ON and one with it =OFF. Both have sync_binlog=1 and innodb_flush_log_at_trx_commit=1. The my.cnf files <a href="https://github.com/mdcallag/mytools/tree/master/bench/conf/arc/jan24.lwt">are here</a>. I did not tune the 3 innodb_log_writer_threads options.</div><div><br /></div><div>The benchmark was repeated for two workload types -- big and small transactions. For big transactions I used the Insert Benchmark as-is so that the rows/commit is 100 for l.i0 and 50 for l.i1. For small transactions I reduced that to 10 for l.i0 and 5 for l.i1. </div><div><br /></div><div><div><b>Results</b></div></div><div><b><br /></b></div><div><div>Throughput in the charts below measures the following:</div><div><ul><li>l.i0 - inserts/second</li><li>l.x - indexed rows/second</li><li>l.i1 - inserts/second</li></ul></div></div><div>These charts show the throughput for MySQL with innodb_log_writer_threads =OFF relative to =ON.
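The plotted values are simple per-step ratios. As a sketch, they can be computed like this -- note that the throughput numbers below are hypothetical placeholders, not measured results:

```python
# Sketch: compute throughput with innodb_log_writer_threads=OFF relative to =ON.
# The numbers are hypothetical placeholders; real values come from the
# benchmark output for each step (l.i0, l.x, l.i1).
qps_on = {"l.i0": 95000, "l.x": 260000, "l.i1": 10000}
qps_off = {"l.i0": 100000, "l.x": 250000, "l.i1": 30000}

relative = {step: qps_off[step] / qps_on[step] for step in qps_off}
for step, ratio in sorted(relative.items()):
    # ratio > 1.0 means MySQL was faster with =OFF for this step
    print(f"{step}: {ratio:.2f}")
```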
A value greater than 1 means that MySQL is faster with =OFF. Below I use <i>LWT</i> in place of <i>innodb_log_writer_threads</i>.</div><div><br /></div><div>For the 40-core server</div><div><ul style="text-align: left;"><li>l.i0 - throughput is always (slightly) better with LWT=OFF</li><li>l.x - throughput is always (slightly) better with LWT=ON</li><li>l.i1 - results are mixed.</li><ul><li>The best case for LWT=ON is with 40 clients and big transactions. TODO was CPU saturated? Note that LWT=ON does better relative to LWT=OFF as the concurrency increases.</li><li>The best cases for LWT=OFF are with lower concurrency levels.</li></ul></ul></div><div class="separator" style="clear: both; text-align: left;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEikLWjLceinnvRT5yw_wRNJJqVwAEA4lXlqOa1akeVBYBnFyjDJopR6XPynYWjYOYJ-oYVrOAb1okHgbU8euJJITDnFRwdcOXAvdBzH7ohLiboPrAHx6amwWGC2z2P9n5AlfKhCJhEETB2vjg9s3rX-2F5IPDzcjUR3BeYZG6xE1BKjFIZjy2exKUpAGVXb/s600/log_writer_threads=OFF%20_%20=ON,%2040-core%20server.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEikLWjLceinnvRT5yw_wRNJJqVwAEA4lXlqOa1akeVBYBnFyjDJopR6XPynYWjYOYJ-oYVrOAb1okHgbU8euJJITDnFRwdcOXAvdBzH7ohLiboPrAHx6amwWGC2z2P9n5AlfKhCJhEETB2vjg9s3rX-2F5IPDzcjUR3BeYZG6xE1BKjFIZjy2exKUpAGVXb/w640-h396/log_writer_threads=OFF%20_%20=ON,%2040-core%20server.png" width="640" /></a>For the 32-core server</div><div class="separator" style="clear: both; text-align: left;"><ul style="text-align: left;"><li>I had to use log scale because the differences were huge for l.i1</li><li>Fsync latency on this host might be ~5 milliseconds which is large</li><li>LWT=OFF is up to ~5X faster for l.i0 and up to ~100X for l.i1 relative to LWT=ON</li><li>I try to explain the performance differences in the sections that follow</li></ul></div><div class="separator" style="clear: both; 
text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEg8blvwYsTYtBZpdrZqIkaKwhZuNeFLz7h6SGHshDPhEZ9xgfJSxiySEqkHEwLg2LAhfqOPooTFL4l1-i2oarwXEGLAz8OmOmwPYPhdW7PmadeV-5hfsUWseal4oTBdmFPbeK7QQx4JIz33aL6KgyiphhKk914Zj9ki-ssiEIOBF28XwZ6W5F-ebS29YFPH/s600/log_writer_threads=OFF%20_%20=ON,%2032-core%20server.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEg8blvwYsTYtBZpdrZqIkaKwhZuNeFLz7h6SGHshDPhEZ9xgfJSxiySEqkHEwLg2LAhfqOPooTFL4l1-i2oarwXEGLAz8OmOmwPYPhdW7PmadeV-5hfsUWseal4oTBdmFPbeK7QQx4JIz33aL6KgyiphhKk914Zj9ki-ssiEIOBF28XwZ6W5F-ebS29YFPH/w640-h396/log_writer_threads=OFF%20_%20=ON,%2032-core%20server.png" width="640" /></a></div><div>For the 8-core server</div><div><ul style="text-align: left;"><li>LWT=OFF is always faster than =ON, up to 3X faster</li></ul></div><div class="separator" style="clear: both; text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiqviJc9IHHk6jQ23fOHICWWiqOxQ1w5uW9IUs_-NNKkgskgyM0szWvHnePzZcjN_vpzhZNxQ18eVfGHneMfFkcJL0PmPOj1mLSe8mcMoIBM53Xet4AoM6GlKANLcgQPGYBSGJCles6AtE-vNc-8JpQ8Pnkk4RHnQp12XMQ8iF82GA0tBdNtD2luYqqG6ZH/s600/log_writer_threads=OFF%20_%20=ON,%208-core%20server.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="371" data-original-width="600" height="396" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiqviJc9IHHk6jQ23fOHICWWiqOxQ1w5uW9IUs_-NNKkgskgyM0szWvHnePzZcjN_vpzhZNxQ18eVfGHneMfFkcJL0PmPOj1mLSe8mcMoIBM53Xet4AoM6GlKANLcgQPGYBSGJCles6AtE-vNc-8JpQ8Pnkk4RHnQp12XMQ8iF82GA0tBdNtD2luYqqG6ZH/w640-h396/log_writer_threads=OFF%20_%20=ON,%208-core%20server.png" width="640" /></a></div><div style="font-weight: bold;"><b>The big problem</b></div><div style="font-weight: bold;"><b><br /></b></div><div>The big problem is that with innodb_log_writer_threads =ON the number 
of fsyncs per commit is between 3X and 200X larger vs =OFF. The extra details about iostat, vmstat and the fsync frequency (via the OS fsyncs counters) are here <a href="https://gist.github.com/mdcallag/9df131055d380dfdc98b88f5474ba782">for l.i0</a> and <a href="https://gist.github.com/mdcallag/aedac8b00b12d910754aa83490ebade7">for l.i1</a>.</div><div><br /></div><div>My helper scripts archive the output from SHOW ENGINE INNODB STATUS at the end of each benchmark step and from that I grep the line with <i>OS fsyncs.</i> The l.i0 and l.i1 benchmark steps do the same number of inserts for LWT =ON and =OFF so I just compute the ratio of (fsyncs with =ON) / (fsyncs with =OFF) and the results are much worse than I expected. I didn't try to change the 3 options related to the LWT feature, other than innodb_log_writer_threads=OFF.<br /><br />The table below lists the fsync ratio, which is:</div><div><blockquote>(fsyncs with innodb_log_writer_threads =ON) / (fsyncs with it =OFF)</blockquote></div><div><br /></div><div><google-sheets-html-origin><table border="1" cellpadding="0" cellspacing="0" data-sheets-root="1" dir="ltr" style="border-collapse: collapse; border: none; font-family: Arial; font-size: 10pt; table-layout: fixed; width: 0px;" xmlns="http://www.w3.org/1999/xhtml"><colgroup><col width="100"></col><col width="100"></col><col width="100"></col><col width="100"></col><col width="119"></col></colgroup><tbody><tr style="height: 21px;"><td data-sheets-value="{&quot;1&quot;:2,&quot;2&quot;:&quot;Server&quot;}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">Server</td><td data-sheets-value="{&quot;1&quot;:2,&quot;2&quot;:&quot;# clients&quot;}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;"># clients</td><td data-sheets-value="{&quot;1&quot;:2,&quot;2&quot;:&quot;transaction size&quot;}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">transaction size</td><td
data-sheets-value="{"1":2,"2":"step"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">step</td><td data-sheets-value="{"1":2,"2":"fsync ratio"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">fsync ratio</td></tr><tr style="height: 21px;"><td data-sheets-value="{"1":2,"2":"40-core"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">40-core</td><td data-sheets-value="{"1":3,"3":24}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; text-align: right; vertical-align: bottom;">24</td><td data-sheets-value="{"1":2,"2":"small"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">small</td><td data-sheets-value="{"1":2,"2":"l.i0"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">l.i0</td><td data-sheets-value="{"1":2,"2":"~3.5"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">~3.5</td></tr><tr style="height: 21px;"><td data-sheets-value="{"1":2,"2":"40-core"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">40-core</td><td data-sheets-value="{"1":3,"3":24}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; text-align: right; vertical-align: bottom;">24</td><td data-sheets-value="{"1":2,"2":"big"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">big</td><td data-sheets-value="{"1":2,"2":"l.i0"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">l.i0</td><td data-sheets-value="{"1":2,"2":"~3.2"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: 
bottom;">~3.2</td></tr><tr style="height: 21px;"><td data-sheets-value="{"1":2,"2":"40-core"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">40-core</td><td data-sheets-value="{"1":3,"3":40}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; text-align: right; vertical-align: bottom;">40</td><td data-sheets-value="{"1":2,"2":"small"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">small</td><td data-sheets-value="{"1":2,"2":"l.i0"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">l.i0</td><td data-sheets-value="{"1":2,"2":"~4.2"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">~4.2</td></tr><tr style="height: 21px;"><td data-sheets-value="{"1":2,"2":"40-core"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">40-core</td><td data-sheets-value="{"1":3,"3":40}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; text-align: right; vertical-align: bottom;">40</td><td data-sheets-value="{"1":2,"2":"big"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">big</td><td data-sheets-value="{"1":2,"2":"l.i0"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">l.i0</td><td data-sheets-value="{"1":2,"2":"~5"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">~5</td></tr><tr style="height: 21px;"><td style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;"></td><td style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;"></td><td style="border: 1px solid rgb(204, 204, 
204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;"></td><td style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;"></td><td style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;"></td></tr><tr style="height: 21px;"><td data-sheets-value="{&quot;1&quot;:2,&quot;2&quot;:&quot;32-core&quot;}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">32-core</td><td data-sheets-value="{&quot;1&quot;:3,&quot;3&quot;:12}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; text-align: right; vertical-align: bottom;">12</td><td data-sheets-value="{&quot;1&quot;:2,&quot;2&quot;:&quot;small&quot;}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">small</td><td data-sheets-value="{&quot;1&quot;:2,&quot;2&quot;:&quot;l.i1&quot;}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">l.i1</td><td data-sheets-value="{&quot;1&quot;:2,&quot;2&quot;:&quot;~18&quot;}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">~18</td></tr><tr style="height: 21px;"><td data-sheets-value="{&quot;1&quot;:2,&quot;2&quot;:&quot;32-core&quot;}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">32-core</td><td data-sheets-value="{&quot;1&quot;:3,&quot;3&quot;:12}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; text-align: right; vertical-align: bottom;">12</td><td data-sheets-value="{&quot;1&quot;:2,&quot;2&quot;:&quot;big&quot;}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">big</td><td data-sheets-value="{&quot;1&quot;:2,&quot;2&quot;:&quot;l.i1&quot;}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">l.i1</td><td data-sheets-value="{&quot;1&quot;:2,&quot;2&quot;:&quot;~200&quot;}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">~200</td></tr><tr
style="height: 21px;"><td data-sheets-value="{"1":2,"2":"40-core"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">40-core</td><td data-sheets-value="{"1":3,"3":24}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; text-align: right; vertical-align: bottom;">24</td><td data-sheets-value="{"1":2,"2":"small"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">small</td><td data-sheets-value="{"1":2,"2":"l.i1"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">l.i1</td><td data-sheets-value="{"1":2,"2":"~4.6"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">~4.6</td></tr><tr style="height: 21px;"><td data-sheets-value="{"1":2,"2":"40-core"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">40-core</td><td data-sheets-value="{"1":3,"3":24}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; text-align: right; vertical-align: bottom;">24</td><td data-sheets-value="{"1":2,"2":"big"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">big</td><td data-sheets-value="{"1":2,"2":"l.i1"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">l.i1</td><td data-sheets-value="{"1":2,"2":"~7.5"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">~7.5</td></tr><tr style="height: 21px;"><td data-sheets-value="{"1":2,"2":"40-core"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">40-core</td><td data-sheets-value="{"1":3,"3":40}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; text-align: 
right; vertical-align: bottom;">40</td><td data-sheets-value="{"1":2,"2":"small"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">small</td><td data-sheets-value="{"1":2,"2":"l.i1"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">l.i1</td><td data-sheets-value="{"1":2,"2":"~5.8"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">~5.8</td></tr><tr style="height: 21px;"><td data-sheets-value="{"1":2,"2":"40-core"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">40-core</td><td data-sheets-value="{"1":3,"3":40}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; text-align: right; vertical-align: bottom;">40</td><td data-sheets-value="{"1":2,"2":"big"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">big</td><td data-sheets-value="{"1":2,"2":"l.i1"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">l.i1</td><td data-sheets-value="{"1":2,"2":"~12.3"}" style="border: 1px solid rgb(204, 204, 204); overflow: hidden; padding: 2px 3px; vertical-align: bottom;">~12.3</td></tr></tbody></table></google-sheets-html-origin></div><div><br /></div><div style="font-weight: bold;"><b><br /></b></div><b>Explaining: 40-core server</b><div><br /></div><div>Details from iostat and vmstat are here <a href="https://gist.github.com/mdcallag/9df131055d380dfdc98b88f5474ba782#file-i0-txt-L69">for l.i0</a> and <a href="https://gist.github.com/mdcallag/aedac8b00b12d910754aa83490ebade7#file-i1-txt-L88">for l.i1</a></div><div><ul><li>context switches/operation (cs/q) are larger with LWT=ON</li><li>CPU/operation (cpu/q) is larger with LWT=ON</li></ul></div><div><b>Explaining: 32-core server</b></div><div><br 
/></div><div>Details from iostat and vmstat are here <a href="https://gist.github.com/mdcallag/9df131055d380dfdc98b88f5474ba782#file-i0-txt-L29">for l.i0</a> and <a href="https://gist.github.com/mdcallag/aedac8b00b12d910754aa83490ebade7#file-i1-txt-L30">for l.i1</a></div><div><ul style="text-align: left;"><li>context switches/operation (cs/q) are much larger with LWT=ON</li><li>CPU/operation (cpu/q) is much larger with LWT=ON</li></ul></div><div><b>Explaining: 8-core server</b></div><div><br /></div><div>Details from iostat and vmstat are here <a href="https://gist.github.com/mdcallag/9df131055d380dfdc98b88f5474ba782#file-i0-txt-L2">for l.i0</a> and <a href="https://gist.github.com/mdcallag/aedac8b00b12d910754aa83490ebade7#file-i1-txt-L2">for l.i1</a></div><div><ul style="text-align: left;"><li>context switches/operation (cs/q) are much larger with LWT=ON</li><li>CPU/operation (cpu/q) is much larger with LWT=ON</li></ul><div class="separator" style="clear: both; text-align: left;"><b><br /></b></div><div class="separator" style="clear: both; text-align: left;"><br /></div><p></p></div>Mark Callaghanhttp://www.blogger.com/profile/09590445221922043181noreply@blogger.com0tag:blogger.com,1999:blog-9149523927864751087.post-26728051073142874512024-01-02T14:47:00.000-08:002024-01-12T09:27:24.893-08:00Updated Insert benchmark: MyRocks 5.6 and 8.0, small server, cached database<p>This has results for the <a href="https://smalldatum.blogspot.com/2023/12/updates-for-insert-benchmark-december.html">Insert Benchmark</a> using MyRocks 5.6 and 8.0 using a small server and cached workload. A recent writeup from the same benchmark using a medium server <a href="https://smalldatum.blogspot.com/2023/08/checking-myrocks-56-for-regressions_10.html">is here</a>.</p>For old MyRocks 5.6.35 vs latest 5.6.35<br /><ul><li>There might be large regressions for the range query tests (qr*). These might also be noise. I have more work in progress to figure that out. 
I don't see such a large regression <a href="https://smalldatum.blogspot.com/2023/08/checking-myrocks-56-for-regressions_10.html">on a medium server</a>.</li></ul><div>For latest MyRocks 5.6.35 vs latest MyRocks 8.0.32</div><div><ul style="text-align: left;"><li>There might be large regressions for the range query tests (qr*). These might also be noise. I have more work in progress to figure that out. I don't see such a large regression <a href="https://smalldatum.blogspot.com/2023/08/checking-myrocks-56-for-regressions_10.html">on a medium server</a>.</li></ul></div><div><div><b>Build + Configuration</b><br /></div><p></p><p></p><p></p><div></div><p></p><div><div>I tested MyRocks 5.6.35, 8.0.28 and 8.0.32 using the latest code as of December 2023. I also repeated tests for older builds for MyRocks 5.6. These were compiled from source. All builds use CMAKE_BUILD_TYPE =Release.</div><div><br /></div><div>MyRocks 5.6.35</div><div><ul style="text-align: left;"><li>fbmy5635_rel_221222</li><ul><li>compiled with gcc 11.4.0 from git hash 4f3a57a1, RocksDB 8.7.0 at git hash 29005f0b</li></ul><li>fbmy5635_rel_clang14_221222</li><ul><li>compiled with clang 14.0.0 from git hash 4f3a57a1, RocksDB 8.7.0 at git hash 29005f0b</li></ul><li>fbmy5635_rel_clang15_221222</li><ul><li>compiled with clang 15.0.7 from git hash 4f3a57a1, RocksDB 8.7.0 at git hash 29005f0b</li></ul></ul>MyRocks 8.0.28<br /><ul style="text-align: left;"><li>fbmy8028_rel_221222</li><ul><li>compiled with gcc 11.4.0 from git hash 2ad105fc, RocksDB 8.7.0 at git hash 29005f0b</li></ul><li>fbmy8028_rel_clang14_221222</li><ul><li>compiled with clang 14.0.0 from git hash 2ad105fc, RocksDB 8.7.0 at git hash 29005f0b</li></ul><li>fbmy8028_rel_clang15_221222</li><ul><li>compiled with clang 15.0.7 from git hash 2ad105fc, RocksDB 8.7.0 at git hash 29005f0b</li></ul></ul>MyRocks 8.0.32<br /><ul style="text-align: left;"><li>fbmy8032_rel_221222</li><ul><li>compiled with gcc 11.4.0 from git hash 76707b44, RocksDB 8.7.0 at 
git hash 29005f0b</li></ul><li>fbmy8032_rel_clang14_221222</li><ul><li>compiled with clang 14.0.0 from git hash 76707b44, RocksDB 8.7.0 at git hash 29005f0b</li></ul><li>fbmy8032_rel_clang15_221222</li><ul><li>compiled with clang 15.0.7 from git hash 76707b44, RocksDB 8.7.0 at git hash 29005f0b</li></ul></ul><div>The older MyRocks 5.6 builds are</div><div></div></div><div><ul><li>fbmy5635_rel_202104072149</li><ul><li>compiled from code as of 2021-04-07 at git hash f896415f with RocksDB 6.19.0</li></ul><li>fbmy5635_rel_202203072101</li><ul><li>compiled from code as of 2022-03-07 at git hash e7d976ee with RocksDB 6.28.2</li></ul><li>fbmy5635_rel_202205192101</li><ul><li>compiled from code as of 2022-05-19 at git hash d503bd77 with RocksDB 7.2.2</li></ul><li>fbmy5635_rel_202208092101</li><ul><li>compiled from code as of 2022-08-09 at git hash 877a0e58 with RocksDB 7.3.1</li></ul><li>fbmy5635_rel_202210112144</li><ul><li>compiled from code as of 2022-10-11 at git hash c691c716 with RocksDB 7.3.1</li></ul><li>fbmy5635_rel_202302162102</li><ul><li>compiled from code as of 2023-02-16 at git hash 21a2b0aa with RocksDB 7.10.0</li></ul><li>fbmy5635_rel_202304122154</li><ul><li>compiled from code as of 2023-04-12 at git hash 205c31dd with RocksDB 7.10.2</li></ul><li>fbmy5635_rel_202305292102</li><ul><li>compiled from code as of 2023-05-29 at git hash b739eac1 with RocksDB 8.2.1</li></ul><li>fbmy5635_rel_20230529_832</li><ul><li>compiled from code as of 2023-05-29 at git hash b739eac1 with RocksDB 8.3.2</li></ul><li>fbmy5635_rel_20230529_843</li><ul><li>compiled from code as of 2023-05-29 at git hash b739eac1 with RocksDB 8.4.3</li></ul><li>fbmy5635_rel_20230529_850</li><ul><li>compiled from code as of 2023-05-29 at git hash b739eac1 with RocksDB 8.5.0</li></ul></ul></div><div>Most tests used the cza1_bee my.cnf files that are here <a href="https://github.com/mdcallag/mytools/blob/master/bench/conf/nuc8i7.ub1804/fbmy56/etc/my.cnf.cza1_bee">for 5.6.35</a> and <a 
href="https://github.com/mdcallag/mytools/blob/master/bench/conf/nuc8i7.ub1804/fbmy80/etc/my.cnf.cza1_bee">for 8.0</a>. Some 8.0 tests used the cza1ps0_bee my.cnf file that disables the perf schema; it <a href="https://github.com/mdcallag/mytools/blob/master/bench/conf/nuc8i7.ub1804/fbmy80/etc/my.cnf.cza1ps0_bee">is here</a>.</div></div></div><div><br /></div><div><div><b>Benchmark</b></div><div> </div><div>The test server is a Beelink SER 4700u with 8 cores, 16G RAM, Ubuntu 22.04, XFS and 1 m.2 device. The benchmark is run with 1 client to avoid over-subscribing the CPU.</div><div><br /></div><div>I used the updated Insert Benchmark so there are more benchmark steps described below. In order, the benchmark steps are:</div><p></p><div><ul><li>l.i0</li><ul><li>insert 20 million rows per table in PK order. The table has a PK index but no secondary indexes. There is one connection per client.</li></ul><li>l.x</li><ul><li>create 3 secondary indexes per table. There is one connection per client.</li></ul><li>l.i1</li><ul><li>use 2 connections/client. One inserts 50M rows and the other does deletes at the same rate as the inserts. Each transaction modifies 50 rows (big transactions). This step is run for a fixed number of inserts, so the run time varies depending on the insert rate.</li></ul><li>l.i2</li><ul><li>like l.i1 but each transaction modifies 5 rows (small transactions).</li></ul><li>qr100</li><ul><li>use 3 connections/client. One does range queries for 1800 seconds and performance is reported for this. The second does 100 inserts/s and the third does 100 deletes/s. The second and third are less busy than the first. The range queries use covering secondary indexes. This step is run for a fixed amount of time. If the target insert rate is not sustained then that is considered to be an SLA failure.
If the target insert rate is sustained then the step does the same number of inserts for all systems tested.</li></ul><li>qp100</li><ul><li>like qr100 except uses point queries on the PK index</li></ul><li>qr500</li><ul><li>like qr100 but the insert and delete rates are increased from 100/s to 500/s</li></ul><li>qp500</li><ul><li>like qp100 but the insert and delete rates are increased from 100/s to 500/s</li></ul><li>qr1000</li><ul><li>like qr100 but the insert and delete rates are increased from 100/s to 1000/s</li></ul><li>qp1000</li><ul><li>like qp100 but the insert and delete rates are increased from 100/s to 1000/s</li></ul></ul><div><div><b>Results</b></div><div><br /></div><div>The performance reports are here for</div><div><ul><li><a href="https://mdcallag.github.io/reports/24_01_01.1u.1tno.bee.cached.fbmy56/all.html">MyRocks 5.6.35</a></li><li><a href="https://mdcallag.github.io/reports/24_01_01.1u.1tno.bee.cached.fbmy8028/all.html">MyRocks 8.0.28</a></li><li><a href="https://mdcallag.github.io/reports/24_01_01.1u.1tno.bee.cached.fbmy8032/all.html">MyRocks 8.0.32</a></li><li><a href="https://mdcallag.github.io/reports/24_01_01.1u.1tno.bee.cached.fbmy80/all.html">MyRocks 8.0</a></li><li><a href="https://mdcallag.github.io/reports/24_01_01.1u.1tno.bee.cached.fbmyall/all.html">MyRocks 5.6 and 8.0</a></li></ul></div><div>The summary has 3 tables. The first shows absolute throughput for each DBMS tested and benchmark step. The second has throughput relative to the version on the first row of the table. The third shows the background insert rate for benchmark steps that have background inserts; all systems sustained the target rates. The second table makes it easy to see how performance changes over time.</div><div><br /></div><div>Below I use relative QPS to explain how performance changes. It is: (QPS for $me / QPS for $base) where $me is my version and $base is the version of the base case. When relative QPS is > 1.0 then performance improved over time.
When it is < 1.0 then there are regressions. The Q in relative QPS measures: </div><div><ul><li>insert/s for l.i0, l.i1, l.i2</li><li>indexed rows/s for l.x</li><li>range queries/s for qr100, qr500, qr1000</li><li>point queries/s for qp100, qp500, qp1000</li></ul><div>From <a href="https://mdcallag.github.io/reports/24_01_01.1u.1tno.bee.cached.fbmy56/all.html#summary">the summary</a> for 5.6.35</div></div></div></div></div><div><ul style="text-align: left;"><li>On l.x (index create) the clang 14/15 builds are slower, probably because there is a <a href="https://github.com/llvm/llvm-project/issues/55153">codegen perf bug</a> in clang that is fixed in more recent releases.</li><li>Not much changes for most benchmark steps, except for the qr* steps that do range queries. I don't know yet whether this is a real regression or noise.</li><li>Throughput in fbmy5635_rel_221222 relative to fbmy5635_rel_202104072149</li><ul><li>l.i0 - relative QPS is <span style="background-color: #f4cccc;">0.96</span></li><li>l.x - relative QPS is <span style="background-color: #f4cccc;">0.97</span></li><li>l.i1, l.i2 - relative QPS is <span style="background-color: #f4cccc;">0.98</span>, <span style="background-color: #f4cccc;">0.96</span></li><li>qr100, qr500, qr1000 - relative QPS is <span style="background-color: #f4cccc;">0.59</span>, <span style="background-color: #f4cccc;">0.53</span>, <span style="background-color: #f4cccc;">0.51</span> </li><li>qp100, qp500, qp1000 - relative QPS is <span style="background-color: #f4cccc;">0.96</span>, <span style="background-color: #f4cccc;">0.99</span><span style="background-color: white;">, </span><span style="background-color: #f4cccc;">0.99</span></li></ul></ul><div><div>From <a href="https://mdcallag.github.io/reports/24_01_01.1u.1tno.bee.cached.fbmy8028/all.html#summary">the summary</a> for 8.0.28</div><div><ul><li>On l.x (index create) the clang 14/15 builds are slower, probably because there is a <a 
href="https://github.com/llvm/llvm-project/issues/55153">codegen perf bug</a> in clang that is fixed in more recent releases.</li><li>Results are mixed from the cza1ps0_bee my.cnf that disables the perf schema</li></ul><div><div>From <a href="https://mdcallag.github.io/reports/24_01_01.1u.1tno.bee.cached.fbmy8032/all.html#summary">the summary</a> for 8.0.32</div><div><ul><li>On l.x (index create) the clang 14/15 builds are slower, probably because there is a <a href="https://github.com/llvm/llvm-project/issues/55153">codegen perf bug</a> in clang that is fixed in more recent releases.</li><li>Results are good from the cza1ps0_bee my.cnf that disables the perf schema</li></ul><div><div>From <a href="https://mdcallag.github.io/reports/24_01_01.1u.1tno.bee.cached.fbmy80/all.html#summary">the summary</a> for 8.0</div><div><ul style="text-align: left;"><li>I need to figure out whether the differences in the qr* steps that do range queries are noise or regressions. I suspect this is noise.</li><li>Throughput in fbmy8032_rel_221222 relative to fbmy8028_rel_221222</li><ul><li>l.i0 - relative QPS is <span style="background-color: #f4cccc;">0.95</span></li><li>l.x - relative QPS is <span style="background-color: #f4cccc;">0.99</span></li><li>l.i1, l.i2 - relative QPS is <span style="background-color: #f4cccc;">0.97</span>, <span style="background-color: #f4cccc;">0.97</span></li><li>qr100, qr500, qr1000 - relative QPS is <span style="background-color: #f4cccc;">0.80</span>, <span style="background-color: #f4cccc;">0.91</span>, <span style="background-color: #d9ead3;">1.16</span> </li><li>qp100, qp500, qp1000 - relative QPS is <span style="background-color: #f4cccc;">0.95</span>, <span style="background-color: #f4cccc;">0.96</span><span style="background-color: white;">, </span><span style="background-color: #f4cccc;">0.99</span></li></ul></ul><div><div>From <a href="https://mdcallag.github.io/reports/24_01_01.1u.1tno.bee.cached.fbmyall/all.html#summary">the summary</a> for 
5.6 and 8.0</div><div><ul><li>Throughput in fbmy8032_rel_221222 relative to fbmy5635_rel_221222</li><ul><li>l.i0 - relative QPS is <span style="background-color: #f4cccc;">0.66</span></li><li>l.x - relative QPS is <span style="background-color: #f4cccc;">0.86</span></li><li>l.i1, l.i2 - relative QPS is <span style="background-color: #f4cccc;">0.80</span>, <span style="background-color: #f4cccc;">0.78</span></li><li>qr100, qr500, qr1000 - relative QPS is <span style="background-color: #f4cccc;">0.59</span>, <span style="background-color: #f4cccc;">0.48</span>, <span style="background-color: #f4cccc;">0.51</span> </li><li>qp100, qp500, qp1000 - relative QPS is <span style="background-color: #f4cccc;">0.84</span>, <span style="background-color: #f4cccc;">0.87</span><span style="background-color: white;">, </span><span style="background-color: #f4cccc;">0.89</span></li></ul></ul></div></div></div></div></div></div></div></div></div><div><br /></div><div><br /></div><div><br /></div><div><br /></div><div><br /></div><div><br /></div><div><br /></div><div><br /></div><div><br /></div><div><br /></div><div><br /></div><div><br /></div><div><br /></div><div><br /></div><div><br /></div><div><br /></div><div><br /></div>Mark Callaghanhttp://www.blogger.com/profile/09590445221922043181noreply@blogger.com0tag:blogger.com,1999:blog-9149523927864751087.post-28056286771438527452024-01-02T10:41:00.000-08:002024-01-02T14:44:01.389-08:00Updated Insert benchmark: Postgres 9.x to 16.x, small server, cached database<p>This has results for the <a href="https://smalldatum.blogspot.com/2023/12/updates-for-insert-benchmark-december.html">Insert Benchmark</a> using Postgres versions 9.x through 16.x using a small server and cached workload. The benchmark code has been updated since my <a href="https://smalldatum.blogspot.com/2023/09/postgres-160-vs-insert-benchmark-on.html">last blog post</a> for PG vs the Insert Benchmark on small servers. 
I also included results for the latest point releases from Postgres versions 9.0, 9.1, 9.2, 9.3, 9.4, 9.5, 9.6 and 10. Because time is finite, I didn't include results from these versions in <a href="https://smalldatum.blogspot.com/2023/10/postgres-vs-mysql-impact-of-cpu.html">my post</a> about CPU performance regressions.</p><p>tl;dr</p><p></p><ul style="text-align: left;"><li>Comparing Postgres 16.1 to 9.0.23, all benchmark steps are faster in 16.1 except for point queries, which are ~2% slower on one small server and ~10% slower on the other. This regression arrived in 9.6 and perf has been stable since then.</li><li>For write-heavy workloads there were regressions in the 9.X releases, but since then perf has been improving with a few exceptions (like PG 13).<br /></li><li>Perf for write-heavy workloads improved a lot starting in Postgres 9.5.</li></ul><div><b>Build + Configuration</b><br /></div><p></p><p></p><div><div>I compiled Postgres from source for each version using <a href="https://github.com/mdcallag/mytools/blob/master/bench/build/mar23/beelink/pg152/mk.pg.def">this script</a>. The config files are linked below for the SER4 server. The configs for SER7 are the same except shared_buffers is increased from 10G to 23G. 
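To be concrete, the intended SER4 vs SER7 difference amounts to a one-line change. This is a hypothetical postgresql.conf fragment, not the actual conf.diff files linked below; only the shared_buffers values come from the text above:

```ini
# Hypothetical fragment; the real tuning is in the linked conf.diff files.
# SER4 (16G RAM):
shared_buffers = 10GB
# SER7 (32G RAM) uses the same config except:
# shared_buffers = 23GB
```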
I tried to make them as similar as possible:</div><div><ul style="text-align: left;"><li>9.0.23 - <a href="https://github.com/mdcallag/mytools/blob/master/bench/conf/nuc8i7.ub1804/pg9/conf.diff.cx9a2_bee.90">config file</a></li><li>9.1.24 - <a href="https://github.com/mdcallag/mytools/blob/master/bench/conf/nuc8i7.ub1804/pg9/conf.diff.cx9a2_bee.90">config file</a></li><li>9.2.24 - <a href="https://github.com/mdcallag/mytools/blob/master/bench/conf/nuc8i7.ub1804/pg9/conf.diff.cx9a2_bee.92">config file</a></li><li>9.3.25 - <a href="https://github.com/mdcallag/mytools/blob/master/bench/conf/nuc8i7.ub1804/pg9/conf.diff.cx9a2_bee.92">config file</a></li><li>9.4.26 - <a href="https://github.com/mdcallag/mytools/blob/master/bench/conf/nuc8i7.ub1804/pg9/conf.diff.cx9a2_bee.94">config file</a></li><li>9.5.25 - <a href="https://github.com/mdcallag/mytools/blob/master/bench/conf/nuc8i7.ub1804/pg9/conf.diff.cx9a2_bee.95">config file</a></li><li>9.6.24 - <a href="https://github.com/mdcallag/mytools/blob/master/bench/conf/nuc8i7.ub1804/pg9/conf.diff.cx9a2_bee.96">config file</a></li><li>10.23 - <a href="https://github.com/mdcallag/mytools/blob/master/bench/conf/nuc8i7.ub1804/pg10/conf.diff.cx9a2_bee">config file</a></li><li>11.22 - <a href="https://github.com/mdcallag/mytools/blob/master/bench/conf/nuc8i7.ub1804/pg11/conf.diff.cx9a2_bee">config file</a></li><li>12.17 - <a href="https://github.com/mdcallag/mytools/blob/master/bench/conf/nuc8i7.ub1804/pg12/conf.diff.cx9a2_bee">config file</a></li><li>13.13 - <a href="https://github.com/mdcallag/mytools/blob/master/bench/conf/nuc8i7.ub1804/pg13/conf.diff.cx9a2_bee">config file</a></li><li>14.10 - <a href="https://github.com/mdcallag/mytools/blob/master/bench/conf/nuc8i7.ub1804/pg14/conf.diff.cx9a2_bee">config file</a></li><li>15.5 - <a href="https://github.com/mdcallag/mytools/blob/master/bench/conf/nuc8i7.ub1804/pg15/conf.diff.cx9a2_bee">config file</a></li><li>16.1 - <a 
href="https://github.com/mdcallag/mytools/blob/master/bench/conf/nuc8i7.ub1804/pg16/conf.diff.cx9a2_bee">config file</a></li></ul></div><div><b>Benchmark</b></div></div><div><br /></div><div>The benchmark was run with 1 client using my old and new small servers.</div><div><ul style="text-align: left;"><li>SER4 - The old small server is a Beelink SER 4700u <a href="https://smalldatum.blogspot.com/2022/10/small-servers-for-performance-testing-v4.html">described here</a> that has 8 cores, hyperthreads disabled, 16G RAM, Ubuntu 22.04 and XFS using an NVMe SSD. </li><li>SER7 - The new small server is a Beelink SER7 7840HS <a href="https://smalldatum.blogspot.com/2022/10/small-servers-for-performance-testing-v4.html">described here</a> that has 8 cores, hyperthreads disabled, 32G RAM, Ubuntu 22.04 and XFS using an NVMe SSD.</li></ul></div><div>I used the updated Insert Benchmark so there are more benchmark steps described below. In order, the benchmark steps are:</div><div><p></p><div><ul><li>l.i0</li><ul><li>insert X million rows per table in PK order. The table has a PK index but no secondary indexes. There is one connection per client. X is 20M for SER4 and 40M for SER7.</li></ul><li>l.x</li><ul><li>create 3 secondary indexes per table. There is one connection per client.</li></ul><li>l.i1</li><ul><li>use 2 connections/client. One inserts 50M rows and the other does deletes at the same rate as the inserts. Each transaction modifies 50 rows (big transactions). This step is run for a fixed number of inserts, so the run time varies depending on the insert rate.</li></ul><li>l.i2</li><ul><li>like l.i1 but each transaction modifies 5 rows (small transactions).</li></ul><li>qr100</li><ul><li>use 3 connections/client. One does range queries for 1800 seconds and performance is reported for this. The second does 100 inserts/s and the third does 100 deletes/s. The second and third are less busy than the first. The range queries use covering secondary indexes. 
This step is run for a fixed amount of time. If the target insert rate is not sustained then that is considered to be an SLA failure. If the target insert rate is sustained then the step does the same number of inserts for all systems tested.</li></ul><li>qp100</li><ul><li>like qr100 except uses point queries on the PK index</li></ul><li>qr500</li><ul><li>like qr100 but the insert and delete rates are increased from 100/s to 500/s</li></ul><li>qp500</li><ul><li>like qp100 but the insert and delete rates are increased from 100/s to 500/s</li></ul><li>qr1000</li><ul><li>like qr100 but the insert and delete rates are increased from 100/s to 1000/s</li></ul><li>qp1000</li><ul><li>like qp100 but the insert and delete rates are increased from 100/s to 1000/s</li></ul></ul></div><p></p><div><b>Results</b></div><div><br /></div><div>The performance report is here <a href="https://mdcallag.github.io/reports/24_01_01.1u.1tno.bee.cached.pg/all.html">for SER4</a> and <a href="https://mdcallag.github.io/reports/24_01_01.1u.1tno.ser7.cached.pg/all.html">for SER7</a>. It has a lot more detail including charts, tables and metrics from iostat and vmstat to help explain the performance differences.</div><div><br /></div><div><div>The summary has 3 tables. The first shows absolute throughput by DBMS tested X benchmark step. The second has throughput relative to the version on the first row of the table. The third shows the background insert rate for benchmark steps with background inserts and all systems sustained the target rates. The second table makes it easy to see how performance changes over time.</div><div><br /></div><div>Below I use relative QPS to explain how performance changes. It is: (QPS for $me / QPS for $base) where $me is my version and $base is the version of the base case. When relative QPS is > 1.0 then performance improved over time. When it is < 1.0 then there are regressions. 
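To make the relative QPS arithmetic concrete, here is a small sketch in Python. The throughput numbers are invented for illustration; the real values come from the linked summaries:

```python
# Relative QPS = (QPS for $me) / (QPS for $base), computed per benchmark step.
# The numbers below are made up for illustration only.
base = {"l.i0": 10000, "qr100": 5000}  # $base: the version on the first row of the summary
me = {"l.i0": 12300, "qr100": 4500}    # $me: the version being compared

relative_qps = {step: me[step] / base[step] for step in base}
# A value > 1.0 means $me improved on $base for that step; < 1.0 is a regression.
```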
The Q in relative QPS measures: </div><div><ul><li>insert/s for l.i0, l.i1, l.i2</li><li>indexed rows/s for l.x</li><li>range queries/s for qr100, qr500, qr1000</li><li>point queries/s for qp100, qp500, qp1000</li></ul><div>From <a href="https://mdcallag.github.io/reports/24_01_01.1u.1tno.bee.cached.pg/all.html#summary">the summary</a> for SER4</div></div></div><div><ul style="text-align: left;"><li>The base case is Postgres 9.0.23</li><li>There are no regressions in Postgres 14, 15 & 16 relative to Postgres 9</li><li>There are regressions for some write-heavy benchmark steps from Postgres 9.0 to 9.6</li><li>Postgres 13.13 isn't great for write-heavy (see l.i2)</li><li>For read-heavy, modern Postgres is better at range queries than at point queries relative to older Postgres</li><li>Throughput per benchmark step in Postgres 16.1 relative to 9.0.23</li><ul><li>l.i0 - relative QPS is <span style="background-color: #d9ead3;">1.23</span></li><li>l.x - relative QPS is <span style="background-color: #d9ead3;">1.71</span></li><li>l.i1, l.i2 - relative QPS is <span style="background-color: #d9ead3;">3.44</span>, <span style="background-color: #d9ead3;">2.18</span></li><li>qr100, qr500, qr1000 - relative QPS is <span style="background-color: #d9ead3;">1.16</span>, <span style="background-color: #d9ead3;">1.21</span>, <span style="background-color: #d9ead3;">1.27</span></li><li>qp100, qp500, qp1000 - relative QPS is <span style="background-color: #d9ead3;">1.11</span>, <span style="background-color: #d9ead3;">1.03</span>, <span style="background-color: #f4cccc;">0.98</span></li></ul></ul></div><div>From <a href="https://mdcallag.github.io/reports/24_01_01.1u.1tno.ser7.cached.pg/all.html#summary">the summary</a> for SER7</div><div><ul><li>The base case is Postgres 9.0.23</li><li>There are small regressions for point queries in Postgres 14, 15 & 16 relative to Postgres 9</li><li>There are regressions for some write-heavy benchmark steps from Postgres 9.0 to 9.6</li><li>Postgres 13.13 
isn't great for write-heavy (see l.i2)</li><li>For read-heavy, modern Postgres is better at range queries than at point queries relative to older Postgres</li><li>Throughput per benchmark step in Postgres 16.1 relative to 9.0.23</li><ul><li>l.i0 - relative QPS is <span style="background-color: #d9ead3;">1.53</span></li><li>l.x - relative QPS is <span style="background-color: #d9ead3;">1.69</span></li><li>l.i1, l.i2 - relative QPS is <span style="background-color: #d9ead3;">4.05</span>, <span style="background-color: #d9ead3;">3.52</span></li><li>qr100, qr500, qr1000 - relative QPS is <span style="background-color: #d9ead3;">1.38</span>, <span style="background-color: #d9ead3;">1.61</span>, <span style="background-color: #d9ead3;">1.52</span></li><li>qp100, qp500, qp1000 - relative QPS is <span style="background-color: #d9ead3;">1.01</span>, <span style="background-color: #f4cccc;">0.93</span>, <span style="background-color: #f4cccc;">0.88</span></li></ul></ul></div></div><div><br /></div>Mark Callaghanhttp://www.blogger.com/profile/09590445221922043181noreply@blogger.com0tag:blogger.com,1999:blog-9149523927864751087.post-59907496103233307922024-01-01T16:54:00.000-08:002024-01-02T14:44:07.335-08:00Updated Insert benchmark: MyRocks 5.6 and 8.0, medium/large server, cached database<p>This has results for the <a href="https://smalldatum.blogspot.com/2023/12/updates-for-insert-benchmark-december.html">Insert Benchmark</a> with MyRocks 5.6.35, 8.0.28 and 8.0.32, a medium/large server and a cached workload. 
</p><p>tl;dr</p><ul style="text-align: left;"><li>For read-heavy benchmark steps disabling the perf schema improves performance by ~5%</li><li>There might be a small regression (~3%) for point queries from 8.0.28 to 8.0.32</li><li>Throughput in MyRocks 8.0.32 relative to 5.6.35 by benchmark step</li><ul><li>l.i0 - MyRocks 8.0.32 is <span style="background-color: #f4cccc;">~16% slower</span></li><li>l.x - MyRocks 8.0.32 is <span style="background-color: #d9ead3;">~3% faster</span></li><li>l.i1, l.i2 - MyRocks 8.0.32 is <span style="background-color: #d9ead3;">3%, 26% faster</span></li><li>range queries - MyRocks 8.0.32 is <span style="background-color: #d9ead3;">~15% faster</span> </li><li>point queries - MyRocks 8.0.32 is <span style="background-color: #f4cccc;">~4% slower</span></li></ul></ul><p><b>Small, medium, medium/large and large</b></p><p>I have been describing my test servers as small, medium and large and now I am using medium/large. What does this mean? I will wave my hand and make up definitions:</p><p></p><ul style="text-align: left;"><li>small - fewer than 10 CPU cores</li><li>medium - fewer than 20 CPU cores</li><li>medium/large - fewer than 30 CPU cores</li><li>large - at least 30 CPU cores</li></ul><p></p><div><b>Build + Configuration</b><br /></div><div><p></p><p></p><p></p><div></div><p></p><div><div>I tested MyRocks 5.6.35, 8.0.28 and 8.0.32 using the latest code as of December 2023. These were compiled from source. 
All builds use CMAKE_BUILD_TYPE=Release.</div><div><br /></div><div>The versions tested were:</div><div><ul><li>MyRocks 5.6.35 (fbmy5635_rel_221222)</li><ul><li>compiled from git hash 4f3a57a1, RocksDB 8.7.0 at git hash 29005f0b</li><li>used the <a href="https://github.com/mdcallag/mytools/blob/master/bench/conf/nuc8i7.ub1804/fbmy56/etc/my.cnf.cza1_c24r64">cza1_c24r64</a> my.cnf file</li></ul><li>MyRocks 8.0.28 (fbmy8028_rel_221222)</li><ul><li>compiled from git hash 2ad105fc, RocksDB 8.7.0 at git hash 29005f0b</li><li>used the <a href="https://github.com/mdcallag/mytools/blob/master/bench/conf/nuc8i7.ub1804/fbmy80/etc/my.cnf.cza1_c24r64">cza1_c24r64</a> and <a href="https://github.com/mdcallag/mytools/blob/master/bench/conf/nuc8i7.ub1804/fbmy80/etc/my.cnf.cza1ps0_c24r64">cza1ps0_c24r64</a> my.cnf files</li></ul><li>MyRocks 8.0.32 (fbmy8032_rel_221222)</li><ul><li>compiled from git hash 76707b44, RocksDB 8.7.0 at git hash 29005f0b</li><li>used the <a href="https://github.com/mdcallag/mytools/blob/master/bench/conf/nuc8i7.ub1804/fbmy80/etc/my.cnf.cza1_c24r64">cza1_c24r64</a> and <a href="https://github.com/mdcallag/mytools/blob/master/bench/conf/nuc8i7.ub1804/fbmy80/etc/my.cnf.cza1ps0_c24r64">cza1ps0_c24r64</a> my.cnf files</li></ul></ul><div>The cza1_c24r64 and cza1ps0_c24r64 my.cnf files differ in one way -- cza1_c24r64 enables the perf schema while cza1ps0_c24r64 disables it.</div></div></div></div><div><br /></div><div><div><b>Benchmark</b></div><div> </div><div>The test server is a SuperMicro SuperWorkstation (Sys-7049A-T) with 2 sockets, 12 cores/socket, hyperthreads disabled, 64G RAM, Ubuntu 22.04 and XFS using a 2TB NVMe m.2 device. The benchmark is run with 12 clients to avoid over-subscribing the CPU. Next time I might use 16.</div><div><br /></div><div>I used the updated Insert Benchmark so there are more benchmark steps described below. In order, the benchmark steps are:</div><p></p><div><ul><li>l.i0</li><ul><li>insert 20 million rows per table in PK order. 
The table has a PK index but no secondary indexes. There is one connection per client.</li></ul><li>l.x</li><ul><li>create 3 secondary indexes per table. There is one connection per client.</li></ul><li>l.i1</li><ul><li>use 2 connections/client. One inserts 50M rows and the other does deletes at the same rate as the inserts. Each transaction modifies 50 rows (big transactions). This step is run for a fixed number of inserts, so the run time varies depending on the insert rate.</li></ul><li>l.i2</li><ul><li>like l.i1 but each transaction modifies 5 rows (small transactions).</li></ul><li>qr100</li><ul><li>use 3 connections/client. One does range queries for 1200 seconds and performance is reported for this. The second does 100 inserts/s and the third does 100 deletes/s. The second and third are less busy than the first. The range queries use covering secondary indexes. This step is run for a fixed amount of time. If the target insert rate is not sustained then that is considered to be an SLA failure. If the target insert rate is sustained then the step does the same number of inserts for all systems tested.</li></ul><li>qp100</li><ul><li>like qr100 except uses point queries on the PK index</li></ul><li>qr500</li><ul><li>like qr100 but the insert and delete rates are increased from 100/s to 500/s</li></ul><li>qp500</li><ul><li>like qp100 but the insert and delete rates are increased from 100/s to 500/s</li></ul><li>qr1000</li><ul><li>like qr100 but the insert and delete rates are increased from 100/s to 1000/s</li></ul><li>qp1000</li><ul><li>like qp100 but the insert and delete rates are increased from 100/s to 1000/s</li></ul></ul><div><div><b>Results</b></div><div><br /></div><div>The performance report <a href="https://mdcallag.github.io/reports/24_01_01.12u.1tno.socket2.cached.fbmy/all.html">is here</a>.</div><div><br /></div><div>The summary has 3 tables. The first shows absolute throughput by DBMS tested X benchmark step. 
The second has throughput relative to the version on the first row of the table. The third shows the background insert rate for benchmark steps with background inserts and all systems sustained the target rates. The second table makes it easy to see how performance changes over time.</div><div><br /></div><div>Below I use relative QPS to explain how performance changes. It is: (QPS for $me / QPS for $base) where $me is my version and $base is the version of the base case. When relative QPS is > 1.0 then performance improved over time. When it is < 1.0 then there are regressions. The Q in relative QPS measures: </div><div><ul><li>insert/s for l.i0, l.i1, l.i2</li><li>indexed rows/s for l.x</li><li>range queries/s for qr100, qr500, qr1000</li><li>point queries/s for qp100, qp500, qp1000</li></ul><div>From <a href="https://mdcallag.github.io/reports/24_01_01.12u.1tno.socket2.cached.fbmy/all.html#summary">the summary</a></div></div><div><ul><li>The base case is fbmy5635_rel_221222</li><li>For the read-heavy benchmark steps disabling the perf schema improves performance by ~5%</li><li>There might be a small regression (~3%) for point queries from 8.0.28 to 8.0.32</li><li>Throughput in fbmy8032_rel_221222 relative to the base case</li><ul><li>l.i0 - relative QPS is <span style="background-color: #f4cccc;">0.84</span></li><li>l.x - relative QPS is <span style="background-color: #d9ead3;">1.03</span></li><li>l.i1, l.i2 - relative QPS is <span style="background-color: #d9ead3;">1.03</span>, <span style="background-color: #d9ead3;">1.26</span></li><li>qr100, qr500, qr1000 - relative QPS is <span style="background-color: #d9ead3;">1.16</span>, <span style="background-color: #d9ead3;">1.13</span>, <span style="background-color: #d9ead3;">1.18</span> </li><li>qp100, qp500, qp1000 - relative QPS is <span style="background-color: #f4cccc;">0.96</span>, <span style="background-color: #f4cccc;">0.96</span>, <span style="background-color: 
#f4cccc;">0.97</span></li></ul></ul></div><div><ul></ul></div></div></div></div><p><br /></p>Mark Callaghanhttp://www.blogger.com/profile/09590445221922043181noreply@blogger.com0tag:blogger.com,1999:blog-9149523927864751087.post-18581953707287892332024-01-01T15:10:00.000-08:002024-01-02T14:43:53.683-08:00Updated Insert benchmark: MyRocks 5.6 and 8.0, medium server, cached database<p>This has results for the <a href="https://smalldatum.blogspot.com/2023/12/updates-for-insert-benchmark-december.html">Insert Benchmark</a> using MyRocks 5.6 and 8.0 on a medium server and cached workload. This is my first report that includes MyRocks 8.0.32.</p>For old MyRocks 5.6.35 vs latest 5.6.35<br /><ul style="text-align: left;"><li>Throughput is similar except for range queries where there might be a small regression of ~7%</li></ul><div>For latest MyRocks 8.0.28 vs latest MyRocks 8.0.32</div><div><ul style="text-align: left;"><li>Throughput is similar but there might be a small regression for point queries of ~5%</li></ul></div><div>For latest MyRocks 5.6.35 vs latest MyRocks 8.0.32</div><div><ul style="text-align: left;"><li>Throughput in 8.0.32 is worse for write-heavy and better for read-heavy</li><li>For write-heavy the difference is <= 3% for l.x, l.i1, l.i2 and ~18% for l.i0</li><li>For read-heavy the difference is between 5% and 9%</li></ul></div><div><b>Build + Configuration</b><br /></div><p></p><p></p><p></p><div></div><p></p><div><div>I tested MyRocks 5.6.35, 8.0.28 and 8.0.32 using the latest code as of December 2023. I also repeated tests for older builds of MyRocks 5.6. These were compiled from source. 
All builds use CMAKE_BUILD_TYPE=Release.</div><div><br /></div><div>For the builds with the latest version of MyRocks I used:</div><div><ul><li>MyRocks 5.6.35 (fbmy5635_rel_221222)</li><ul><li>compiled from git hash 4f3a57a1, RocksDB 8.7.0 at git hash 29005f0b</li></ul><li>MyRocks 8.0.28 (fbmy8028_rel_221222)</li><ul><li>compiled from git hash 2ad105fc, RocksDB 8.7.0 at git hash 29005f0b</li></ul><li>MyRocks 8.0.32 (fbmy8032_rel_221222)</li><ul><li>compiled from git hash 76707b44, RocksDB 8.7.0 at git hash 29005f0b</li></ul></ul><div>The older MyRocks 5.6 builds are:</div><div></div></div><div><ul style="text-align: left;"><li>fbmy5635_rel_202104072149</li><ul><li>compiled from code as of 2021-04-07 at git hash f896415f with RocksDB 6.19.0</li></ul><li>fbmy5635_rel_202203072101</li><ul><li>compiled from code as of 2022-03-07 at git hash e7d976ee with RocksDB 6.28.2</li></ul><li>fbmy5635_rel_202205192101</li><ul><li>compiled from code as of 2022-05-19 at git hash d503bd77 with RocksDB 7.2.2</li></ul><li>fbmy5635_rel_202208092101</li><ul><li>compiled from code as of 2022-08-09 at git hash 877a0e58 with RocksDB 7.3.1</li></ul><li>fbmy5635_rel_202210112144</li><ul><li>compiled from code as of 2022-10-11 at git hash c691c716 with RocksDB 7.3.1</li></ul><li>fbmy5635_rel_202302162102</li><ul><li>compiled from code as of 2023-02-16 at git hash 21a2b0aa with RocksDB 7.10.0</li></ul><li>fbmy5635_rel_202304122154</li><ul><li>compiled from code as of 2023-04-12 at git hash 205c31dd with RocksDB 7.10.2</li></ul><li>fbmy5635_rel_202305292102</li><ul><li>compiled from code as of 2023-05-29 at git hash b739eac1 with RocksDB 8.2.1</li></ul><li>fbmy5635_rel_20230529_832</li><ul><li>compiled from code as of 2023-05-29 at git hash b739eac1 with RocksDB 8.3.2</li></ul><li>fbmy5635_rel_20230529_843</li><ul><li>compiled from code as of 2023-05-29 at git hash b739eac1 with RocksDB 8.4.3</li></ul><li>fbmy5635_rel_20230529_850</li><ul><li>compiled from code as of 2023-05-29 at git hash 
b739eac1 with RocksDB 8.5.0</li></ul></ul></div><div>Most tests used the cza1_gcp_c2s30 my.cnf files that are here <a href="https://github.com/mdcallag/mytools/blob/master/bench/conf/nuc8i7.ub1804/fbmy56/etc/my.cnf.cza1_gcp_c2s30">for 5.6.35</a> and <a href="https://github.com/mdcallag/mytools/blob/master/bench/conf/nuc8i7.ub1804/fbmy80/etc/my.cnf.cza1_gcp_c2s30">for 8.0</a>. Some 8.0 tests used the cza1ps0_gcp_c2s30 my.cnf file that disables the perf schema and <a href="https://github.com/mdcallag/mytools/blob/master/bench/conf/nuc8i7.ub1804/fbmy80/etc/my.cnf.cza1ps0_gcp_c2s30">is here</a>.</div><div><br /></div></div><div><div><b>Benchmark</b></div><div> </div><div>The test server is a c2-standard-30 from GCP with 15 cores, hyperthreads disabled, 128G of RAM, Ubuntu 22.04 and XFS on SW RAID 0 over 4 local SSDs. The benchmark is run with 8 clients to avoid over-subscribing the CPU.</div><div><br /></div><div>I used the updated Insert Benchmark so there are more benchmark steps described below. In order, the benchmark steps are:</div><p></p><div><ul><li>l.i0</li><ul><li>insert 20 million rows per table in PK order. The table has a PK index but no secondary indexes. There is one connection per client.</li></ul><li>l.x</li><ul><li>create 3 secondary indexes per table. There is one connection per client.</li></ul><li>l.i1</li><ul><li>use 2 connections/client. One inserts 50M rows and the other does deletes at the same rate as the inserts. Each transaction modifies 50 rows (big transactions). This step is run for a fixed number of inserts, so the run time varies depending on the insert rate.</li></ul><li>l.i2</li><ul><li>like l.i1 but each transaction modifies 5 rows (small transactions).</li></ul><li>qr100</li><ul><li>use 3 connections/client. One does range queries for 1200 seconds and performance is reported for this. The second does 100 inserts/s and the third does 100 deletes/s. The second and third are less busy than the first. 
The range queries use covering secondary indexes. This step is run for a fixed amount of time. If the target insert rate is not sustained then that is considered to be an SLA failure. If the target insert rate is sustained then the step does the same number of inserts for all systems tested.</li></ul><li>qp100</li><ul><li>like qr100 except uses point queries on the PK index</li></ul><li>qr500</li><ul><li>like qr100 but the insert and delete rates are increased from 100/s to 500/s</li></ul><li>qp500</li><ul><li>like qp100 but the insert and delete rates are increased from 100/s to 500/s</li></ul><li>qr1000</li><ul><li>like qr100 but the insert and delete rates are increased from 100/s to 1000/s</li></ul><li>qp1000</li><ul><li>like qp100 but the insert and delete rates are increased from 100/s to 1000/s</li></ul></ul><div><div><b>Results</b></div><div><br /></div><div>The performance reports are here for</div><div><ul style="text-align: left;"><li><a href="https://mdcallag.github.io/reports/24_01_01.8u.1tno.c2.cached.fbmy56/all.html">MyRocks 5.6</a> </li><li><a href="https://mdcallag.github.io/reports/24_01_01.8u.1tno.c2.cached.fbmy80/all.html">MyRocks 8.0</a></li><li><a href="https://mdcallag.github.io/reports/24_01_01.8u.1tno.c2.cached.fbmy_all/all.html">MyRocks 5.6 & 8.0</a> with many 5.6 versions</li><li><a href="https://mdcallag.github.io/reports/24_01_01.8u.1tno.c2.cached.fbmy_latest/all.html">MyRocks 5.6 & 8.0</a> with the latest versions</li></ul></div><div>The summary has 3 tables. The first shows absolute throughput by DBMS tested X benchmark step. The second has throughput relative to the version on the first row of the table. The third shows the background insert rate for benchmark steps with background inserts and all systems sustained the target rates. The second table makes it easy to see how performance changes over time.</div><div><br /></div><div>Below I use relative QPS to explain how performance changes. 
It is: (QPS for $me / QPS for $base) where $me is my version and $base is the version of the base case. When relative QPS is > 1.0 then performance improved over time. When it is < 1.0 then there are regressions. The Q in relative QPS measures: </div><div><ul><li>insert/s for l.i0, l.i1, l.i2</li><li>indexed rows/s for l.x</li><li>range queries/s for qr100, qr500, qr1000</li><li>point queries/s for qp100, qp500, qp1000</li></ul><div>From the summary <a href="https://mdcallag.github.io/reports/24_01_01.8u.1tno.c2.cached.fbmy56/all.html#summary">for 5.6</a></div></div><div><ul style="text-align: left;"><li>The base case is fbmy5635_rel_202104072149</li><li>Throughput in fbmy5635_rel_221222 is similar to the base case, except for range queries where there might be a small regression of ~7%</li><ul><li>l.i0 - relative QPS is <span style="background-color: #d9ead3;">1.01</span></li><li>l.x - relative QPS is <span style="background-color: #f4cccc;">0.96</span></li><li>l.i1, l.i2 - relative QPS is <span style="background-color: #f4cccc;">0.96</span>, <span style="background-color: #d9ead3;">1.01</span></li><li>qr100, qr500, qr1000 - relative QPS is <span style="background-color: #f4cccc;">0.93</span>, <span style="background-color: #f4cccc;">0.92</span>, <span style="background-color: #f4cccc;">0.99</span> </li><li>qp100, qp500, qp1000 - relative QPS is <span style="background-color: #f4cccc;">0.98</span>, <span style="background-color: #d9ead3;">1.03</span>, <span style="background-color: #d9ead3;">1.01</span></li></ul></ul></div><div><div>From the summary <a href="https://mdcallag.github.io/reports/24_01_01.8u.1tno.c2.cached.fbmy80/all.html#summary">for 8.0</a></div><div><ul style="text-align: left;"><li>The base case is fbmy8028_rel_221222</li><li>Results in MyRocks 8.0.32 with the performance schema disabled are mixed</li><li>Throughput in fbmy8032_rel_221222 is mostly similar to the base case. 
There might be a small regression for point queries.</li><ul><li>l.i0 - relative QPS is <span style="background-color: #f4cccc;">0.94</span></li><li>l.x - relative QPS is <span style="background-color: #d9ead3;">1.02</span></li><li>l.i1, l.i2 - relative QPS is <span style="background-color: #f4cccc;">0.99</span>, <span style="background-color: #f4cccc;">0.97</span></li><li>qr100, qr500, qr1000 - relative QPS is <span style="background-color: #d9ead3;">1.05</span>, <span style="background-color: #d9ead3;">1.00</span>, <span style="background-color: #d9ead3;">1.02</span></li><li>qp100, qp500, qp1000 - relative QPS is <span style="background-color: #f4cccc;">0.96</span>, <span style="background-color: #f4cccc;">0.96</span>, <span style="background-color: #f4cccc;">0.95</span></li></ul></ul></div></div><div><div>From the summary <a href="https://mdcallag.github.io/reports/24_01_01.8u.1tno.c2.cached.fbmy_all/all.html#summary">5.6, 8.0</a> with many versions:</div><div><ul><li>The base case is fbmy5635_rel_202104072149</li><li>Throughput in fbmy8032_rel_221222 relative to the base case is worse for write-heavy and better for read-heavy</li><ul><li>l.i0 - relative QPS is <span style="background-color: #f4cccc;">0.83</span></li><li>l.x - relative QPS is <span style="background-color: #f4cccc;">0.93</span></li><li>l.i1, l.i2 - relative QPS is <span style="background-color: #f4cccc;">0.94</span>, <span style="background-color: #f4cccc;">0.97</span></li><li>qr100, qr500, qr1000 - relative QPS is <span style="background-color: #f4cccc;">0.98</span>, <span style="background-color: #d9ead3;">1.07</span>, <span style="background-color: #d9ead3;">1.08</span> </li><li>qp100, qp500, qp1000 - relative QPS is <span style="background-color: #d9ead3;">1.05</span>, <span style="background-color: #d9ead3;">1.10</span>, <span style="background-color: #d9ead3;">1.08</span></li></ul></ul><div>From the summary for <a 
href="https://mdcallag.github.io/reports/24_01_01.8u.1tno.c2.cached.fbmy_latest/all.html#summary">5.6, 8.0</a> with latest versions</div><div><ul style="text-align: left;"><li>The base case is fbmy5635_rel_221222</li><li>Throughput in fbmy8032_rel_221222 relative to the base case is worse for write-heavy and better for read-heavy</li><ul><li>l.i0 - relative QPS is <span style="background-color: #f4cccc;">0.82</span></li><li>l.x - relative QPS is <span style="background-color: #f4cccc;">0.97</span></li><li>l.i1, l.i2 - relative QPS is <span style="background-color: #f4cccc;">0.98</span>, <span style="background-color: #f4cccc;">0.97</span></li><li>qr100, qr500, qr1000 - relative QPS is <span style="background-color: #d9ead3;">1.05</span>, <span style="background-color: #d9ead3;">1.16</span>, <span style="background-color: #d9ead3;">1.09</span> </li><li>qp100, qp500, qp1000 - relative QPS is <span style="background-color: #d9ead3;">1.07</span>, <span style="background-color: #d9ead3;">1.07</span>, <span style="background-color: #d9ead3;">1.07</span></li></ul></ul></div></div></div></div></div></div>Mark Callaghanhttp://www.blogger.com/profile/09590445221922043181noreply@blogger.com0