Improve read consistency with database replicas #4558

zsparal · 2024-06-07T13:41:47Z

Background

At Mollie, we have been using ProxySQL for our MySQL databases for a while now. Some time ago, we started using read-replicas as an alternative, read-only data-source in our applications. The decision of which DB handle (rw/ro) to use is ultimately left to the application itself: it can choose to either read from the primary database or from read-replicas, if the use-case allows. In the latter case, the application has to be tolerate some replication lag, but this is not a problem for a lot of situations.

Problem

The issue we have run into is that read operations are not always atomic and might have dependencies. Consider a case where we insert to records (A and B) in quick succession to the database, where B causally depends on A. The application loads B and then tries to load A, assuming it exists (because of the causal dependency). Because these queries can be routed to replicas with different amounts of replication lag, the query to load A might return no results if the writes haven't been replicated yet.

In certain cases, this lead to high error rates whenever replication lag for one of the replicas increased (a 3 orders of magnitude jump)

Solution

We implemented a solution for this problem in our own fork of ProxySQL that we have been using in production since. The patch has eliminated there errors and made our systems more predictable. We opened a Pull Request with the patch in #4553.

Considered Alternatives

We considered using GTID Consistent Reads as a potential solution, but decided to go with our patch for the following reasons:

GTID requires more components deployed in the cluster (the binlog reader), increasing operational burden
It passes the implementation complexity to the client applications if they use separate connections for reads and writes. In these cases, clients would need to keep a track of GTID's and send them as query annotations along all queries

We believe that our implementation provides a simple solution when an application just wants to make sure that all reads get a consistent state from the replicas in the pool.

The text was updated successfully, but these errors were encountered:

zsparal linked a pull request Jun 7, 2024 that will close this issue

Add sticky backend query hint #4553

Open

zsparal changed the title ~~Improved read consistency with database replicas~~ Improve read consistency with database replicas Jun 10, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve read consistency with database replicas #4558

Improve read consistency with database replicas #4558

zsparal commented Jun 7, 2024 •

edited

Loading

Improve read consistency with database replicas #4558

Improve read consistency with database replicas #4558

Comments

zsparal commented Jun 7, 2024 • edited Loading

Background

Problem

Solution

Considered Alternatives

zsparal commented Jun 7, 2024 •

edited

Loading