Add new index for the unique constraint on imapuid #326

drewler · 2022-03-04T10:54:45Z

Queries such as this one and this one have a high impact on IO because of the large number of messages returned and the amount of data per row (all the columns are fetched).

This situation could be mitigated by using a covering index on ("account_id", "folder_id", "msg_uid"). There is already an index there, because the UniqueConstraint creates one behind the scenes, just with the columns in the wrong order. By reordering it so that msg_uid is the last column of the index, the unique constraint should double as a covering index, resulting in a significant reduction in IO (around 75%).
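For illustration, here is a minimal sketch of what "covering" means for this column order, using Python's built-in sqlite3 module rather than the production database. The cut-down table, the extra_flags column, and the index name are assumptions made for the example. Query planners differ between engines, so this only shows the general idea.

```python
import sqlite3

# Illustration only: SQLite stands in for the production database, and this
# is a cut-down, hypothetical version of the imapuid table.
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE imapuid (
        id INTEGER PRIMARY KEY,
        account_id INTEGER NOT NULL,
        folder_id INTEGER NOT NULL,
        msg_uid INTEGER NOT NULL,
        extra_flags TEXT  -- hypothetical column that is NOT part of the index
    )
    """
)
# Unique index in the proposed order, msg_uid last (the index name is made up).
conn.execute(
    "CREATE UNIQUE INDEX uq_imapuid_account_folder_uid "
    "ON imapuid (account_id, folder_id, msg_uid)"
)


def plan(sql):
    """Return the planner's description of how it would run `sql`."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql, (1, 2)).fetchall()
    return " | ".join(row[3] for row in rows)


# Only indexed columns are requested, so the index alone can answer the query.
narrow_plan = plan(
    "SELECT msg_uid FROM imapuid WHERE account_id = ? AND folder_id = ?"
)
# All columns are requested, so each match still needs a lookup in the table.
wide_plan = plan(
    "SELECT * FROM imapuid WHERE account_id = ? AND folder_id = ?"
)
print(narrow_plan)
print(wide_plan)
```

In this sketch the narrow query is reported as using a covering index, while the SELECT * variant uses the same index only to locate rows and then reads them from the table.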

This first migration creates a new unique constraint (plus the related index) using the new ordering. After that, we'll use a second migration to drop the old constraint.
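The two-step approach can be sketched as follows, again with Python's sqlite3 module as a stand-in. This is not the actual migration code: the index names uq_old and uq_new and the old column order are hypothetical, since neither is spelled out here. The point is only that uniqueness stays enforced at every step.

```python
import sqlite3

# Illustration only: SQLite stand-in with a cut-down imapuid table.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE imapuid ("
    "id INTEGER PRIMARY KEY, account_id INTEGER, folder_id INTEGER, msg_uid INTEGER)"
)
# Hypothetical "old" unique index; its name and column order are assumptions.
conn.execute("CREATE UNIQUE INDEX uq_old ON imapuid (folder_id, msg_uid, account_id)")
conn.execute("INSERT INTO imapuid (account_id, folder_id, msg_uid) VALUES (1, 2, 3)")
conn.commit()


def index_names():
    """List the indexes currently defined on imapuid."""
    rows = conn.execute(
        "SELECT name FROM sqlite_master WHERE type = 'index' AND tbl_name = 'imapuid'"
    )
    return sorted(row[0] for row in rows)


def duplicate_rejected():
    """Try to insert a duplicate row and report whether the database refused it."""
    try:
        conn.execute(
            "INSERT INTO imapuid (account_id, folder_id, msg_uid) VALUES (1, 2, 3)"
        )
    except sqlite3.IntegrityError:
        return True
    finally:
        conn.rollback()  # never keep the probe row, whatever happened
    return False


# Step 1 (first migration): add the new unique index next to the old one.
conn.execute("CREATE UNIQUE INDEX uq_new ON imapuid (account_id, folder_id, msg_uid)")
after_step_1 = (index_names(), duplicate_rejected())

# Step 2 (second migration): drop the old unique index.
conn.execute("DROP INDEX uq_old")
after_step_2 = (index_names(), duplicate_rejected())

print(after_step_1)
print(after_step_2)
```

Splitting the change this way means there is no window in which duplicates could slip in, and the first step can be rolled back on its own by dropping the new index.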

squeaky-pl · 2022-03-18T13:01:20Z

This is on hold for now, commenting so the bot does not remind me every day.

wojcikstefan · 2022-06-14T14:47:03Z

Is there a particular action item or some other thing that needs to happen before we can revisit this PR @drewler @squeaky-pl?

drewler · 2022-06-14T15:56:18Z

I don't have the screenshot showing the difference for the cluster where these changes were applied, but comparing that cluster today with another one under a similar load, the number of rows read dropped by ~33%. One MAX query (previously the third-worst-performing query) is now instantaneous thanks to the index. As a result, IO usage drops and query latency improves. We didn't complete the upgrade because:

1. There was another query that should have seen its performance improve because of this index, but we couldn't see any difference and didn't understand why.
2. For neither query, the one that improved or the one that didn't, could we show any meaningful impact. We had started investigating this because of an alert related to CPU usage and, while looking into that, found out about this IO performance issue. Performance did improve, but we couldn't demonstrate a meaningful impact, so this has been kept in the freezer until we can investigate some more.

Guess we should either revert to the old index (we put the new one in place in one of the clusters) or update it for all clusters.

Query that improved:

SELECT MAX(imapuid.msg_uid)
FROM imapuid
WHERE imapuid.account_id =...
    AND imapuid.folder_id =...

Query that didn't improve:

-- we also tried FORCE INDEX on this one
SELECT imapuid.msg_uid
FROM imapuid
WHERE imapuid.account_id =...
    AND imapuid.folder_id =...

@squeaky-pl feel free to expand on my takes (or correct them if there's something wrong).

Add new index for the unique constraint on imapuid

045c3e4

drewler requested a review from squeaky-pl March 4, 2022 10:54

drewler added 3 commits March 4, 2022 11:56

Fix typo

280d12b

Rename

881e6b8

Merge branch 'master' into reorg-unique-contraint-index

fb58586

squeaky-pl removed their request for review August 11, 2022 13:00
