TS.MGET / TS.MRANGE - Multi-shard command failed. #1553

Skoucail · 2023-12-19T09:28:43Z

When executing the commands TS.MGET/TS.MRANGE I keep getting the error: Multi-shard command failed. This may happen if a shard needs to process too much data. Try to apply strict filters, if possible.

The strange thing is that even if I make my TS.MGET/TS.MRANGE FILTER so specific that it should return only one time series (so basically applying a strict filter), the same error is returned.

I tried with and without executing 'timeseries.REFRESHCLUSTER' before the TS.MGET/TS.MRANGE commands, but I get the same result (the error) either way.

Example:

Setup:
6-node Redis Stack cluster (3 masters, 3 slaves)
Timeseries version v1.10.04
Number of time series: approximately 800 (split over the 3 master nodes)

LiorKogan · 2023-12-19T11:34:22Z

You'll get this error when libMR reaches a timeout while waiting for results from all the shards. This means that at least one shard needs to process too much data or there is some communication problem/slowdown. It can also happen if one of the shards crashes or is otherwise not available.

I'm not sure why it is happening in your specific case. MGET shouldn't generate a large reply for less than 800 time series. You are also not using too many labels, so the processing is expected to be fast.

Unrelated: it seems that you are using a label named DUPLICATE_POLICY. Is it intentional?

tezc · 2023-12-19T11:55:03Z

Do you have access to redis log files? If so, we might see some log lines that indicate the error.

Skoucail · 2023-12-19T12:16:08Z

Update:

It seems I'm only getting the error when sending the command to SLAVE nodes.

The nodes run in Docker, so I have access to the log files.
But they basically say the same as what LiorKogan guessed.

Log output from the node running on port 6383:

9:S 19 Dec 2023 12:05:47.040 * <timeseries> Got cluster refresh command
9:S 19 Dec 2023 12:05:56.659 # <timeseries> message was not sent because status is not connected
9:S 19 Dec 2023 12:05:56.660 # <timeseries> message was not sent because status is not connected
9:S 19 Dec 2023 12:05:56.660 # <timeseries> message was not sent because status is not connected
9:S 19 Dec 2023 12:05:56.660 * <timeseries> connected : xxx.xxx.xxx.xxx:6381, status = 0
9:S 19 Dec 2023 12:05:56.660 * <timeseries> connected : xxx.xxx.xxx.xxx:6379, status = 0
9:S 19 Dec 2023 12:05:56.660 * <timeseries> connected : xxx.xxx.xxx.xxx:6380, status = 0
9:S 19 Dec 2023 12:06:01.663 # <timeseries> got libmr error:
9:S 19 Dec 2023 12:06:01.663 # <timeseries> execution max idle reached
9:S 19 Dec 2023 12:11:03.273 # <timeseries> got libmr error:
9:S 19 Dec 2023 12:11:03.273 # <timeseries> execution max idle reached
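For anyone triaging similar logs, here is a minimal Python sketch that pulls the module warnings out of lines in the standard Redis log layout (`pid:role date time level message`, where role `S` is a replica and level `#` is a warning). The names `parse_line`, `module_warnings`, and `SAMPLE` are made up for this illustration; the sample lines are copied from the log above.

```python
import re
from typing import List, NamedTuple, Optional

# Standard Redis log line: "<pid>:<role> <day> <mon> <year> <time> <level> <message>"
# Roles: X sentinel, C child, S replica, M master. Levels: . - * # (debug..warning).
LOG_LINE = re.compile(
    r"^(?P<pid>\d+):(?P<role>[XCSM]) "
    r"(?P<ts>\d{1,2} \w{3} \d{4} \d{2}:\d{2}:\d{2}\.\d{3}) "
    r"(?P<level>[.\-*#]) "
    r"(?:<(?P<module>[^>]+)> )?(?P<msg>.*)$"
)


class LogEntry(NamedTuple):
    role: str
    timestamp: str
    level: str
    module: Optional[str]
    message: str


def parse_line(line: str) -> Optional[LogEntry]:
    """Parse one log line; return None if it does not match the layout."""
    m = LOG_LINE.match(line.strip())
    if m is None:
        return None
    return LogEntry(m["role"], m["ts"], m["level"], m["module"], m["msg"])


def module_warnings(lines: List[str], module: str = "timeseries") -> List[LogEntry]:
    """Keep only warning-level ('#') entries emitted by the given module."""
    entries = (parse_line(line) for line in lines)
    return [e for e in entries if e and e.module == module and e.level == "#"]


# Lines copied from the log excerpt above.
SAMPLE = [
    "9:S 19 Dec 2023 12:05:47.040 * <timeseries> Got cluster refresh command",
    "9:S 19 Dec 2023 12:05:56.659 # <timeseries> message was not sent because status is not connected",
    "9:S 19 Dec 2023 12:05:56.660 * <timeseries> connected : xxx.xxx.xxx.xxx:6381, status = 0",
    "9:S 19 Dec 2023 12:06:01.663 # <timeseries> got libmr error:",
    "9:S 19 Dec 2023 12:06:01.663 # <timeseries> execution max idle reached",
]

if __name__ == "__main__":
    for entry in module_warnings(SAMPLE):
        print(entry.role, entry.timestamp, entry.message)
```

Filtering on the role field as well (`entry.role == "S"`) would make it easy to check whether the libMR errors show up only on replicas.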

LiorKogan · 2023-12-19T13:57:22Z

@MeirShpilraien is it possible to use libMR with slave nodes?
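Until that question is answered, one possible workaround (an assumption based only on the observation above that the error appears on slave nodes) is to send TS.MGET/TS.MRANGE to master nodes only. The sketch below picks master addresses out of the text returned by `CLUSTER NODES`. The function name `master_addresses` and the sample output (node IDs, 127.0.0.1 addresses, slot ranges, and which ports are masters) are hypothetical and not taken from the reporter's cluster.

```python
from typing import List, Tuple


def master_addresses(cluster_nodes_output: str) -> List[Tuple[str, int]]:
    """Return (host, port) for every reachable master in CLUSTER NODES output.

    Each line has the form:
    <id> <ip:port@cport[,hostname]> <flags> <master> <ping-sent> <pong-recv>
    <config-epoch> <link-state> <slot> ...
    """
    masters = []
    for line in cluster_nodes_output.splitlines():
        fields = line.split()
        if len(fields) < 8:
            continue  # blank or malformed line
        flags = fields[2].split(",")
        if "master" not in flags or "fail" in flags or "noaddr" in flags:
            continue
        # Drop the cluster bus port and optional hostname after '@'.
        host_port = fields[1].split("@", 1)[0]
        host, _, port = host_port.rpartition(":")
        masters.append((host, int(port)))
    return masters


# Hypothetical output for illustration only (IDs and addresses are made up).
SAMPLE_CLUSTER_NODES = """\
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 127.0.0.1:6379@16379 myself,master - 0 0 1 connected 0-5460
bbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbb 127.0.0.1:6380@16380 master - 0 1702987547000 2 connected 5461-10922
cccccccccccccccccccccccccccccccccccccccc 127.0.0.1:6381@16381 master - 0 1702987547000 3 connected 10923-16383
dddddddddddddddddddddddddddddddddddddddd 127.0.0.1:6383@16383 slave aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 0 1702987547000 1 connected
"""

if __name__ == "__main__":
    for host, port in master_addresses(SAMPLE_CLUSTER_NODES):
        print(f"{host}:{port}")
```

With redis-py, each returned address could then be used to open a connection (`redis.Redis(host=host, port=port)`) and issue the query through `execute_command("TS.MGET", "FILTER", ...)`. Whether this avoids the error in every setup is not confirmed in this thread.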
