ES with Multi-Cluster Search Support #4473

gseng · 2013-12-16T23:40:19Z

ES with Multi-Cluster Search Support

It'd be nice to be able to query across multiple clusters and get their aggregated results.

Our initial motivation is to view Kibana results across multiple clusters. See: Enhancement: Allow conntections to multiple ES backends from a single Kibana instance.

However, since we also use/query ES directly, a proxy would work far better than changing Kibana.

In the discussion above, it was decided (by Shay) that the place to do it would be at the ES level.

The following is a proposal to achieve it at the ES level. This is some very early planning and I've decided to post it early to make sure I'm not duplicating work or am on a totally wrong path (which is totally possible). Any feedback/comments are appreciated.

Proposal

The general plan is to make a query only node (termed 'search load balancer' in elasticsearch.yml) and have a list of cluster names that we want to query.

During a search, the node will query from each of the shards in each of the clusters and aggregate the results.

Details

Make a query only node
- In elasticsearch.yml
  - node.master: false
  - node.data: false
- I'm hoping this means that we can isolate code changes to only the search portion.
Accommodate multiple clusters
- Have ZenDiscover reach out to all the listed clusters to get their state.
- ClusterService will contain a map of ClusterName -> ClusterState.
  - To maintain the interface, clusterService.state() will just return the first cluster.
  - We can have another interface that will allow us to get the the cluster map.
- MulticastZenPing will have to listen for changes in the listed clusters and update the corresponding cluster state.
Searching across multiple clusters
- I've only looked at TransportSearchTypeAction and TransportSearchQueryThenFetchAction so far.
- In the BaseAsyncAction, we'll need to get all the relevant shards across each cluster and query them (sendExecuteFirstPhase).
- We change expectedSuccessfulOps and expectedTotalOps to the multi-cluster counts so that in onFirstPhaseResult we know when to move on (innerMoveToSecondPhase).
- In moveToSecondPhase, we again use the metadata from each cluster and do the actual fetch of the documents.
- Finally in innerFinishHim, we merge all the results with the SearchPhaseController and return the response via the normally.

Thanks!

The text was updated successfully, but these errors were encountered:

brusic · 2014-01-13T19:28:16Z

Just noticed an interesting commit that addresses this issue: #4708

Definitely a more thought out approach since it merges the cluster states, but requires a new "tribe" node. I wonder how it will works for single cluster actions that do not require a merged cluster state.

gseng · 2014-01-13T23:37:02Z

Thanks brusic, that sounds like exactly what we want, and as you said, more elegant. Am working on seeing if it works out for us (comments in #4708 ).

brusic · 2014-01-14T00:02:54Z

Glad to see that my comment was useful and that you are already testing out the feature.

gseng closed this as completed Jan 13, 2014

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ES with Multi-Cluster Search Support #4473

ES with Multi-Cluster Search Support #4473

gseng commented Dec 16, 2013

brusic commented Jan 13, 2014

gseng commented Jan 13, 2014

brusic commented Jan 14, 2014

ES with Multi-Cluster Search Support #4473

ES with Multi-Cluster Search Support #4473

Comments

gseng commented Dec 16, 2013