[Feature]: Make Elasticsearch service restart after config change more graceful #343

frankhetterich · 2024-08-27T11:52:15Z

Describe the feature request

We updated our Elasticsearch instances to version 8.15.0 using the rolling update feature of the collection, which worked well. The cluster was available the whole time.

As part of the update we also changed some parameters in the Elasticsearch config. The collection currently applies parameter changes as part of the "normal" installation process, after all nodes have been updated. This means the config is changed on all nodes and all nodes are then restarted at once by a handler. This full cluster restart makes the cluster unavailable for some time.

For us it makes no sense to perform a rolling update, with many tasks to make sure the cluster stays available the whole time, and then follow it with a full cluster restart that does the opposite.

Please implement a "graceful" cluster restart (rolling restarts with cluster health checks) after an Elasticsearch config change.
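
To illustrate what a cluster health check between node restarts could look like, here is a minimal sketch of a single Ansible task. It is not taken from the collection: the URL, the retry counts, and the variable name `cluster_health` are assumptions, and a secured 8.x cluster would additionally need HTTPS and credentials.

```yaml
# Hypothetical task: block until the cluster reports "green" before
# moving on to the next node.
# Assumptions: plain HTTP on localhost:9200, no authentication,
# and up to 60 attempts at 5 second intervals.
- name: Wait for cluster health to become green
  ansible.builtin.uri:
    url: "http://localhost:9200/_cluster/health"
    method: GET
  register: cluster_health
  # The "is defined" guard keeps retrying while the node is still starting
  # and the request returns no JSON body.
  until: cluster_health.json is defined and cluster_health.json.status == "green"
  retries: 60
  delay: 5
```

For the restart to be rolling, a check like this would have to run after each node restart, with nodes processed one at a time (for example with `serial: 1` on the play).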

ivareri · 2024-08-29T10:51:19Z

ouch.

I'd say this is a bug and not a feature. Should be easy enough to create a handler for a rolling cluster restart (all the code is in the repo already), but I'm not sure how to best handle it without duplicating most of the code from elasticsearch-rolling-upgrade.yml in a handler.

Is there a way to inject tasks into a handler? We could have two task files: one with all the tasks to gracefully stop a node, and one with everything to bring it back online and wait for the cluster to become green. They could then be included in both a cluster restart handler and the rolling-upgrade file.

So the handler would look something like this:

    - name: Gracefully stop node
      ansible.builtin.include_tasks:
        file: cluster_restart_stop_node.yaml

    - name: Start node and wait for green cluster
      ansible.builtin.include_tasks:
        file: cluster_restart_start_node.yaml

And the "Be careful about upgrade when Elasticsearch is running" block in elasticsearch-rolling-upgrade.yml would be reduced to something like this:

    - name: Gracefully stop node
      ansible.builtin.include_tasks:
        file: cluster_restart_stop_node.yaml

    # Tasks to upgrade packages

    - name: Start node and wait for green cluster
      ansible.builtin.include_tasks:
        file: cluster_restart_start_node.yaml
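
As a rough idea of what the stop-node file might contain, here is a sketch that follows the rolling restart procedure from the Elasticsearch documentation (restrict shard allocation, flush, stop the service). It is not the code currently in elasticsearch-rolling-upgrade.yml; the endpoint URL, the missing authentication, and the service name `elasticsearch` are assumptions.

```yaml
# cluster_restart_stop_node.yaml (hypothetical contents)
# Assumptions: plain HTTP on localhost:9200, no authentication,
# and a service unit named "elasticsearch".

- name: Restrict shard allocation to primaries
  ansible.builtin.uri:
    url: "http://localhost:9200/_cluster/settings"
    method: PUT
    body_format: json
    body:
      persistent:
        cluster.routing.allocation.enable: primaries

- name: Flush indices to speed up shard recovery
  ansible.builtin.uri:
    url: "http://localhost:9200/_flush"
    method: POST

- name: Stop Elasticsearch on this node
  ansible.builtin.service:
    name: elasticsearch
    state: stopped
```

The start-node file would mirror this: start the service, reset `cluster.routing.allocation.enable` to its default, and wait for the cluster to report green.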

widhalmt · 2024-09-25T07:45:38Z

Good find, and sorry for your bad experience. Yes, definitely a bug. We'll look into it. Thanks also for the suggestions, @ivareri.

frankhetterich added feature New feature or request needs-triage Needs to be triaged labels Aug 27, 2024

widhalmt added bug Something isn't working and removed feature New feature or request needs-triage Needs to be triaged labels Sep 25, 2024

widhalmt self-assigned this Sep 25, 2024
