Resiliency in ECH and ECE deployments
With Elastic Cloud Hosted (ECH) and Elastic Cloud Enterprise (ECE), your deployment can be spread across up to three separate availability zones, each hosted in an isolated infrastructure domain, such as separate data centers in the case of Elastic Cloud Hosted.
While this document focuses on how ECH and ECE handle resilience, all the concepts and recommendations described in this section are also applicable to other deployment types. For example, in Elastic Cloud on Kubernetes, you can configure availability zone distribution and node scheduling through your Kubernetes platform.
Why this matters:
- Data centers can have issues with availability. Internet outages, earthquakes, floods, or other events could affect the availability of a single data center. With a single availability zone, you have a single point of failure that can bring down your deployment.
- Multiple availability zones help your deployment remain available. This includes your Elasticsearch cluster, provided that your cluster is sized so that it can sustain your workload on the remaining data centers and that your indices are configured to have at least one replica.
- Multiple availability zones enable you to resize your deployment with zero downtime.
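The sizing condition in the list above can be made concrete with a little arithmetic. The following is a minimal sketch, not an official sizing tool: it assumes equally sized nodes spread evenly across zones, and the function names (`min_nodes_per_zone`, `survives_zone_loss`) are hypothetical names chosen for illustration.

```python
import math


def min_nodes_per_zone(nodes_for_workload: int, zones: int) -> int:
    """Nodes each zone needs so that the remaining zones can still
    carry the workload after one whole zone is lost.

    Assumption: capacity is measured in identical nodes.
    """
    if zones < 2:
        raise ValueError("surviving a zone failure needs at least two zones")
    if nodes_for_workload < 1:
        raise ValueError("the workload needs at least one node")
    # After one zone fails, (zones - 1) zones must hold the whole workload.
    return math.ceil(nodes_for_workload / (zones - 1))


def survives_zone_loss(nodes_per_zone: int, zones: int,
                       nodes_for_workload: int, replicas: int) -> bool:
    """True if the deployment meets both conditions from the text:
    enough capacity in the remaining zones and at least one replica per index."""
    if zones < 2:
        return False
    remaining_capacity = nodes_per_zone * (zones - 1)
    return replicas >= 1 and remaining_capacity >= nodes_for_workload


# A workload that needs 4 nodes:
two_zone_size = min_nodes_per_zone(4, zones=2)    # 4 nodes per zone, 8 in total
three_zone_size = min_nodes_per_zone(4, zones=3)  # 2 nodes per zone, 6 in total
```

Under these assumptions, spreading the same workload over three zones needs less total capacity than two zones, because a single zone failure removes a smaller share of the deployment.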
ECH and ECE orchestrators automatically handle several aspects of cluster resilience that are discussed in the self-managed resiliency guidance for small and large clusters:
- Configuring master-eligible nodes:
  - Setting up a voting-only tiebreaker when the cluster spans two availability zones.
  - Promoting dedicated master nodes as the cluster grows to improve stability and fault tolerance.
- Enabling shard allocation awareness to maintain resilience during whole-zone failures.
- Automatically assigning node roles based on the configured data tiers.
- Creating automatic snapshots at regular configurable intervals, to provide an extra level of redundancy.
- For ECH, review the plan for production page to learn how to plan your deployment for production.
- For ECE, review the high availability in ECE page to learn how to configure your ECE installation to be highly available.
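To illustrate why a voting-only tiebreaker matters when a cluster spans two availability zones, here is a simplified sketch of master election by majority. It assumes a fixed set of voting nodes and a strict-majority rule; Elasticsearch manages its voting configuration dynamically, so real behavior differs in detail. The zone names, the placement of the tiebreaker in a third zone, and the helper names are assumptions made for this example.

```python
from typing import Dict, List


def has_quorum(voters_by_zone: Dict[str, int], failed_zone: str) -> bool:
    """True if a strict majority of voting nodes survives the loss of one zone."""
    total = sum(voters_by_zone.values())
    surviving = total - voters_by_zone.get(failed_zone, 0)
    # Strict majority: more than half of all voters must remain.
    return surviving * 2 > total


def zones_that_break_quorum(voters_by_zone: Dict[str, int]) -> List[str]:
    """Zones whose loss would leave the cluster unable to elect a master."""
    return [zone for zone in voters_by_zone
            if not has_quorum(voters_by_zone, zone)]


# Three master-eligible nodes in only two zones: one zone must hold two of them,
# so losing that zone loses the majority.
two_zones = {"zone-a": 2, "zone-b": 1}

# One master-eligible node per data zone, plus a voting-only tiebreaker
# placed in a third zone (assumed placement for this sketch).
with_tiebreaker = {"zone-a": 1, "zone-b": 1, "zone-c": 1}

at_risk_without = zones_that_break_quorum(two_zones)     # ["zone-a"]
at_risk_with = zones_that_break_quorum(with_tiebreaker)  # []
```

In this model, no distribution of voters across only two zones survives the loss of either zone, which is why a tiebreaker outside those two zones is needed.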