Skip to content

eltonplima/elixir_graceful_shutdown_demo

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Graceful Shutdown in Elixir DEMO

Demo app for my Code BEAM V talk "Graceful shutdown in Elixir - try not to drop the ball".

Slides

Versions

v1 - Simple calc API

Dead simple calc API with a single sum endpoint, which slowly adds the numbers (100ms sleep). Deployed to k8s with very min config.

Problems:

  • Failures when deploying: 503
  • Failures when scaling up: 503

v2 - Fix scaling up

Tell k8s when a pod is ready to accept HTTP traffic, i.e. create Health plug and define k8s readiness and liveness probes.

No failures, because containers ignore SIGTERM and are killed after 30s.

v3 - Pass SIGTERM to Elixir app

Directly call calc start. Setup task like DB migrations should be done with k8s Init Containers or Tasks.

Problems:

  • Still few failures when deploying: EOF, 502

v4 - Simulate Phoenix before 1.5

Phoenix 1.5 comes by default with Plug.Cowboy.Drainer. To simulate behavior before that, we can configure Endpoint with drainer: false.

Problems:

  • Few times more failures when deploying than in v3: EOF, 502

v5 - Add pre-stop delay

Give k8s time to remove Pod from Endpoints, before sending SIGTERM to Elixir (preStop hook)

No failures for both HTTPS and HTTP2 🚀

v6 - Log calculations using Task

For each calculation start a Task that sleeps for 5s and then sends operation log to InfluxDB over HTTP.

Problems:

  • Failures when deploying: calculations not logged

v7 - Add graceful shutdown for Task

All calculations are logged when deploying 🚀

v8 - Log calculations in batches

Send calculation to InfluxDB in batches of 500 or every 30s.

Problems:

  • Failures when deploying: calculations not logged

v9 - Add graceful shutdown for batching

All calculations are logged when deploying 🚀

How to

Play locally

Run local server (IEx):

make local-server

Test local-server:

make local-test

Run local server in Docker:

make docker-local-server

Deploy to k8s

Create DigitalOcean (DO) account.

Install doctl and authenticate with your DO account:

doctl auth init

Create DO Docker container repository, e.g.:

make do-create-registry name=pawel-szafran

Create k8s cluster with 7 nodes in Frankfurt:

make do-create-k8s-cluster

Add Docker registry to created k8s cluster:

make do-add-registry-to-k8s-cluster

Build and push Docker image to registry:

make docker-push-image v=0.1.0

Deploy app to k8s:

make k8s-deploy v=0.1.0

This will create a deployment with 13 app replicas:

$ kubectl get deployment/calc
NAME   READY   UP-TO-DATE   AVAILABLE   AGE
calc   13/13   13           13          5m31s

And will create a service with DO Load Balancer (may take few minutes):

$ kubectl get services/calc
NAME   TYPE           CLUSTER-IP    EXTERNAL-IP      PORT(S)        AGE
calc   LoadBalancer   10.245.0.14   157.230.76.110   80:30886/TCP   3m57s

Test k8s service:

make k8s-test

Watch app logs:

make k8s-logs

Load test k8s deployment

Create load test VM and install load testing tool k6 (retry if installing k6 fails):

make do-create-load-test-vm

Load test the app with 500 virtual users for 5m (press ctrl+c to finish test earlier):

make do-load-test

Test with HTTPS and HTTP/2

Add your domain e.g. pawelszafran.online to your DO account.

Create TLS certificate for calc subdomain:

make do-create-cert domain=calc.pawelszafran.online

Create DNS record A for calc subdomain directing to the external IP of DO Load Balancer with TTL of e.g. 10m.

Update DO Load Balancer's forwarding rules:

Old:
  TCP 80 -> TCP 30475

HTTPS:
  HTTP 80 -> HTTP 30475
  HTTPS 443 (cert: calc) -> HTTP 30475

or HTTP/2:
  HTTP 80 -> HTTP 30475
  HTTP2 443 (cert: calc) -> HTTP 30475

Set new URL for tests:

make set-calc-url url=https://calc.pawelszafran.online

Test how deploying and scaling affects user traffic

Run load test, and while it's running, bump the number or replicas up or down:

kubectl scale deployment/calc --replicas=5

Or redeploy (restart last rollout):

make k8s-redeploy

Or push the same code 2x with 2 different versions, e.g. for Git tag v1:

make docker-push-image v=0.1.0
make docker-push-image v=0.1.1

make k8s-deploy v=0.1.0
make k8s-deploy v=0.1.1

Test async calculation-logging to InfluxDB - from v6

Run InfluxDB locally:

make influxdb-local

Deploy InfluxDB to k8s:

make influxdb-k8s-deploy

Forward local port to InfluxDB k8s service:

make influxdb-k8s-port-forward

Count operations in InfluxDB (local or k8s with port-forward):

make influxdb-count
# or
watch -n 3 make influxdb-count

Clear operations in InfluxDB (local or k8s with port-forward):

make influxdb-clear

Clean up

make k8s-delete
make do-delete

About

Demo app for my "Graceful shutdown in Elixir" talk

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Elixir 70.1%
  • Makefile 22.4%
  • Dockerfile 4.2%
  • JavaScript 2.3%
  • Shell 1.0%