Prometheus monitoring server, includes prometheus, alertmanager and blackbox-exporter. This requires docker to be running (not handled by this role).
This role is still in development.
-
prometheus_targets
: Optional list of dictionaries of targets:group
: A list of Ansible groups (optional)hosts
: A list of hosts (optional)port
: The port to be monitoredjobname
: The prometheus job-namescrape_interval
: Prometheus scrape interval (optional)metrics_path
: Path to metrics (optional)
-
prometheus_sd_targets
: Optional list of dictionaries of additional targets:groupname
: The name of the configuration target file, used to name a configuration file so must be uniquegroup
: A list of Ansible groups (optional)hosts
: A list of hosts (optional)port
: The port to be monitoredjobname
: The prometheus job-name
This is intended to be an example. In practice these configuration files could be dynamically generated outside this role, prometheus will automatically reload them.
-
prometheus_custom_targets
: Optional list of dictionaries of additional targets with custom arguments, use this if you need to pass custom arguments that aren't supported by the previous two parameters. Each item is a<scrape_config>
that will be copied unchanged into the configuration. -
prometheus_port
: External Prometheus port, set to0
to disable, default9090
-
prometheus_alertmanager_port
: External Alertmanager port, set to0
to disable, default9093
-
prometheus_blackboxexporter_port
: External Blackbox-exporter port, set to0
to disable, default9115
-
prometheus_additional_command_args
: Additional command line arguments for prometheus -
prometheus_alertmanager_additional_command_args
: Additional command line arguments for alertmanager -
prometheus_additional_rules_template
: Template with additional alert rules. See https://awesome-prometheus-alerts.grep.to/rules for some ideas but note the labels may be different. -
prometheus_docker_network
: Docker network for prometheus Docker applications, defaultprometheus
-
prometheus_docker_user
: User ID that prometheus should run as, default is the container default -
prometheus_docker_data_volume
: Docker volume or host path for Prometheus data, default is a docker volume calledprometheus-data
. If this is a host path it must be writeable byprometheus_docker_user
.
This role sets the following variables which can be used in other tasks:
prometheus_internal_ip
: Internal IP of the Prometheus containerprometheus_blackboxexporter_internal_ip
: Internal IP of the Blackbox exporter containerprometheus_alertmanager_internal_ip
: Internal IP of the AlertManager container
These are intended for use when you don't want to expose the container ports using standard Docker port-forwarding (set prometheus_*port: 0
).
- hosts: localhost
roles:
- role: ome.docker
- role: ome.prometheus
Change prometheus_alertmanager_slack_webhook
in playbook.yml
to a real Slack webhook. Run:
molecule test --destroy never
molecule login
docker stop fake-metrics
Wait a few minutes, a Slack alert should be generated.
If this fails try creating a manual alert:
curl -H "Content-Type: application/json" -d '[{"labels":{"alertname":"TestAlert1"}}]' localhost:9093/api/v1/alerts
The molecule test also includes a disk space alert configuration.
To test this fill up at least 90% of the /run
partition:
molecule login
dd if=/dev/zero of=/run/fill.space bs=1M count=...
Wait a few minutes and you should see a disk space warning.