Go to file
Drew DeVault a3c0922eac Fix summary in SSL alarm 2020-02-25 12:23:28 -05:00
.build.yml .build.yml: corrections per package rename 2020-01-05 13:51:11 -05:00
LICENSE Add LICENSE 2020-01-05 13:15:14 -05:00
README.md Update README.md 2020-01-06 10:04:15 -05:00
backup_rules.yml Add SSL expiration rules 2020-02-25 11:57:31 -05:00
build_rules.yml Initial commit 2020-01-05 12:28:07 -05:00
meta_rules.yml *_rules.yml: s/alarming/urgent/ 2020-01-06 09:45:44 -05:00
node_rules.yml Remove redundant read-only filesystem alarm 2020-02-25 11:01:14 -05:00
ssl_rules.yml Fix summary in SSL alarm 2020-02-25 12:23:28 -05:00

README.md

metrics.sr.ht

This repository tracks our Prometheus alert rules. They are available as a package from mirror.sr.ht (for Alpine only) as metrics.sr.ht-rules.

Our Prometheus instance is public:

https://metrics.sr.ht

Usage instructions

  1. Install our package
  2. Add our rules_files entries to your prometheus.yml for each set of rules you wish to use
  3. Configure alertmanager accordingly

Our alerts are categorized into three severity groups:

  • interesting alerts are worth noting, as they may be useful in identifying trends over time, for forensic attention after an outage, or for addressing on a rainy day. Upstream, we send these to our IRC channel.
  • important alerts are likely to be actionable, but do not require immediate attention.
  • urgent alerts require immediate attention.