On Wed, Apr 24, 2019 at 09:52:17AM +0200, Clement Verna wrote:
Hi all,
We currently don't monitor that resultsdb sends messages on the bus.
This patch will make nagios trigger an alert if we don't see a message
in 2 days.
+1s ?
+1 for me :)
Pierre
From bb3a6083d2c50a010ab837a7c5c80017224957e4 Mon Sep 17 00:00:00
2001
From: Clement Verna <cverna(a)tutanota.com>
Date: Wed, 24 Apr 2019 09:48:26 +0200
Subject: [PATCH] Nagios: monitor that resultsdb sends messages on the bus
Signed-off-by: Clement Verna <cverna(a)tutanota.com>
---
.../nagios_client/templates/check_datanommer_history.cfg.j2 | 1 +
roles/nagios_server/files/nagios/services/fedmsg.cfg | 6 ++++++
2 files changed, 7 insertions(+)
diff --git a/roles/nagios_client/templates/check_datanommer_history.cfg.j2
b/roles/nagios_client/templates/check_datanommer_history.cfg.j2
index b967285fb..f89ebfb3a 100644
--- a/roles/nagios_client/templates/check_datanommer_history.cfg.j2
+++ b/roles/nagios_client/templates/check_datanommer_history.cfg.j2
@@ -40,6 +40,7 @@
command[check_datanommer_autocloud]={{libdir}}/nagios/plugins/check_datanommer_t
command[check_datanommer_twoweekatomic]=/usr/lib64/nagios/plugins/check_datanommer_timesince.py
org.fedoraproject.prod.releng.atomic.twoweek.complete 1296000 1382400
command[check_datanommer_mdapi]=/usr/lib64/nagios/plugins/check_datanommer_timesince.py
mdapi 28800 86400
command[check_datanommer_greenwave]=/usr/lib64/nagios/plugins/check_datanommer_timesince.py
greenwave 172800 172800
+command[check_datanommer_resultsdb]=/usr/lib64/nagios/plugins/check_datanommer_timesince.py
resultsdb 172800 172800
# This one is retired since it times out all the time. Too few messages.
#command[check_datanommer_nuancier]={{libdir}}/nagios/plugins/check_datanommer_timesince.py
nuancier 23652000 31536000
diff --git a/roles/nagios_server/files/nagios/services/fedmsg.cfg
b/roles/nagios_server/files/nagios/services/fedmsg.cfg
index 255e619e6..b10aaee58 100644
--- a/roles/nagios_server/files/nagios/services/fedmsg.cfg
+++ b/roles/nagios_server/files/nagios/services/fedmsg.cfg
@@ -320,6 +320,12 @@ define service {
check_command check_by_nrpe!check_datanommer_greenwave
use defaulttemplate
}
+define service {
+ host_name
busgateway01.phx2.fedoraproject.org
+ service_description Check datanommer for recent resultsdb messages
+ check_command check_by_nrpe!check_datanommer_resultsdb
+ use defaulttemplate
+}
# BEGIN, check consumers and producers
--
2.21.0
_______________________________________________
infrastructure mailing list -- infrastructure(a)lists.fedoraproject.org
To unsubscribe send an email to infrastructure-leave(a)lists.fedoraproject.org
Fedora Code of Conduct:
https://getfedora.org/code-of-conduct.html
List Guidelines:
https://fedoraproject.org/wiki/Mailing_list_guidelines
List Archives:
https://lists.fedoraproject.org/archives/list/infrastructure@lists.fedora...