How much downtime do we afford for nagios?
Mike McGrath
mmcgrath at redhat.com
Sun Apr 27 17:16:13 UTC 2008
On Sun, 27 Apr 2008, susmit shannigrahi wrote:
> > > So if a service or host is unreachable for 3 or 4 mins, we get a
> > > notification. (However most of the cases it is false positive, due to
> > > congestion or others).
> > Looking through my email, from what I can recall there are no false
> > positives. xen6 had to be power-cycled which caused all the other
> > collateral notifications.
>
>
> How long was it down? Why should a normal reboot will send 23 mails?
> Reboot is not any exceptional thing. Is it?
> An alert should be when its absolutely necessary...
> it should report only when xen6 comes up but a service does not come up..
> What do you think?
> Thanks.
>
A normal reboot shouldn't, but when its in a hung state, it takes a while
before people can get to it.
-Mike
More information about the infrastructure
mailing list