How much downtime do we afford for nagios?

Mike McGrath mmcgrath at redhat.com
Sun Apr 27 17:16:13 UTC 2008


On Sun, 27 Apr 2008, susmit shannigrahi wrote:

> >  > So if a service or host is unreachable for 3 or 4 mins, we get a
> >  > notification. (However most of the cases it is false positive, due to
> >  > congestion or others).
> >  Looking through my email, from what I can recall there are no false
> >  positives.  xen6 had to be power-cycled which caused all the other
> >  collateral notifications.
>
>
> How long was it down?  Why should a normal reboot will send 23 mails?
> Reboot is not any exceptional thing. Is it?
> An alert should be when its absolutely necessary...
> it should report only  when xen6 comes up but a service does not come up..
> What do you think?
> Thanks.
>


A normal reboot shouldn't, but when its in a hung state, it takes a while
before people can get to it.

	-Mike




More information about the infrastructure mailing list