I would like to plot my Nagios availability reports over time.
Does anyone know a plugin that would do so?
I found 'graphios' at https://github.com/shawn-sterling/graphios that would plot extra data provided by plugins.
What I need instead is a plugin that would plot information such as: 'the service was in ERROR state 0.5% of the time last…
Is it possible to configure Nagios to group notifications into a single e-mail? Sometimes when something goes down, my inbox gets spammed with all the notifications. It would be nice if these could somehow be lumped together. Does anyone know if this is possible?
It's a big problem for me, because I'm not familiar with Puppet.
ERROR on the puppetmaster:
debug: importing '/etc/puppet/manifests/nodes/group-1.pp'
err: Could not parse for environment production: Syntax error at '{'; expected '}' at /etc/puppet/manifests/nodes/group-1.pp:6
ERROR on the puppet client:
err: Could not retrieve catalog from…
Most of our servers are licensed for 2 concurrent remote desktop sessions. This is fine, so long as everyone does their administrative task and logs off, but some people accidentally close sessions (disconnect but remain logged in) instead.
I know that you can force someone off with the right Admin tools, but it's a bit ugly and may hurt…
We run some Nagios service checks via OpsView, and one of our hosts is getting a strange response for SSH:
"UNKNOWN: Service results are stale"
It happens regularly, but seems to go away as the system retries a 2nd and 3rd time. It started after a patch and reboot of the server in question last week. The system itself responds to SSH…
Lately, my Nagios 3.2.3 install (CentOS 5, monitoring ~300 hosts, 1150 services) has started to occasionally report high packet loss on 50-60 hosts at a time. The problem is that it's bogus: manual runs of ping (or Nagios's own check_ping binary) find no fault with any of the affected hosts. The only possible cures I have found so far are:
run all the…
Can anyone let me know how I would reduce the time between Last Check Time and Next Scheduled Check on a particular service? I have a very critical task to monitor, and the time between checks is currently 5 minutes, which is too long for this service. Can I reduce that time? I need this to be 1 minute or even 30 seconds.
I want Nagios to…
I have configured Nagios SMS alerts, and it takes around one minute to send a notification. I want to get the SMS notification within one or two seconds of a system/service failure. I could not find any way to send an SMS alert that quickly. Can anybody help me?
Update Wednesday, 29 August 9:26:43 a.m GMT
define host{
use …
I installed Nagios a very long time ago, and have only now started trying to use it. I am getting this error:
Current Status: CRITICAL (for 231d 16h 52m 49s)
Status Information: SWAP CRITICAL - 100% free (0 MB out of 0 MB)
Performance Data: swap=0MB;0;0;0;0
Current Attempt: 4/4 (HARD state)
Last Check Time: 01-09-2011…
I have a number of remote sites which have VNC running on a few computers for support purposes. They are (obviously) only available on our internal network.
I am using Nagios to keep track of all the systems in the network and I want to have it check to make sure the VNC server is running on the appropriate hosts.
There…
I am using Icinga (Nagios fork) for monitoring ~10 webservers, each one providing different services.
I would now like to provide an aggregated view of the server states on our company's intranet, providing information like:
server | state | last downtime | Ø uptime (month) | Ø uptime (year)
Srv1 | OK | 2013-10-09 |…
We are using Nagios to monitor our network with great results. There is now a new requirement we are struggling with:
We want to notify Nagios of non-fatal but critical application errors. The application does not stop running, but there is some sort of issue that needs looking into.
Once the issue has been looked into,…
I am running nagios2, pnp4nagios-0.6.16 and php 5.2.4-2ubuntu5.19.
In my setup, pnp4nagios is correctly generating perfdata, which can be seen via the web interface in graphical form for lots of services.
The perfdata directory contains entries of the kind:
/usr/local/pnp4nagios/var/perfdata/zeus/Disk_Space_Home.rrd…
So my setup:
Services are shared between all hosts (CPU/RAM/Disk/Services).
Hosts are split into two main groups: "Production" and "Development".
We have two contact groups: "Production" and "Development".
Let's say my development SQL server runs low on RAM. I want it to only alert those in "Development" contact…
I'm just curious: let's say I made a definition for a host and specified that it should check/notify during a certain time period, 10-18, yet in the service I said 24/7. Which takes priority? Would I get alerts 24/7, or would it fall under the host's rules and be 10-18?
I'm trying to use this command to check on port 587 for my postfix server.
Using nmap -P0 mail.server.com I see this:
Starting Nmap 5.51 ( http://nmap.org ) at 2013-11-04 05:01 PST
Nmap scan report for mail.server.com (xx.xx.xx.xx)
Host is up (0.0016s latency).
rDNS record for xx.xx.xx.xx: another.server.com
Not…
I am using Shinken for my monitoring system, and I have a problem configuring Shinken notifications. My goal is to distinguish between notifications for the warning state and the critical state of a service check:
with warning state:
+ time to send alerts: from 8h to 18h every day, via email and SMS
+…
I'm just barely getting into programming so I do apologize for my ignorance. I'm trying to create a .bat file that will check if a service is running on XP Pro.
If service is running it will exit 0.
If the service is stopped, start the service
wait 10 seconds (via ping, I'm guessing)
check if service is…
Hi,
I want to generate a custom report in "Nagios-3.2.0". I have defined the work-hours in "timeperiods.cfg" as follows:
'workhours' timeperiod definition
define timeperiod {
timeperiod_name 0800-2000
alias full time
monday 08:00-20:00
tuesday 08:00-20:00
wednesday 08:00-20:00 …
I use Amazon EC2 for my mobile app. Depending on load of the application at a given time, I might spawn new instances and then take them down when load is lower to save costs.
How does one keep up with Nagios configurations for such a dynamic environment? When one deals with managed hardware,…
I want to install these packages for NagVis:
graphviz-2.28.0-1.el6.i686.rpm
graphviz-doc-2.28.0-1.el6.i686.rpm
graphviz-gd-2.28.0-1.el6.i686.rpm
graphviz-graphs-2.28.0-1.el6.i686.rpm
graphviz-perl-2.28.0-1.el6.i686.rpm
But while installing, I get this error:
# rpm -ivh…
Is there a way to view the data that Centreon uses to build graphs, from within the Centreon web interface?
We have some gaps in some of our graphs, and I would like to see if it is a problem with the data being returned from the NRPE plugins.
I have seen the…
I'd like to parse status.dat file for nagios3 and output as xml with a python script.
The xml part is the easy one but how do I go about parsing the file? Use multi line regex?
It's possible the file will be large as many hosts and services are monitored, will…
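One way to approach the parsing question without a multi-line regex is a small line-by-line state machine, which also keeps memory use flat for a large file. The sketch below assumes status.dat consists of blocks of the form `blocktype {` followed by `key=value` lines and a closing `}`; the function name `parse_status` and the sample data are made up for illustration, so check them against a real file before relying on this.

```python
import re
from typing import Dict, Iterable, Iterator, Tuple

# Assumed block header shape, e.g. "hoststatus {" or "servicestatus {".
BLOCK_START = re.compile(r"^(\w+)\s*\{$")


def parse_status(lines: Iterable[str]) -> Iterator[Tuple[str, Dict[str, str]]]:
    """Yield (block_type, fields) pairs, reading one line at a time."""
    block_type = None
    fields: Dict[str, str] = {}
    for raw in lines:
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        if block_type is None:
            match = BLOCK_START.match(line)
            if match:
                block_type = match.group(1)
                fields = {}
        elif line == "}":
            yield block_type, fields
            block_type = None
        else:
            # Split on the first "=" only, since values may contain "=".
            key, sep, value = line.partition("=")
            if sep:
                fields[key] = value


# Hypothetical sample data in the assumed format.
SAMPLE = """\
hoststatus {
\thost_name=web1
\tcurrent_state=0
\tplugin_output=PING OK - rta=0.5ms
\t}
servicestatus {
\thost_name=web1
\tservice_description=SSH
\tcurrent_state=2
\t}
"""

if __name__ == "__main__":
    for kind, data in parse_status(SAMPLE.splitlines()):
        print(kind, data.get("host_name"), data.get("current_state"))
```

Because `parse_status` is a generator, an open file object can be passed in directly (`parse_status(open(path))`), and each block can be converted to an XML element as it is yielded.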
I have nagios alerts set up to come through jabber with an http link to ack.
Is it possible that there is a script I can run from a terminal on a remote workstation that takes the hostname as a parameter and acks the alert?
./ack hostname
The benefit, while…
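A minimal sketch of such an `ack` script, assuming it ends up running on the Nagios server itself (for example, invoked over SSH from the workstation) and that external commands are enabled. It writes an `ACKNOWLEDGE_HOST_PROBLEM` external command to the Nagios command file. The command file path, the default author and comment, and the flag values chosen here are assumptions to be checked against the local installation and the external command documentation.

```python
import sys
import time

# Assumed location of the Nagios external command file; it varies by install.
COMMAND_FILE = "/usr/local/nagios/var/rw/nagios.cmd"


def build_ack(host, author="admin", comment="Acknowledged from terminal", now=None):
    """Build an ACKNOWLEDGE_HOST_PROBLEM external command line.

    Field order: host;sticky;notify;persistent;author;comment.
    The 1;1;1 flag values are illustrative defaults, not a recommendation.
    """
    timestamp = int(time.time() if now is None else now)
    return f"[{timestamp}] ACKNOWLEDGE_HOST_PROBLEM;{host};1;1;1;{author};{comment}\n"


def main(argv):
    if len(argv) != 2:
        print("usage: ack hostname", file=sys.stderr)
        return 2
    # The command file is a named pipe that Nagios reads commands from.
    with open(COMMAND_FILE, "w") as pipe:
        pipe.write(build_ack(argv[1]))
    return 0


if __name__ == "__main__":
    sys.exit(main(sys.argv))
```

Acknowledging a service problem instead would use `ACKNOWLEDGE_SVC_PROBLEM`, which takes the service description as an extra field after the host name.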