check_mk IPMI PCM sensor reading randomly fails

Posted by Julian Kessel on Server Fault See other posts from Server Fault or by Julian Kessel
Published on 2012-11-16T14:06:34Z Indexed on 2012/11/16 17:02 UTC
Read the original article Hit count: 502

Filed under:
|
|
|

I use check_mk_agent for monitoring a server with IPMI and the freeipmi-tools installed. As far as I can see, the monitoring randomly detects no value returned by the IPMI Sensor "Temperature_PCH_Temp".

That's a problem since it results in a CRITICAL state triggering a notification. The interruption lasts only over one check, the following is always OK. The temperature is in no edge area and neither the readings before the fail nor after show a Temp that is tending to overrun a treshold.

Has someone an idea on what could be the reason for this behaviour and how prevent it?

© Server Fault or respective owner

Related posts about monitoring

Related posts about nagios