check_mk IPMI PCM sensor reading randomly fails
- by Julian Kessel
I use check_mk_agent for monitoring a server with IPMI and the freeipmi-tools installed.
As far as I can see, the monitoring randomly detects no value returned by the IPMI Sensor "Temperature_PCH_Temp".
That's a problem since it results in a CRITICAL state triggering a notification. The interruption lasts only over one check, the following is always OK.
The temperature is in no edge area and neither the readings before the fail nor after show a Temp that is tending to overrun a treshold.
Has someone an idea on what could be the reason for this behaviour and how prevent it?