[collectd] Intel S5500BC: collectd ipmi plugin and IPMI error 0xff

Sergey a_s_y at sama.ru
Wed Mar 20 15:39:45 CET 2019


Hello.

I have a problem with one (hardware) system under Linux.
Sometime some sensors returns an error 0xff:

Mar 19 07:48:25 collectd[28450]: ipmi plugin: sensor_read_handler: Removing sensor Processor 1 FAN fan_cooling (29.2), because it failed with IPMI error 0xff.
Mar 19 13:39:45 collectd[28450]: ipmi plugin: sensor_read_handler: Removing sensor IOH Therm Margin system_board (7.20), because it failed with IPMI error 0xff.
Mar 20 14:08:25 collectd[28450]: ipmi plugin: sensor_read_handler: Removing sensor BB +3.3V system_board (7.12), because it failed with IPMI error 0xff.

Data isn't collected after this. Reloading collectd helps. 
I can parse a logs and reloading collectd by this case.
But can I do it  through collectd maybe ? 

More question. Is this is a problem in ipmi plugin may be?
This continues for many years on a single server. Different
kernels were used during this time. 4.4.68 is used now.
collectd is 5.7.2. Hardware: Intel S5500BC mainboard,
BIOS S5500.86B.01.00.0060.090920111354 (09/09/2011).
I don't have a similar mainboard for compare unfortunately.

-- 
Regards, Sergey



More information about the collectd mailing list