[collectd] Bug#657877: collectd: Does not try to reconnect when rrd daemon dies

Sebastian Harl tokkee at debian.org
Thu Aug 20 11:40:14 CEST 2015


Hi,

On Sun, Jan 29, 2012 at 04:20:14PM +0100, Matthias Urlichs wrote:
> When collectd logs via rrdcached, and rrdcached is restarted, 
> collectd spews errors instead of trying to reconnect.

Thanks for reporting this and sorry for the slow reply :-/

This issue is related to how the rrdcached client library works. It only
supports a single connection and caches that in a global variable.
collectd calls rrdc_connect on each write but the function will simple
return success if it succeeded previously.

I think what we'll have to do is
(a) in collectd, close the connection on error and, thus, trigger an
    actual reconnect; possibly use exponential back-off
(b) improve the RRDtool API ;-)

Cheers,
Sebastian

-- 
Sebastian "tokkee" Harl +++ GnuPG-ID: 0x2F1FFCC7 +++ http://tokkee.org/

Those who would give up Essential Liberty to purchase a little Temporary
Safety, deserve neither Liberty nor Safety.         -- Benjamin Franklin

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 836 bytes
Desc: Digital signature
URL: <http://mailman.verplant.org/pipermail/collectd/attachments/20150820/7d540922/attachment-0001.sig>


More information about the collectd mailing list