[smokeping-users] Small dips in graphs on distributed smokeping installation
Vidar.Stokkenes at hn-ikt.no
Fri May 14 10:47:48 CEST 2010
I am having slight problems with my Smokeping installation. I have about 6 smokeping slaves who are mainly (for the moment) querying 11 different routers using the CiscoRTTMonEchoICMP probe to monitor cross-WAN latency. However, on a few boxes (mainly those that have the most probes to query) I get similar errors to this one:
Apr 30 04:38:03 <hostname> smokeping: CiscoRTTMonEchoICMP: WARNING: smokeping took 633 seconds to complete 1 round of polling. It should complete polling in 300 seconds. You may have unresponsive devices in your setup.
I can also see a short "dip" in the graphs for that specific location using that specific slave on my CGI interface.
As far as my troubleshooting go, I don't have any unresponsive devices in my setup, but I can imagine that probing 11 different devices with 20 SNMP pings can take some time. Would you advise me to increase my "global" step to increase the time it expects the job to complete? Or do you have any better ideas what to do?
Any help would be appreciated!
More information about the smokeping-users