[smokeping-users] Re: Scalability

Tobias Oetiker oetiker at ee.ethz.ch
Sat Jun 22 07:44:15 MEST 2002



Today Dan Tucny wrote:

> fping -C 20 -q -s -i 12.5 <40 reachable, 40 unreachable hosts>

pelase note, that some fpings seem to regard the -i parameter as
miliseconds and others as 0.1 miliseconds ...

cheers
tobi

>
>       80 targets
>       40 alive
>       40 unreachable
>        0 unknown addresses
>
>        0 timeouts (waiting for response)
>     1600 ICMP Echos sent
>      800 ICMP Echo Replies received
>       36 other ICMP received
>
>  0.35 ms (min round trip time)
>  0.42 ms (avg round trip time)
>  1.65 ms (max round trip time)
>        32.863 sec (elapsed real time)
>
> fping -C 20 -q -s -i 12.5 <80 reachable hosts>
>
>       80 targets
>       80 alive
>        0 unreachable
>        0 unknown addresses
>
>        0 timeouts (waiting for response)
>     1600 ICMP Echos sent
>     1600 ICMP Echo Replies received
>        0 other ICMP received
>
>  0.36 ms (min round trip time)
>  0.41 ms (avg round trip time)
>  2.05 ms (max round trip time)
>        32.866 sec (elapsed real time)
>
> fping -C 20 -q -s -i 12.5 <80 unreachable hosts>
>
>       80 targets
>        0 alive
>       80 unreachable
>        0 unknown addresses
>
>        0 timeouts (waiting for response)
>     1600 ICMP Echos sent
>        0 ICMP Echo Replies received
>       62 other ICMP received
>
>  0.00 ms (min round trip time)
>  0.00 ms (avg round trip time)
>  0.00 ms (max round trip time)
>        32.958 sec (elapsed real time)
>
> The debug output you have below is due to fping always returning errors,
> even when running -q, this shouldn't affect the runtime of fping itself
> though.
>
> This is of course all purely looking at fping, there is of course also
> the time taken for Smokeping to process these results to be taken into
> consideration though I don't have any timings for that...
>
> I hope this is helpful to you...
>
> Dan
>
> -----Original Message-----
> From: smokeping-users-bounce at list.ee.ethz.ch
> [mailto:smokeping-users-bounce at list.ee.ethz.ch] On Behalf Of Marc Powell
> Sent: 20 June 2002 01:30
> To: Tobias Oetiker
> Cc: Smokeping
> Subject: [smokeping-users] Re: Scalability
>
> Sure thing. Here is what I have done, I created a test smokeping binary
> that points to my original config file with 546 hosts on this particular
> data collector. I ran it with -debug and -nodaemon (I think debug
> implies nodaemon, but I wanted to cover all bases).
>
> # [smokep at dc2 ~/bin]date ; ./smokeping.test -debug -nodaemon ; date
>
> Wed Jun 19 19:20:10 CDT 2002
> ### fping seems to report in 1 miliseconds
> Launched successfully
> FPing: probing 546 targets
> Wed Jun 19 19:28:11 CDT 2002
>
> This 8 minute duration seems to be fairly consistent, at least right now
> ;)
>
> Here's a snippet of truss about 4 minutes into the run:
>
> [smokep at dc2 ~]$ date
> Wed Jun 19 19:23:49 CDT 2002
> [smokep at dc2 ~]$ truss -fea -p 14837
> 14837:  psargs: /usr/local/bin/perl -w ./smokeping.test -debug -nodaemon
> 14837:  read(7, " I C M P   T i m e   E x".., 5120)     = 70
> 14837:  read(7, 0x004E380C, 5120)       (sleeping...)
> 14837:  read(7, " I C M P   T i m e   E x".., 5120)     = 69
> 14837:  read(7, " I C M P   T i m e   E x".., 5120)     = 70
> 14837:  read(7, 0x004E380C, 5120)       (sleeping...)
> 14837:  read(7, " I C M P   T i m e   E x".., 5120)     = 70
> 14837:  read(7, " I C M P   T i m e   E x".., 5120)     = 74
> 14837:  read(7, " I C M P   T i m e   E x".., 5120)     = 70
> 14837:  read(7, " I C M P   T i m e   E x".., 5120)     = 75
> 14837:  read(7, 0x004E380C, 5120)       (sleeping...)
> 14837:  read(7, " I C M P   T i m e   E x".., 5120)     = 70
> 14837:  read(7, 0x004E380C, 5120)       (sleeping...)
> 14837:  read(7, " I C M P   T i m e   E x".., 5120)     = 18
> 14837:  read(7, "   f r o m  ", 5120)                   = 6
> 14837:  read(7, " 1 0 . 5 5 . 0 . 1 1", 5120)           = 10
> 14837:  read(7, "   f o r   I C M P   E c".., 5120)     = 23
> 14837:  read(7, " 1 7 2 . 3 1 . 5 6 . 2", 5120)         = 11
> 14837:  read(7, "\n", 5120)                             = 1
> 14837:  read(7, 0x004E380C, 5120)       (sleeping...)
> 14837:  read(7, " I C M P   T i m e   E x".., 5120)     = 70
> 14837:  read(7, 0x004E380C, 5120)       (sleeping...)
> 14837:  read(7, " I C M P   T i m e   E x".., 5120)     = 70
> 14837:  read(7, " I C M P   T i m e   E x".., 5120)     = 74
> 14837:  read(7, " I C M P   T i m e   E x".., 5120)     = 70
> 14837:  read(7, " I C M P   T i m e   E x".., 5120)     = 75
> 14837:  read(7, 0x004E380C, 5120)       (sleeping...)
> 14837:  read(7, " I C M P   T i m e   E x".., 5120)     = 70
> 14837:  read(7, 0x004E380C, 5120)       (sleeping...)
> 14837:  read(7, " I C M P   T i m e   E x".., 5120)     = 69
> 14837:  read(7, " I C M P   T i m e   E x".., 5120)     = 70
> 14837:  read(7, 0x004E380C, 5120)       (sleeping...)
> ^C[smokep at dc2 ~]$ date
> Wed Jun 19 19:24:44 CDT 2002
>
> If there is anything else that I can provide that would be of
> assistance, please don't hesitate to let me know.
>
> Thanks,
>
> Marc
>
> 	-----Original Message-----
> 	From: Tobias Oetiker [mailto:oetiker at ee.ethz.ch]
> 	Sent: Wed 6/19/2002 5:26 PM
> 	To: Marc Powell
> 	Cc: Smokeping
> 	Subject: Re: [smokeping-users] Re: Scalability
>
>
>
> 	Yesterday Marc Powell wrote:
>
> 	> The only major problem I am having is that I see gaps in the
> graphs
> 	> (10-15 minutes) for those regions with relatively high numbers
> of hosts
> 	> down (20-30). We're monitoring schools so it's the off season
> here in
> 	> the US and the routers fluctuate depending on what maintenance
> is going
> 	> on at the schools, whether the janitor has spilt his coffee in
> the
> 	> router, etc... I am attributing the gaps to the slower
> response time for
> 	> ICMP UNREACHABLE's from fping, which lengthens the overall
> time it takes
> 	> before smokeping spawns the next run to 10-15 minutes or
> longer. Since
> 	> smokeping appears to wait until the fping process terminates
> before
> 	> writing to the RRDs or spawning the next fping process, the
> gaps are
> 	> appearing for all hosts in a region. To minimize the number of
> hosts
> 	> affected by this problem, I have just implemented unique
> configurations
> 	> for each alphabetical grouping per region so that I can spawn
> a
> 	> smokeping daemon for each grouping as opposed to each region
> (i.e. 5
> 	> smokeping processes per data collector). As a result, I have a
> feature
> 	> request or two to make things easier:
>
> 	try running smokeping by hand, at least in theory it should ping
>
> 	ALL the hosts in your config in parallel. The time it will wait
> for
> 	a 'lost' paket is about 1 second at most so this means in theory
> a
> 	fping run is over in 20 seconds regardless of the number of
> 	machines involved. Now there is a small gap between each icmp
> 	packet sent out from fping, so there is an impact per machine
> but
> 	it should not at all depend on how long the machine has to
> answer
> 	... this after all is the whole motivation behinde fping ...
>
> 	>       1) Add a pidfile directive to either complement or
> replace
> 	> piddir. Currently, it is necessary to either create a
> directory for each
> 	> pid file specifically or remove the pidfile before starting
> the next
> 	> smokeping process.
>
> 	running multiple smokeping processes is not the solution ... if
> 	fping has a bug, we will fix fping ...
>
> 	>       2) The ability to INCLUDE external files within a config
> file.
> 	> This should help cut down on the number of unique files I'm
> having to
> 	> create.
>
> 	this is already there ... check the documentation on
> 	ISG::ParseConfig
>
> 	cheers
> 	tobi
>
> 	--
> 	 ______    __   _
> 	/_  __/_  / /  (_) Oetiker, OETIKER+PARTNER AG, Gallusstrasse 25
>
> 	 / // _ \/ _ \/ / CH-4600 Olten, phoneto:+41(0)62 213 9909
> 	/_/ \.__/_.__/_/ tobi at oetiker.ch http://google.com/search?q=tobi
>
>
>
> --
> Unsubscribe
> mailto:smokeping-users-request at list.ee.ethz.ch?subject=unsubscribe
> Help        mailto:smokeping-users-request at list.ee.ethz.ch?subject=help
> Archive     http://www.ee.ethz.ch/~slist/smokeping-users
> WebAdmin    http://www.ee.ethz.ch/~slist/lsg2.cgi
>
>

-- 
 ______    __   _
/_  __/_  / /  (_) Oetiker, ETZ J97, ETH, 8092 Zurich, Switzerland
 / // _ \/ _ \/ / phoneto:+41(0)1-632-5286  faxto:+41(0)1-632-1517
/_/ \.__/_.__/_/ oetiker at ee.ethz.ch http://google.com/search?q=tobi

--
Unsubscribe mailto:smokeping-users-request at list.ee.ethz.ch?subject=unsubscribe
Help        mailto:smokeping-users-request at list.ee.ethz.ch?subject=help
Archive     http://www.ee.ethz.ch/~slist/smokeping-users
WebAdmin    http://www.ee.ethz.ch/~slist/lsg2.cgi



More information about the smokeping-users mailing list