[mrtg] Re: Graph Pitfalls

jon.hartman at verizon.com jon.hartman at verizon.com
Tue Oct 28 19:15:00 MET 2003


I remain skeptical that MRTG is failing to finish within 5 minutes, for
several reasons:

1) It is always the same log files that are referenced, about 4 out of 180.
2) I am running MRTG on a dual P3-800 MHz box with 512 MB RAM, set to fork
15 processes, and the box does very little else.
3) When I run "top" or "ps aux", any vestige of MRTG disappears
after about 20 seconds.

Let's say there's a counter that runs from 1 to 100 and is currently at 50.
If it drops to 25 in the log file and then drops even lower on the next poll,
would that not rob MRTG of its ability to compensate for counter rollover?
This assumes that it has such an ability, of course... This doesn't make a
whole lot of sense either, as one of the counters referenced doesn't see
much activity. Here is the requested error with the referenced log file
snippet:

Rateup WARNING: /usr/bin/rateup could not read the primary log file for
dfwacerad1_2
Rateup ERROR: /usr/bin/rateup found dfwacerad1_2's log file was corrupt or
not in sorted order:
time: 997747200.
Rateup WARNING: /usr/bin/rateup The backup log file for
dfwacerad1_2 was invalid as well
WARNING: rateup died from Signal 0 with Exit Value 1 when doing router
'dfwacerad1_2' Signal was 0, Returncode was 1

998179200 0 0 0 0
998092800 0 0 0 0
998006400 0 0 0 0
997920000 0 0 0 0
997833600 0 0 0 0
997747200 0 0 0 0
997660800 0 0 0 0
997747200 0 0 0 0
997660800 0 0 0 0

At the time of checking, there was no dfwacerad1_2.log, only
dfwacerad1_2.old. Why would that be?
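
The ordering problem in the snippet above can be located mechanically. The
following is a minimal sketch, not rateup's actual check: it assumes the
history entries are listed newest first with the timestamp in the first
column, and that each entry must be strictly older than the one before it.
The function name find_out_of_order is made up for illustration.

```python
def find_out_of_order(lines):
    """Return (line_number, timestamp) pairs that break newest-first order.

    Assumption: `lines` holds only the history entries, not the counter
    summary line that starts a real MRTG log file.
    """
    problems = []
    previous = None
    for number, line in enumerate(lines, start=1):
        fields = line.split()
        if not fields:
            continue  # ignore blank lines
        timestamp = int(fields[0])
        # An entry that is not older than its predecessor is out of order.
        if previous is not None and timestamp >= previous:
            problems.append((number, timestamp))
        previous = timestamp
    return problems


# The tail of the log file quoted above, newest entry first.
snippet = """\
998179200 0 0 0 0
998092800 0 0 0 0
998006400 0 0 0 0
997920000 0 0 0 0
997833600 0 0 0 0
997747200 0 0 0 0
997660800 0 0 0 0
997747200 0 0 0 0
997660800 0 0 0 0
""".splitlines()

for number, timestamp in find_out_of_order(snippet):
    print(f"line {number}: {timestamp} is not older than the previous entry")
```

On this snippet the check flags the second occurrence of 997747200, the same
time index that rateup reports: the last two entries repeat the two before
them.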

Thanks in advance,

-Jon Hartman


-----Original Message-----
From: matt at petach.org [mailto:matt at petach.org] 
Sent: Monday, October 27, 2003 3:56 PM
To: Jonathan M. Hartman
Cc: mrtg at list.ee.ethz.ch
Subject: Re: [mrtg] Graph Pitfalls


> I'm getting some errors that are really skewing my graphs. I'm
> measuring the interface traffic on an Alteon 180e and during the
> busier times of day, I'm presented with the following:
> Rateup WARNING: /usr/bin/rateup could not read the primary log file for
> dfwaceres1_9
> Rateup ERROR: /usr/bin/rateup found alteon1_9's log file was corrupt or
> not in sorted order: time: 1060011000.


Can you include the snippet of your logfile at that time index? I suspect
once you search for that time index, the nature of the corruption will
become evident.  :)

Matt

> Rateup WARNING: /usr/bin/rateup The backup log file for alteon1_9 was 
> invalid as well
> WARNING: rateup died from Signal 0 with Exit Value 1 when doing router 
> 'alteon1_9' Signal was 0, Returncode was 1
> 
> What causes this sort of thing? Is it a question of the counters
> flipping

This is almost inevitably caused by two MRTG processes trying to write to
the same file; generally, it means that you're running from cron, and the
update cycle is taking long enough that the next instance begins running
before the previous instance has completed.
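
One common way to rule out overlapping runs is to have cron call a small
wrapper that takes an exclusive lock before starting the poller. The sketch
below is only an illustration of that idea, not part of MRTG: it assumes a
Unix system (it uses fcntl.flock), and the function name run_exclusive, the
lock path, and the command are all placeholders to be supplied by the caller.

```python
import fcntl
import subprocess
import sys


def run_exclusive(lock_path, command):
    """Run `command` only if nobody else currently holds `lock_path`.

    Returns the command's exit code, or None when the lock is already
    held, which means a previous run is still in progress.
    """
    with open(lock_path, "w") as lock_file:
        try:
            # Non-blocking exclusive lock: fail at once instead of queueing.
            fcntl.flock(lock_file, fcntl.LOCK_EX | fcntl.LOCK_NB)
        except BlockingIOError:
            return None
        # The lock is dropped when lock_file is closed at the end of `with`.
        return subprocess.call(command)
```

A cron job would then call run_exclusive with a lock file of its choosing
and the usual mrtg command line, and log or skip the cycle when it returns
None. A skipped cycle leaves a gap in the graph, but it cannot interleave
writes into the same log file.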

Matt

> over, hence MRTG assuming the logfile is invalid? Here's a sample from
> the log file:
>
> 1067287202 646758083 3036935658
> 1067287202 3577947 13147772 3577947 13147772
> 1067286902 3577947 13147772 3577947 13147772
> 1067286900 3578046 13147772 3592941 13147772
> 1067286600 3591516 13147772 3592941 13147772
> 1067286300 3400999 13148829 4469388 13200632
> 1067286000 4469388 13200632 4469388 13200632
> 1067285700 4459151 13200632 4469388 13200632
> 1067285400 2936727 13200632 3347950 13200632
> 1067285100 3345278 13200632 3347950 13200632
> 1067284800 2947211 13200632 2947211 13200632
> 1067284500 2947809 13200632 3036923 13200632
> 1067284200 3037810 13200632 3170010 13200632
> 1067283900 3169568 13200632 3170010 13200632
> 
> Thanks in advance,
>
> Jon Hartman <mailto:jon.hartman at verizon.com>
> Network Engineering <http://www.vzlink.com/>
> Verizon Online <http://www.verizon.net/>
> Work:   214-513-6792
> Cell:   940-453-1111
>
> -- Attached file removed by Ecartis and put at URL below --
> -- Type: image/jpeg
> -- Size: 4k (4796 bytes)
> -- URL : http://www.ee.ethz.ch/~slist/p/03-image001.jpg
> 
> 
> --
> Unsubscribe mailto:mrtg-request at list.ee.ethz.ch?subject=unsubscribe
> Archive     http://www.ee.ethz.ch/~slist/mrtg
> FAQ         http://faq.mrtg.org    Homepage     http://www.mrtg.org
> WebAdmin    http://www.ee.ethz.ch/~slist/lsg2.cgi
> 
> 





