[mrtg] Weird results using MRTG on multi-CPU Windows 2000 servers
Michael W. Lucas
mwlucas at blackhelicopters.org
Mon Dec 8 16:56:18 MET 2003
Hello,
I'm having poor results using MRTG to monitor CPU usage on Windows
2000 servers. The setup includes:
* mrtg-2.9.18.p11 on FreeBSD (installed via ports)
* Multiple Windows 2000 servers with the SNMP4tPC "expansion" -- this
whole technique is unsupported by Microsoft, but there doesn't seem to
be another choice. Sigh.
When MRTG runs, it frequently reports one of the two CPUs as being at
100% user capacity. This does not match either what an individual
snmpwalk says, or what Performance Manager says. For example, here's
a log snippet from a very new install:
1070896963 100 0
1070896963 100 0 100 0
1070896831 0 0 0 0
1070896800 0 0 0 0
1070896500 40 0 100 0
1070896200 100 0 100 0
1070895900 100 0 100 0
1070895600 100 0 100 0
1070895300 100 0 100 0
1070895000 38 0 100 0
1070894700 100 0 100 0
....
But snmpwalk gives me:
# snmpwalk server community .1.3.6.1.4.1.311.1.1.3.1.1.2.1.4.1.49
enterprises.311.1.1.3.1.1.2.1.4.1.49 = 0
#
Odd things I've noticed:
This effect only happens on one CPU, not both.
When I removed the log of the obviously-erroneous CPU and let MRTG
start over, the graph for the other CPU started climbing and quickly
reached 100% user CPU.
I'm seeing this on several different multi-CPU Windows 2000 machines.
Here's an MRTG config for a single CPU; the second CPU is the same, with
(of course) the next MIB.
Target[webserver3.cpu2]: 1.3.6.1.4.1.311.1.1.3.1.1.2.1.4.1.49&1.3.6.1.4.1.311.1.1.3.1.1.2.1.4.5.49:community@webserver3
Title[webserver3.cpu2]: webserver3 cpu 2 usage
Options[webserver3.cpu2]: growright, gauge
MaxBytes[webserver3.cpu2]: 100
YLegend[webserver3.cpu2]: Percent (%)
WithPeak[webserver3.cpu2]: wmy
LegendI[webserver3.cpu2]: User Time: 
LegendO[webserver3.cpu2]: Privileged Time: 
Legend1[webserver3.cpu2]: % cpu 2 User Time
Legend2[webserver3.cpu2]: % cpu 2 Privileged Time
Legend3[webserver3.cpu2]: Maximal 5 Minute % cpu 2 User Time
Legend4[webserver3.cpu2]: Maximal 5 Minute % cpu 2 Privileged Time
ShortLegend[webserver3.cpu2]: %
PageTop[webserver3.cpu2]: <H1>cpu 2 usage for webserver3 </H1>
Any thoughts? If I can get this on multiple machines, surely someone
else has seen it before?
On a related note, this machine now seems to recognize the host mib
for generic processor usage, as shown below. Is this a better choice
to monitor CPU usage, or is the difference irrelevant?
# snmpwalk webserver3 community .1.3.6.1.2.1.25.3.3.1.2
host.hrDevice.hrProcessorTable.hrProcessorEntry.hrProcessorLoad.1 = 1
host.hrDevice.hrProcessorTable.hrProcessorEntry.hrProcessorLoad.2 = 0
#
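For reference, a target built on those two host MIB OIDs might look
something like the sketch below. It is untested: the target name
"webserver3.hrcpu" and the legends are placeholders, and it plots CPU 1
and CPU 2 as the two lines of one graph instead of user and privileged
time, since hrProcessorLoad only reports a single load figure per CPU.

```
# Hypothetical target name; OIDs are the two hrProcessorLoad instances
# returned by the snmpwalk above (.1 and .2).
Target[webserver3.hrcpu]: 1.3.6.1.2.1.25.3.3.1.2.1&1.3.6.1.2.1.25.3.3.1.2.2:community@webserver3
Title[webserver3.hrcpu]: webserver3 processor load
# hrProcessorLoad is a percentage, not a counter, so treat it as a gauge.
Options[webserver3.hrcpu]: growright, gauge
MaxBytes[webserver3.hrcpu]: 100
YLegend[webserver3.hrcpu]: Percent (%)
LegendI[webserver3.hrcpu]: CPU 1 Load:
LegendO[webserver3.hrcpu]: CPU 2 Load:
ShortLegend[webserver3.hrcpu]: %
PageTop[webserver3.hrcpu]: <H1>processor load for webserver3 </H1>
```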
Any help appreciated.
Thanks,
==ml
--
Michael Lucas mwlucas at FreeBSD.org, mwlucas at BlackHelicopters.org
Today's chance of throwing it all away to start a goat farm: 41.8%
http://www.BlackHelicopters.org/~mwlucas/
Absolute OpenBSD: http://www.AbsoluteOpenBSD.com/