# [rrd-users] Re: disabling interpolation?

Alex van den Bogaerdt alex at slot.hollandcasino.nl
Wed Aug 23 01:06:34 MEST 2000

Philip Molter wrote:
>
> On Tue, Aug 22, 2000 at 06:46:29PM +0200, Alex van den Bogaerdt wrote:
> : Right.  As I said: don't mess with the averages.  However, it is safe
> : to say the minimum amount of people in that room is 4, not 4.76 and
> : the maximum amount of people in that room was 5, not 4.76 .
> : It makes no sense to say that during this interval there were 4.76
> : people in the room *unless* you are talking about averages.
>
> Wrong.  You can't say that at all.  You could very easily say that
> the minimum amount of people in the room as 0 and the maxium amount
> was 12, and I could still give you a scenario where the average as
> 4.76

Ack, this can happen.  Also, you never know for sure that it did or did
not happen (with our way of monitoring).  At least I *can* be sure that
the *actual* number of people (and *not the average*) was not 4.76  This
is what the discussion is about:  is it proper to first average and then
use max/min on it?

Averaging the value first also introduces an error.  Assume the following
number of people:

12:00  12 (for the sake of this example this number doesn't change until...)
12:05  12
12:06   1
12:07  10
12:08  34
12:09   9
12:10  10
12:11  15
12:12  14

Samples are taken at times 12:02, 12:07 and 12:12.
Calculated averages (not verified, doesn't matter for this discussion):
12:00..12:05    12*2/5 +  1*3/5 =  5.4
12:05..12:10    10*2/5 + 14*3/5 = 12.4

If the proposed data type would be available:
12:00..12:05    12
12:05..12:10    10

Explain to me why 5.4 is better than 12 ?
Then again, min(12:05..12:10) should be 1 and the proposed DST cannot
do this either.

Increasing the frequency of your polling will increase the chance that
you happen to sample at the correct time.  It does *not* improve
accuracy.  Depending on the time when I run the script I get either
10 (@12:07) or 34(@12:08) as input.  Granted, I get both values if I
sample every minute.  No problem ...
12:07:00  value 10
12:07:01  value 99
12:07:02  value  0
... and then this argument is not valid.

Clearly, in the concept of RRDtool the assumption is made that gauges
do not differ this much (or perhaps: the user should make this assumption
or do not use RRDtool).  If the data does not change a lot, the original
example numbers (4..5) are more likely than yours (0..12).  If it is okay
to make this assumption then:

The example used numbers 4 and 5.  It may or may not be correct to assume
4 and 5 were the minimum and maximum.  I agree on this.  However, if I
sample those two values, and use min/max functions on them, I would at
least keep them to 4 or 5.  I never got 4.76 and therefore I do not want
to see 4.76 ...

As discussed elsewhere on the list: *I* know how to make this happen.
It's just that I think a computer program should be doing the job, not
a human.

regards,
--
__________________________________________________________________
/ alex at slot.hollandcasino.nl                  alex at ergens.op.het.net \
| work                                                         private |
| My employer is capable of speaking therefore I speak only for myself |
+----------------------------------------------------------------------+
| http://faq.mrtg.org/                                                 |
| http://rrdtool.eu.org  --> tutorial                                  |
+----------------------------------------------------------------------+

--
Unsubscribe mailto:rrd-users-request at list.ee.ethz.ch?subject=unsubscribe
Help        mailto:rrd-users-request at list.ee.ethz.ch?subject=help
Archive     http://www.ee.ethz.ch/~slist/rrd-users