IWETHEY v. 0.3.0 | TODO
1,095 registered users | 0 active users | 0 LpH | Statistics
Login | Create New User
IWETHEY Banner

Welcome to IWETHEY!

New re: "Proof"
There's a lot of rigorous mathematics behind it.

As far as I know, there is no mathematical proof that 2 delivers the most accurate result. Unlike OO, proofs either exist or don't exist in math. No fuzzy grey areas (unless perhaps "accurate" is subjective; CRC and I went round and round on this). I asked math newsgroups about such a proof once, and got no definitive answers.

I agree that a lot of math uses the technique of squaring, but that does not necessarily make it the best. Like I said, squaring seems to make the computations simpler. But simple computations and accuracy are not necessarily the same thing.

________________
oop.ismad.com
New I'll give a brief rundown...
but I don't think it'll address your criticism.

As far as I know, there is no mathematical proof that 2 delivers the most accurate result. Unlike OO, proofs either exist or don't exist in math. No fuzzy grey areas (unless perhaps "accurate" is subjective; CRC and I went round and round on this). I asked math newsgroups about such a proof once, and got no definitive answers.

There's a lot of background mathematics that you need to know to understand why the method of least squares is the most common method to best approximate a curve.

I learned this about 24 years ago from - "An Introduction to Linear Analysis" by Kreider, Kuller, Ostberg and Perkins, ISBN 0-201-03949-4. pp 290-294. It's a good text.

My HTML knowledge isn't good enough to enter the equations, but I'll try to summarize things.

First, consider an experiment repeated n times which measures a value of c, say the specific gravity of a substance. These n experiments give values x1, x2, ..., xn. Since there are experimental errors, we expect that none of these values will be exactly equal to the true value c. So how do we find the best approximation for c from these measured values?

The way to do so is to view the n experimental values as n components of a vector X = (x1, x2, ... xn) in an n-dimensional space Rn. Similarly, the true value c can be thought of as an n-dimensional vector cY = (c, c, ...c) where Y = (1, 1, ... 1).

It then follows that the best approximation, c'Y, is determined by "the requirement that c'Y be the perpendicular projection of X onto the one-dimensional subspace of Rn spanned by Y. But as we saw in the preceeding section, this projection is:

Y (X dot Y) / (Y dot Y) [where dot indicates the dot product of 2 vectors]

so that c' = (X dot Y) / (Y dot Y) = (x1 + x2 + ... +xn) / n

This, of course, is none other than the arithmetic average of the xi, and we now have a theoretical interpretation of the popular practice of averaging separate (independent) measurements of the same quantity."

This same analysis can be extended to curves. In doing so, we still end up taking the perpendicular projection and dot products and thus we end up minimizing the sum of the squares of the difference between the measured and the true value.

So it just falls out of the mathematics. It's not arbitrary.

There are other, less common, methods for curve fitting and minimizing errors. It depends on how the "error" is defined that determines which method is best. In the vast majority of cases, least-squares is most appropriate.

If you and CRC went round and round about this, I don't think I'll convince you, but I thought I'd try to answer. This is about all I have to say on the subject.

HTH.

Cheers,
Scott.
New Ouch. I'll have to reread it several times. My math is rusty
Can we stick with 2 dimensions for now?
________________
oop.ismad.com
New There is
It's what's known as a "variational problem", such as - find the shape that maximizes the area inside a fixed closed curve of constant length, having no nodes.

Least squares comes straight out of this type of analysis. Eventually it boils down to the "Gram-Schmidt inequality" or "triangle inequality" for vectors

|a|-|b| < |a+b| < |a|+|b|

The vectors here are the sequences of coefficients that represent the approximating function in some basis.

If you are interested I can provide more detail.


-drl
     Spectrum matching? - (tablizer) - (12)
         It's a standard fitting technique. - (Another Scott) - (11)
             We had a big argument about least squares once - (tablizer) - (10)
                 It's not arbitrary... - (Another Scott) - (4)
                     re: "Proof" - (tablizer) - (3)
                         I'll give a brief rundown... - (Another Scott) - (1)
                             Ouch. I'll have to reread it several times. My math is rusty - (tablizer)
                         There is - (deSitter)
                 The exponent 2 is NOT arbitrary - (ben_tilly) - (4)
                     Physical events - (ChrisR) - (1)
                         Not in an obvious way to me - (ben_tilly)
                     Let's see if I got this right - (tablizer) - (1)
                         Yes - (ben_tilly)

Quis Custodiet Custodes Ipsos?
75 ms