My Regression is Better Than Your Regression.

If regression is just a math formula, how can anyone claim “Ours is better than theirs”?

Who do we believe?  The reality is that the standard least-squares formula is just the inner algorithm.  There is the input, the magical regression algorithm, and the output.

Four useful things for an appraiser to consider:

  1. The analyst’s decision that regression is the right model for the problem;
  2. The analyst’s decision on what sales data is put into the model;
  3. The analyst’s decision on what predictors (features) go into the data set;
  4. The analyst’s ability to properly explain the result.

 

What’s important to me as a user of appraisal software? How that particular software helps me in each of these four parts of my job.  If only the magical software produced what was promised.  Push the button, and voilà!  Another trouble-free fee on the way. My Y=a+bx is better than your Y=a+bx.

If I could just push the button, get the instant answer, then deliver it to the client, my job could be so easy.  I could be rich!  So what’s the problem?  There’s an inner contradiction here.

The problem is that if I can push the magic “analyze” button, so can my clients.  So can other appraisers.  So can other competitors — like accountants, economists, AVMs, BPOs, and other unlicensed ‘evaluators.’

Pretty soon I push the magical button, and it stops delivering my magical appraiser fee.  Or – clients begin to ask for more.  Like why did I push that button?  What data did I put in?  What data did I leave out?  Why did I fail to consider the lack of sufficient value?  Please explain.  Please.

Why is this happening to us?

We hear how wonderful it is to regress, but no one is happy.  Could it be one of the four “things”?  For now, let’s look at just the first thing – whether the regression formula is the right algorithm for this problem.  Is it the right model, the right solution? The assumption is that we are all talking about the same formula – the ordinary least-squares formula, which minimizes the sum of squared errors.  I am not.

There are, in fact, several different regression formulas.  The least-squares formula became popular for a couple of reasons.  First, doing the math with paper and pencil, or even with a hand calculator, was hard and slow. (Still is!)  The formula had to be ‘tractable,’ as they said.  They admitted that squaring, summing, then un-squaring was rigid and often a poor model. But it was tractable.  Next, after the accountant’s spreadsheet became popular, add-ons appeared.  Some of them included statistics.  The “statistics” regurgitated whatever the programmers had learned in Stats 101.  They included descriptive statistics and the inferential ‘sample’ statistics.  Regression came with the graphs.  It was always least squares, because that was all they knew, and the 8086 CPU was not powerful enough to do other regression types.
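To make the “tractable” part concrete, here is a minimal sketch of the least-squares fit of Y=a+bx in plain Python.  The function name and the sales figures are hypothetical, made up only for illustration; the arithmetic is the standard closed-form solution that minimizes the sum of squared errors.

```python
def fit_least_squares(x, y):
    """Fit y = a + b*x by minimizing the sum of squared residuals."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need at least two paired observations")
    x_mean = sum(x) / n
    y_mean = sum(y) / n
    # Sum of squared deviations of x, and cross-deviations of x and y.
    sxx = sum((xi - x_mean) ** 2 for xi in x)
    if sxx == 0:
        raise ValueError("x values are all identical")
    sxy = sum((xi - x_mean) * (yi - y_mean) for xi, yi in zip(x, y))
    b = sxy / sxx              # slope
    a = y_mean - b * x_mean    # intercept
    return a, b


# Hypothetical sales: living area (sq ft) and sale price ($1,000s).
living_area = [1200, 1500, 1800, 2100, 2400]
sale_price = [250, 295, 340, 385, 430]

a, b = fit_least_squares(living_area, sale_price)
print(f"price = {a:.1f} + {b:.3f} * area")
```

Nothing here is more than sums and one division, which is why it could be done by hand.  It says nothing about whether a straight line is the right model for the sales in question.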

But today, there are many regression types to choose from:  simple, multiple, multivariate, logistic, quantile, seam, polynomial, stepwise, ridge, lasso, and Bayesian, among others.  Each does well for different purposes.  Each is a good modeling solution for a particular type of data and problem.  It’s the model, not the math, that’s important.  The appraiser models the model.

Appraisers will be useful so long as modeling decisions need to be made.

Appraisers will be useful so long as algorithms and tools need to be selected – like regression types.