Monday, June 24, 2013

On the Practice of Science (aka a Scientist’s view of science) - Part 4



Fitting Models to Data

When an observation which is not predicted or explained by the current a scientist’s current model of reality is discovered and resists all attempts to correct it, the model must change to accommodate this new fact. In most instances this can be done by simply extending the existing model without any fundamental change to its existing structure. However, with sufficient extension any model can be made to fit almost any observations, creating a danger of ‘forcing’ the model to fit the data. Scientists detect this ‘forcing’ when proposed model extensions are not clearly logical and the justification is inadequate or seems contrived. The key test for proposed extensions is their ability to correctly predict similar situations to the original problematic observation (the reason for this will be discussed later). An extension which proves useful for many additional situations is adopted where as if the extension continues to be insufficient it may trigger a proposal of a new model of reality.
New models of reality come in two major forms; those which challenge an existing model of reality by being grossly consistent with past predictions & results while also opening up new areas, and those which carve off a subset of reality into a new field of study. The latter is typically triggered by an advance in technology or technique which improves the resolution or scale with which observations can be made. While the former are triggered by problematic observations which cannot be explained or predicted using the existing model of reality.
New models are evaluated by scientists before they are adopted or integrated into their personal model of reality. The criteria used in this evaluation are the ability of the model to explain existing observations (‘the facts’) as well as its ability to make accurate predictions. Since each scientist has differing beliefs about the importance of various observations to the understanding of the world based on their personal model, each scientist will evaluate the new model slightly differently.  Kuhn correctly observes that it typically takes 20 years for the majority of the scientific community to adopt a new model of reality and often many opponents are never convinced but rather retire in the interim. Because of the importance of prediction to the evaluation of models senior scientists have more direct experience with the prediction ability of the existing model; they tend to require more evidence of a new model’s prediction ability before they will accept it. Scientific research is slow so it often takes decades to accumulate sufficient validation of a new model’s predictions to convince a majority of the scientists in a field.
When models are extended or devised, the scientist relies exclusively on existing data and models to inform their thinking. Thus there is always a risk of over-fitting the data. Models which over-fit the data will explain the existing data extremely well but will fail to make accurate predictions because they are not modelling all of reality due to biases in which data have been collected. Prediction ability is most important when comparing models due to this problem. This over-fitting problem also explains why the simplicity of a model is highly valued.

Value of Simplicity
               
“It is pointless to do with more what can be done with fewer.”
William of Ockham (via: Wikiquote)

Above is just one quote which represents the principle known as Occam’s Razor. This is highly related to the issue of over-fitting discussed above, where a model of reality is not informative or useful because it is only explaining the part of reality we have already observed. Complicated models of reality can explain observed data well just because it is complicated without providing understanding (or predictability), because they effectively have a different explanation for each subset of data, valuing simplicity avoids wasting too much time on these models. In addition, excessive complexity can indicate the scientist is ‘forcing’ the model to fit the data.
                Another reason for the value of simplicity can be understood by its relationship with null hypotheses. Null hypotheses are what are actually tested in most scientific experiments. They are the model of what we expect if there is no pattern, no rules; so that we can determine if our observations support our model or prediction. Similarly observations cannot be said to support the additional elements in a more complex model if a simpler model matches the observations just as well. Models or model elements without support from observations cannot be considering informative or meaningful.

Success of Science
In contrast to Kuhn, I shall close by explaining why science does progress despite seeming to backtrack and flip-flop.  Science is exploration. It wanders blindly into the unknown feeling its way along. Individual ideas may wander back and forth or even backtrack. However, with each new model of reality that is adopted more and better predictions can be made without losing the explanation of observations already made. As such each new model represents a closer approximation to reality, a fixed target (given the assumptions outlined in Part 1) we may never reach.

Part 1
Part 2
Part 3