CALISO - the calibration software for scientists and engineers
The science of experimental physics can be described as
"...putting something IN,
and then observing what comes OUT".
If we continue with our back-to-basics approach, we could also say that, in general, calibration procedures fall into one of only two categories:
Imagine that you perform a calibration on a device and take 20 measurements.
The results of our calibration are shown in tabular form below.
[Table: Tabular Output]
This data, as it stands, can be of value to other people. For instance, it can be used as a look-up table to obtain the output value for a given input, or to determine what input is needed to produce a required output. However, presenting the information in this way raises a big question:
Graphical Output
A quick, simple and, it has to be said, effective way of achieving this is to plot the calibration data on an x-y graph. Just by using your eye, a ruler, or a set of French curves, you will be able to generate a line that indicates the relationship between the data points. The line, which is known as a "trend line", will, if drawn with care, be a surprisingly accurate representation of the data points.
Seeing a graph will also illustrate to you a very important characteristic of eye-brain coordination, namely that when it comes to detecting trends, you will do much better using a graph than a table of values. Any deviation from a smooth x-y relationship is immediately apparent on the graph. The same cannot be said of the table of values.
Shown below is the data from our calibration, plotted as an x-y graph by CALISO. Someone with a copy of this graph can quickly and easily read off data points and interpolate between them. But there are still questions to be answered.
Which leads us on to the subject of:
Regression Analysis
Regression Analysis is a statistical technique that produces a line or curve of known function (equation) through a series of data points. In the usual least-squares form, the sum of the squared vertical distances from each point to the line or curve is minimised. In other words, the line drawn through the points is the closest mathematical relationship to the data, or as it is known, a line of "best fit". The most common type of regression is where a straight line is drawn through the data points. This is known as "Linear" or "First Order" regression. The line generated will be of the type:
A person given the values of C and D for the line of best fit can calculate for themselves the value of y for ANY given value of x, and vice versa.
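Assuming the usual slope-intercept convention, with C as the slope and D as the intercept (an assumption that matches the naming of A, B, C and D later on this page), a line of this type and its inverse can be written as:

```latex
y = Cx + D
\qquad\Longleftrightarrow\qquad
x = \frac{y - D}{C}, \quad C \neq 0
```

The second form is what allows the required input x to be calculated for a desired output y.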
Example 1:
Inserting the values of C and D into our expression gives:
Example 2:
Again, inserting the values of C and D into our expression gives:
The examples chosen are actually conversions between temperatures expressed in Celsius
and Fahrenheit.
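As a rough illustration of how such values of C and D could be obtained, here is a minimal least-squares sketch in plain Python. It assumes the form y = C*x + D, with C as the slope and D as the intercept. The function name `fit_line` and the five sample points are assumptions made for the illustration: the points are exact Celsius/Fahrenheit pairs, not the calibration data discussed on this page.

```python
def fit_line(xs, ys):
    """Ordinary least-squares straight line; returns (C, D) for y = C*x + D."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of squared x deviations and sum of x*y cross deviations.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    C = sxy / sxx              # slope
    D = mean_y - C * mean_x    # intercept
    return C, D


# Illustrative points taken from the exact conversion (assumed sample data).
celsius = [0.0, 25.0, 50.0, 75.0, 100.0]
fahrenheit = [32.0, 77.0, 122.0, 167.0, 212.0]

C, D = fit_line(celsius, fahrenheit)        # Celsius -> Fahrenheit
print(round(C, 4), round(D, 4))             # slope close to 1.8, intercept close to 32

# Using the same C and D in reverse: which x gives y = 212?
print(round((212.0 - D) / C, 4))
```

Because these points lie exactly on a straight line, the fit recovers the familiar conversion constants. Real calibration data would scatter around the line, which is the subject of the next section.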
Conversions of this type are obviously very useful, but the picture is incomplete unless the
person given the values of C and D is also made aware of exactly how closely the data
fits the line.
By this we are not talking about
how good a job your eye and ruler (or software) does of putting the best line through the
data points; any problem there is a mistake in interpretation of the data. What we are
actually concerned about is the dispersion of the data points around our line of best fit, and
hence, how well the data can be represented by a trend line of known function.
The statistical method of expressing the magnitude of this dispersion is called "CORRELATION".
Correlation
The number that we use to quantify it is known by a few fancy names, such as the Pearson Product Moment, but we shall refer to it as the "CORRELATION COEFFICIENT". Its calculation is beyond the scope of this page; suffice it to say that Caliso seamlessly performs these calculations for you, and has all the tools you need for further Regression Analysis. Its purpose, though, is this:
Have a look at the three Caliso graphs below. They will help you to appreciate the concept of a line of "best fit", and how the correlation coefficient indicates the level of dispersion of the data points around the line.
Now have another look at the three graphs and note the following:
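For readers curious about the arithmetic that the software handles, the sketch below computes the Pearson product-moment coefficient in plain Python. It is a minimal illustration, not Caliso's own routine. The function name `pearson_r` and both data sets are assumptions: one set lies exactly on a straight line and the other is a hypothetical scattered set made up for the comparison.

```python
import math


def pearson_r(xs, ys):
    """Pearson product-moment correlation coefficient of two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / math.sqrt(sxx * syy)


xs = [1.0, 2.0, 3.0, 4.0, 5.0]
on_line = [2.0 * x + 1.0 for x in xs]      # no dispersion at all
scattered = [3.2, 4.6, 7.9, 8.1, 11.5]     # hypothetical noisy readings

print(pearson_r(xs, on_line))     # 1 (to within rounding) for a perfect rising line
print(pearson_r(xs, scattered))   # below 1: points are dispersed around the line
```

The coefficient always lies between -1 and +1. The closer its magnitude is to 1, the more tightly the points cluster around a straight line.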
Higher Order Regression Analysis
Not all data is best represented by a straight line; for example, the area of a circle is a function of the square of its diameter and not the diameter itself. This is not a problem, because Linear Regression is not the only type of Regression Analysis. Higher orders of regression allow us to find more complex relationships between x and y. Listed below are the types of analysis offered by Caliso:
The A, B, C, and D are known as the "Regression Coefficients", where:
Orders higher than 3 are also possible, but are not used by Caliso. This is because third order
regression is more than capable of providing accuracies that are within the requirements of
experimental measurement.
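Assuming these are ordinary polynomial fits in which A, B, C and D multiply descending powers of x (an assumption, chosen to agree with the C and D used for the straight line earlier), the three orders can be written as:

```latex
\begin{aligned}
\text{First order:}  \quad y &= Cx + D \\
\text{Second order:} \quad y &= Bx^2 + Cx + D \\
\text{Third order:}  \quad y &= Ax^3 + Bx^2 + Cx + D
\end{aligned}
```

Under that convention, the circle example is a second order relationship: with x as the diameter and y as the area, y = (π/4)x², so B = π/4 while C and D are zero.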
Selecting the Best Order Regression Analysis to Use
In some cases, external circumstances will require you to perform linear regression, for example:
Otherwise, use the highest order of regression available for the number of data points you have or are required to produce. All regression software tools worthy of merit will produce a line of best fit, even if that line is generated using a lower order of regression than the one chosen by the user. The software will do this by forcing any unwanted coefficients to zero.
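The zero-coefficient behaviour described above can be illustrated with a small least-squares polynomial fit. This is a minimal sketch in plain Python, not Caliso's own method. The function name `polyfit_sketch`, the normal-equations approach and the sample points (which lie exactly on y = 2x + 1) are all assumptions made for the illustration. The coefficients are assumed to be ordered A, B, C, D from the highest power of x down to the constant.

```python
def polyfit_sketch(xs, ys, order):
    """Least-squares polynomial fit; returns coefficients, highest power first."""
    m = order + 1
    # Normal equations (V^T V) a = V^T y, where V[i][j] = xs[i] ** j.
    ata = [[sum(x ** (r + c) for x in xs) for c in range(m)] for r in range(m)]
    aty = [sum(y * x ** r for x, y in zip(xs, ys)) for r in range(m)]
    # Gaussian elimination with partial pivoting on the augmented matrix.
    aug = [row + [rhs] for row, rhs in zip(ata, aty)]
    for col in range(m):
        pivot = max(range(col, m), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, m):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, m + 1):
                aug[r][c] -= factor * aug[col][c]
    # Back substitution, solving for the lowest-power coefficient last.
    coeffs = [0.0] * m
    for r in range(m - 1, -1, -1):
        tail = sum(aug[r][c] * coeffs[c] for c in range(r + 1, m))
        coeffs[r] = (aug[r][m] - tail) / aug[r][r]
    return coeffs[::-1]  # reorder so the highest power comes first


# Assumed sample data lying exactly on the straight line y = 2x + 1.
xs = [-3.0, -2.0, -1.0, 0.0, 1.0, 2.0, 3.0]
ys = [2.0 * x + 1.0 for x in xs]

A, B, C, D = polyfit_sketch(xs, ys, 3)   # ask for a third order fit
print(A, B, C, D)                        # A and B come out at or very near zero
```

With measured data that carries noise, the higher-order coefficients would typically come out small rather than exactly zero.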
Going back to the start of our discussion, Caliso was asked to perform a third order regression on our 20 calibration points. The results were:
You might also like to know that, using Caliso, the whole process, which involved entry of the new calibration data and the calculations, took less than 2 minutes, including printing a new calibration certificate and graph.