A few words about confidence

When doing validations and verifications, we statistically estimate different performance characteristics of a method. The values gained by calculations never give exact information about the method under examination. So how can we know how trustworthy the results really are?

The standard tool for estimating the credibility of a statistical result is the confidence interval (CI), which describes the precision of an estimate. A wide confidence interval points to a lack of information, whether the result is statistically significant or not, and warns against over-interpreting results from small studies. Validation Manager calculates CI ranges for you automatically to help you in interpreting your results.

Image 1: Whenever the confidence interval is calculated, you should evaluate how it affects the reliability of your results.

In Validation Manager, we use a 95% confidence level for calculations, meaning that if you repeated your experiment 100 times, in 95 of those experiments, the true value would fall between the measured confidence intervals.

What’s the difference between probability and confidence level?

Probability describes a situation where we know the distribution (or basically the mean value and variance) of possible results from which we are taking individual data points to examination. When the distribution is known, we know the probabilities of each possible result.

Confidence level describes a situation where we do not know the true distribution. Even when we know the exact concentrations we are measuring, we don’t know the exact mean value of results given by the instrument because we don’t know the precise amount of bias given by our measurement setup. We estimate the distribution by collecting a statistically significant data set to represent the true distribution. It is possible (though rather unlikely, see xkcd comic 882) that our experiment yields such results that the measured CI does not give any information about the true value. Still, with a 95% confidence level, the true value lies within the measured confidence interval.

With a 95% confidence level, if you repeated your experiment 100 times, 95 times the true value would fall between the obtained CI.

With a 95% confidence level, if you repeated your experiment 100 times, 95 times the true value would fall between the obtained CI. In Image 2, the green, blue, and grey segments represent the confidence intervals of consecutive measurements, and the red arrow represents the true value of the quantity being measured.

Image of a ruler and segments to visualize the idea of CI

Image 2: Example of confidence intervals that could be measured from the same sample.

What determines the width of the confidence interval?

Confidence interval is affected by the following components:

Selected confidence level. The more confidence you look for, the wider the interval. 95% is the standard choice for a confidence level, as it is widely considered the minimum level for any scientific use. In cases where mistakes would lead to extreme consequences like death, higher confidence levels are recommended. The reason why 95% confidence is used whenever possible is that with higher confidence levels, the minimum sample size for getting meaningful results grows fast to amounts that are barely feasible for most purposes (see the third point of this list).

Variability. Samples with more variability (larger SD) generate wider confidence intervals.

Sample size. The smaller the N size of the experiment, the wider the confidence intervals will be. There is an inverse square root relationship between the confidence intervals and sample sizes. This means that if you want to cut your margin of error in half, you need to approximately quadruple your sample size. Furthermore, it must be noted that, for example, for a probit LoD experiment, the sample size means both the replicate count and the number of dilutions.

Image 3 visualizes how growing your sample size gives more confidence in your results. All the graphs show result distributions with different data sets of a population with a certain mean value and variance. When measuring a quantity, we measure only a subset of the possible results that could be obtained. Different subsets give different mean values as well as different confidence intervals. With a larger number of measurements, we can be more confident that the result distribution represents the measured quantity. This makes the confidence interval smaller, but our measurement still gives only an estimate of the value.

A few words about confidence

What’s the difference between probability and confidence level?

What determines the width of the confidence interval?

What conclusions can be made from your results?

So what would be acceptable for confidence intervals?

Accomplish more with less effort

Leave a Reply Cancel Reply

PRODUCTS

ABOUT

RESOURCES

A few words about confidence

What’s the difference between probability and confidence level?

What determines the width of the confidence interval?

What conclusions can be made from your results?

So what would be acceptable for confidence intervals?

Accomplish more with less effort

Related Posts

How to estimate measurement uncertainty?

Succeeding with Validation Manager [Part 2]: climbing up the quality steps

Succeeding with Validation Manager [Part 1]: getting started

Leave a Reply Cancel Reply

PRODUCTS

ABOUT

RESOURCES