Power and sample size

'How many data values do I need?'

- 'It depends on the wanted power'

- 'It depends on your two hypotheses'

'??'

- 'Read all info, do the exercises!'

Definition of 'power' (Oxford dictionary of statistics):

'The probability of accepting the alternative hypothesis when it is true'

Interpretation:

Make sure that the page is in its initial state (restart if necessary). The two red lines is the interval where we will accept the null hypothesis (H₀). However, if H₁ is true, there is a large probability (the area right of the right red line and under the dashed curve) of getting an average right of the right line. This probability is called power and is 0.85.

Thus there is a risk of 0.15 (1 - the power) of falsely accepting H₀ when it is false.

(In so-called acceptance sampling the power is traditionally calculated as (1 - power) and then called the Operating Characteristic curve (OC-curve).

Sample size:

Difference:

Power value:

Sigma:

Alpha:

Less than Not equal Greater than

Data description...

Info, referenser, m.m...

19. Test of hypothesis — power and sample size

Repeat (a new average)

An illuminating example

Suppose that a foodstuff is sold in glass jars with a content of 25 grams, at least according to the label. If the average content is too small the consumers will protest and if the average is too high it will be too expensive for the manufacturer.
To protect himself the manufacturer wants to measure the content of a number of glass jars. A simple set of measurements can be helpful but the sample needs to be designed in some way.
Apart from the equipment and methods needed, the size of the sample needs to be determined. To determine the sample size, there must be some kind of knowledge of sigma, the standard deviation of the content. This might be known from previous measurement or some other method of estimation.

The investgator needs also some other entities:
1. The alpha-value i.e. the risk (probability) that the null hypothesis (here "The average content is 25 grams").

2. The alternative hypothesis which can be seen as the largest deviation that can be accepted. (In computer programs this is often expressed as the difference from the null hypothesis.)

3. The power of the test i.e. the probability of rejecting the null hypothesis in favour of the alternative hypothesis (when the null hypothesis is false).

The investigator can adjust these parameters so the result is a reasonable plan for the measurements. (If a large power is wanted the sample size might rise to what can be unacceptable.) NB that the reasoning can be also reversed, i.e. having a certain sample size in mind, what power will this get, or having decided the power and sample size, what difference can be detected?

(See the other info buttons and exercises for more information.)

••••

μ H0

μ H1

number of values

alpha

General. The exercises from 2 and higher are clickable and will store Minitab-commands with the given parameter settings. Any change of a parameter will not be transfered to the commands unless the exercise is clicked again.
To transfer the commands the 'Copy commands'-button is pressed and then the result must be pasted into the Minitab-session window.
Any value of the tripple 'power/difference/sample' can be calculated given the other two. Perhaps 'sample size' is the wanted answer and getting this the power wanted and difference to detect, must be given. (Se Exercise 4).

Exercise 1 – change the parameters
Change the parameters via the slides and note that the difference between the two hypotheses and correpsponding distributions and the curve change accordingly.
The power curve is a function of the other parameters. Click the radio buttons to see the changes of the power curve. Note that the power increases with increasing sample size and also with decreasing sigma.

Exercise 2 – power, given difference and sample size
This exercise will calculate the power value using the given difference and given sample size. Click 'Exercises...' and transfer the parameter values to the command area.

Exercise 3 – difference, given power and sample size
This exercise will calculate the difference between the hypotheses that can be detected using the wanted power and given sample size. Click 'Exercises...' and transfer the parameter values to the command area.

Exercise 4 – sample size, given difference and power
This exercise will calculate the needed sample size in order to detect the wanted difference with the wanted power. Click 'Exercises...' and transfer the parameter values to the command area.

Exercise 5 – 'Less than'
Sometimes we want the parameter to be less than a given value. Thus we want protection against the parameter becoming too large and this with a certain power. Click 'Exercises...' and transfer the parameter values to the command area.

Exercise 6 – 'Greater than'
Sometimes we want the parameter to be greater than a given value. Thus we want protection against the parameter becoming too small and this with a certain power. Click 'Exercises...' and transfer the parameter values to the command area.

••••

The two distributions are theoretical normal distributions of averages, not the measurements. The blue distribution shows averages around the null hypothesis and the grey dashed distribution shows averages around the alternative hypothesis.

The standard deviation of averages where each average is calculated with n values:

$σ_{averages} = \frac{σ}{\sqrt{n}}$

The two red lines are the same as a calculating a confidence interval with real data in a so-called t-test.
The sum of the two areas under the blue curve, but outside the vertical red lines, equals the alpha-value.

The red cross is a simulated average created by the alternative hypothesis. The 'Repeat'-button will create a new value.

••••

Probability of rejecting H₀ when it is false

The Y-axis shows probability from 0 to 1.

The X-axis shows both the null hypothesis H₀ and the alternative hypothesis H₁. (The endpoints of the X-axis are changed in the top distribution graph.)

The blue curve shows how the power changes with the distance between the two hypotheses. The curve is symmetrical around the null hypothesis. The lowest value of the power curve is the alpha-value.

The red dot is the power value for the given value of the alternative hypothesis.

The X-axis can be changed by clicking and changing the min or max values for a better fit.

••••

'Less than', 'Not equal' or 'Greater than'

'Less than' is valid when we want the mean to be high and we want the analysis to show if the results is lower than wanted. Perhaps we have promised that 'the product contains at least 200 grams'.
We do not mind if the average is more but definitely not lower than this value.

'Not equal' is valid when we want the mean to be a certain value, not less, not more. Perhaps we have promised that 'the product contains 250 grams'. We do not want the average to be neither less nor larger than this value.

'Greater than' is valid when we want the mean to be low and we want the analysis to show if the results is higher than wanted. Perhaps we have promised that 'the product contains less than 20 milligrams of fat'. We do not mind if the average is less but definitely not higher than this value.

••••

The endpoints of the slides can not be changed.

The three top slides move 0.1 every time a right or left arrow is pressed. The 'number of values' changes by integers. The alpha slide moves in steps of 0.01.

μ H₀:  The value of the null hypotheses.

μ H₁:  The value of the alternative hypothesis.

σ:  The theoretical standard deviation, equal for both hypothesis.

number of values:  Number of values on which an average will be calculated.

alpha:  The probability (risk) that the null hypothesis is rejected even if it is true. This risk can be chosen lower but at the prices of a larger sample.

••••

A click on the [Copy commands]-button copies the commands to the clipboard. Shift to the Minitab-application and perform a 'Paste' into the 'Session window' in Minitab at the 'MTB >'-prompt. Click the [Enter]-key of the keyboard. This will launch the Minitab-macro in Minitab.

The rows in the right column in blue contain a short comment to each Minitab-command.

••••