a113 6 p p15 making a strong case

Copyright © 2009

A113 Mathematics

Problem 15: Making a Strong Case

6th Presentation

• Central Limit Theorem– The Central Limit Theorem states that if the sample

size is large (≥ 30), the shape of the histogram of the sample means will resemble a “bell-shaped curve”, also known as the Normal distribution curve.

– Hence, we will be able to use the Normal distribution curve to estimate the chance that a sample mean falls within a certain range of values.

Recap

– The Normal curve is symmetrical about its mean.– The Normal curve is described by its mean and its standard

deviation (or variance).– The area under the Normal distribution curve represents the

probability of an event occurring where the total area is 1.

• Normal distribution

Recap

Mean

100%

• Manufacturer’s claim– The average volume per can is 330 ml.– This claim is commonly referred to as the null

hypothesis, H0.– The null hypothesis is presumed true unless we have

enough evidence to reject it.

• Student’s suspicion– The average volume per can is less than 330 ml.– This is commonly referred to as the alternative

hypothesis, H1.

Forming the hypotheses

Testing practically

• Hypothesis testing

– It is impossible to take all the cans that the manufacturer has produced and find their average volume to prove or disprove the manufacturer’s claim.

– A more practical way is to take a ‘sample’ and find statistical evidence to decide whether or not to reject the manufacturer’s claim.

• First, we assume that the null hypothesis is true.

• If our experiment produces a result (e.g. a sample mean) which is highly unlikely, we conclude that the null hypothesis is probably not correct. In this case, we have reason to reject the null hypothesis.

• Otherwise, we are unable to reject the null hypothesis. But this does not mean that we are 100% sure that the null hypothesis is true.

Hypothesis testing

• Significance level is the probability of making a decision to reject the null hypothesis when the null hypothesis is actually true.

• Critical region is the set of values for which we reject the null hypothesis.

• Critical value determines the boundary between a decision whether or not to reject the null hypothesis.

Common terms used

Hypothesis testing (Group A)

By the Central Limit Theorem, the shape of the histogram of the sample means will resemble a “bell-shaped curve”, which is known as Normal Distribution, as shown below.


xData collected from 30 cans: Average, = 325.8 ml, Standard deviation, s = 15.2 ml

x


Data collected from 30 cans:

– Population mean = 330 ml, estimated standard deviation = 3015.2

330

Average, = 325.8 ml, Standard deviation, s = 15.2 ml

x




– Population mean = 330 ml, estimated standard deviation =

330

– Consider testing at a significance level of 5%

x

Probability = 5%

30

15.2

– Determine the critical value and critical region (Red Zone)

Critical value: 325.4


330

Probability = 5%


325.8

Since 325.8 does not lie in the critical region, it is likely to obtain the average (for 30 cans) of 325.8 ml at a significance level of 5%.

We do not have enough evidence to reject the null hypothesis.




– Population mean = 330 ml, estimated standard deviation = – Consider testing at a significance level of 5%

30

15.2

– Determine the critical value and critical region (Red Zone)


x

Hypothesis testing (Group B)

Data collected from 100 cans: Average, = 327.2 ml, Standard deviation, s = 12.4 ml

– Population mean = 330 ml, estimated standard deviation = 100

12.4

330


Probability = 5%


– Determine the critical value and critical region

327.2

Since 327.2 lies in the critical region, it is highly unlikely to obtain the average (for 100 cans) of 327.2 ml at a significance level of 5%.

We do have enough evidence to reject the null hypothesis.



x

Hypothesis testing (Group B)


– Population mean = 330 ml, estimated standard deviation =

330


Probability = 1%



327.2





x

100

12.4

Hypothesis testing (Combined data)


– Population mean = 330 ml, estimated standard deviation = 130

13.1

330


Probability = 1%



326.9





x

• Establish the null and alternative hypotheses• Understand that significance level and critical

region determine how confident we are of our decision

• Determine whether or not to reject the null hypothesis

• Recognise that a larger sample size is preferred over a smaller sample size in hypothesis testing

Learning points

Discussion

If you are one of the students conducting the study to check the manufacturer’s claim, what would be your ideal sample size? State any assumptions that you have made.

a113 6 p p15 making a strong case

Technology

normal curve

normal distribution

bellshaped curve

null hypothesis

sample size

sample means

alternative hypothesis

total area