business statistics - qbm117 introduction to hypothesis testing
TRANSCRIPT
Business Statistics - QBM117
Introduction to hypothesis testing
Objectives
To introduce the second type of statistical inference - hypothesis testing
To introduce the concept of hypothesis testing.
To gain a basic understanding of the methodology of hypothesis testing
Introduction to hypothesis testing
Hypothesis testing is another type of statistical inference where, once again, decisions are based on sample data.
The purpose of hypothesis testing is to determine whether the sample results provide sufficient statistical evidence to support (or fail to support) a particular belief about a population parameter.
Over the next few lectures we will develop a step-by-step methodology that will enable us to test these beliefs.
The objective of hypothesis testing is captured by this question:
Is the sample evidence consistent with a particular hypothesized population parameter, or does the sample evidence contradict the hypothesized value?
By rejecting the plausibility of the initially hypothesized value, we indirectly establish the plausibility of an alternative hypothesized value or range of values.
The null and alternative hypotheses
is the challenged hypothesis. It is the assertion we hold as true, until we have sufficient statistical evidence to conclude otherwise.
it always expresses a value of the population parameter which we intend to subject to scrutiny, based on sample data.
The null hypothesis, denoted H0 ,
The purpose of scrutinizing the null hypothesis is to determine whether there is support for the alternative hypothesis, denoted HA ,
Examples of null and alternative hypotheses
The operations manager is concerned with determining whether the filling process for filling 100g boxes of smarties is working properly.
If the manager wants to know whether the average fill of the boxes is less than 100g, he would specify the null and alternative hypotheses to be
100:
100:0
AH
H
If the manager wants to know whether the average fill of the boxes is more than 100g, he would specify the null and alternative hypotheses to be
100:
100:0
AH
H
If the manager wants to know whether the average fill of the boxes differs from 100g, he would specify the null and alternative hypotheses to be
100:
100:0
AH
H
The manager hopes to find the filling process is working properly, however, he may find the sampled boxes weigh too little
100: AHor too much
100: AH
As a result he may decide to halt the production process until the reason for the failure to fill to the required weight of 100g is determined.
By analysing the difference between the weights obtained from the sample and the 100g expected weight, he can reach a decision based on this sample information, and one of the two conclusions can be drawn.
The test statistic is a sample statistic calculated from the data. Its value is used in determining whether to reject or not reject the null hypothesis.
When testing hypotheses about the population mean , when the population variance is known, the test statistic will be x
The test statistic
or its standardised value
n
xz
/
as long as the population is normal or 30n
When testing hypotheses about the population mean , when the population variance is unknown, the test statistic will be x or its standardised value
ns
xt
/
as long as the population is normal.
When testing hypotheses about the population proportion p, the test statistic will be p̂ or its standardised value
npq
ppz
ˆ
as long as 5 and both nqnp
The rejection region
The sampling distribution of the test statistic is divided into regions, a region of rejection (critical region) and a region of non-rejection.
The rejection region consists of all values of the test statistic for which H0 is rejected.
The non-rejection region consists of all values of the test statistic for which H0 is not rejected.
The value that separates the rejection region from the non-rejection region is the critical value.
-3 -2.50
-2.00
-1.50
-1.00
-0.50
0.00
0.50
1.00
1.50
2.00
2.50
3.00
Critical value Critical value
Region of non-rejection
Region of rejection Region of rejection
Two-tailed hypothesis test HA: 100
Critical value
Region of rejection
Upper tailed hypothesis test HA: > 100
-3 -2.50
-2.00
-1.50
-1.00
-0.50
0.00
0.50
1.00
1.50
2.00
2.50
3.00
Region of non-rejection
Critical value
Region of rejection
Lower tailed hypothesis test HA: < 100
-3 -2.50
-2.00
-1.50
-1.00
-0.50
0.00
0.50
1.00
1.50
2.00
2.50
3.00
Region of non-rejection
The decision rule
This is a rule that specifies the conditions under which the null hypothesis will be rejected.
It is a mathematical representation of the region of rejection as seen in the previous three slides.
For example the rejection region in slide 2 might be described by
Reject H0 if x is greater than 110.
Reject H0 if x is less than 90.
or the rejection region in slide 3 might be described by
The critical value
Therefore, in order to illustrate these rejection regions and describe them mathematically, we need to know the critical value(s), ie that value or values which separate the rejection region(s) from the non-rejection region.
How do we determine the critical value?
How do we determine this critical value?
The determination of the critical value depends on the size of the rejection region.
The size of the rejection region depends on the probability of making an error, when testing our hypotheses.
So, what are these errors?
Errors in hypothesis testing
rejecting the null hypothesis when it is true, a type I error;
not rejecting the null hypothesis when it is false, a type II error.
A hypothesis test concludes with a decision to either reject or not reject the null hypothesis. This decision, together with whether the hypothesis is true or not, results in two possible errors:
Because the decision we make and the conclusion we reach is based on sample data, there is always a possibility of making an error.
Type I and type II errors
The probability of making a type I error is defined as . This probability is also referred to as the level of significance.
The probability of making a type II error is defined as .
Ideally we would like to keep both errors and as small as possible. Unfortunately however, as decreases increases and vice versa therefore generally the size of is decided by the cost of making a type I error.
The size of is usually kept as small as possible, generally a value between 1% and 10%.
Once the value of is specified by the decision maker, the size of the rejection region is known because is the probability of rejecting the null hypothesis when it is true.
The size of this rejection region is .
-3 -2.50
-2.00
-1.50
-1.00
-0.50
0.00
0.50
1.00
1.50
2.00
2.50
3.00
Critical value Critical value
Region of non-rejection
Region of rejection
/2 = 0.025
Region of rejection
/2 = 0.025
Two-tailed hypothesis test HA: 100 when = 0.05
0.95
Critical valueRegion of rejection
= 0.05
Upper tailed hypothesis test HA: > 100 when = 0.05
-3 -2.50
-2.00
-1.50
-1.00
-0.50
0.00
0.50
1.00
1.50
2.00
2.50
3.00
Region of non-rejection
0.95
Critical value
Region of rejection
= 0.05
Lower tailed hypothesis test HA: < 100 when = 0.05
-3 -2.50
-2.00
-1.50
-1.00
-0.50
0.00
0.50
1.00
1.50
2.00
2.50
3.00
Region of non-rejection
0.95
Reading for next lecture
Chapter 10, sections 10.3 and 10.5 (Chapter 9, sections 9.3 and 9.5 abridged)