topic 3: hypothesis testingweb.uvic.ca/~bettyj/246/topic3web.pdf · topic 3: hypothesis testing...

78
1 Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation to estimate some unknown population parameter. I.e. μ and σ. # Within Topic 3, we now want to test the ________of some statement about the __________. Example: Did the price of ________increase by an average of only 10¢ a litre last month? Example: Was the average amount of _______ fees charged to Bank of Nova Scotia customers in January of 2000 higher than in January of 1999? Example : Is the proportion of _____paid as tax the same in 1989 as in 1999? # These assumptions about _______ values of population parameters are generally referred to as STATISTICAL HYPOTHESES. # Determining the validity of an assumption of this nature is called “hypothesis _______.” The primary goal of hypothesis testing is to choose between two __________ and ________ exclusive competing hypotheses about the value of a population parameter.

Upload: others

Post on 03-Aug-2020

10 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

1

Topic 3: Hypothesis TestingSection 10.1 Introduction

# Within Topics 1 and 2, we employed point and interval estimation to estimate some unknown population parameter. I.e. µ and σ.

# Within Topic 3, we now want to test the ________of some statement about the __________.

Example: Did the price of ________increase by an average of only 10¢ a litre last month?

Example: Was the average amount of _______ fees charged to Bank of Nova Scotia customers in January of 2000 higher than in January of 1999?

Example: Is the proportion of _____paid as tax the same in 1989 as in 1999?

# These assumptions about _______ values of population parameters are generally referred to as STATISTICAL HYPOTHESES.

# Determining the validity of an assumption of this nature is called “hypothesis _______.”

“ The primary goal of hypothesis testing is to choose between two __________ and ________ exclusive competing hypotheses about the value of a population parameter.

Page 2: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

2

Types of Hypotheses:

pWhen specifying the competing hypothesis with respect to some statement about a population parameter, it is convenient to distinguish between ______ hypotheses and _________ hypotheses:

1) ______ Hypotheses: only one value of the population parameter is specified.

Example: Test whether the average price of gasoline last month µ=70¢/litre.

Example: The variability of the increase in gas prices last month was σ2=100 ¢.

2) _________ Hypotheses: Specifies a range of values that the population parameter may assume.

With _________ hypotheses, more than one value is specified in each case.

Examples: The average price of gasoline last month was µ …70¢/litre. The price of gasoline was µ> 70¢/litre. The variability of gas prices σ2…100 ¢2

%______ hypotheses are generally easier to test than _________ hypotheses.

,With ______ hypotheses we only have to determine whether or not the population parameter equals the specified value.

,With _________ hypotheses, we must determine whether or not the population parameter takes on any one of a set of values.

Page 3: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

3

˜The two mutually exclusive hypotheses in a statistical test are referred to as the ____ hypothesis and the ___________ hypothesis:

(I) ____ Hypothesis: þ It is the statement being tested

þ Denoted by .H0

þ It is called “____’ because this hypothesis corresponds to a theory about a population parameter that is thought not to be true. Hence, the title: ‘null’ or ‘_______’.

Example: Θi~(µ, σ2) : µ=0 Testing to see if the population mean is ____. H0

is pronounced “H-nought.”H0

þWe can either reject or fail to reject the ____ hypothesis.

(II) ___________ Hypothesis: þ Is the situation which prevails if the null is _____.

þ Denoted .Haþ Generally, specifies the values of the parameter that theHa

researcher believes is ____.

Example: Θi~(µ, σ2)

(a) µ>0 ___ sided alternative.

(b)µ<0

(c) µ 0 ___-sided alternative.

Page 4: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

4

Remarks:

(i) (the ___ hypothesis) and (the alternative hypothesis), H0 Ha must be ________ _________ ( i.e. no overlap). P .( )A BI = 0

þIf is ____, then is _____.H0 Ha

(ii) The null and alternative hypotheses can both be either simple or composite:

Examples:

HH Simple null and alternative

HH Composite and composite

HH Simple null composite alternative

a

a

a

0

0

0

01

00

00

::

.

::

.

:: , .

µµ

µµ

µµ

==

≤ ⇐

= ⇐

>

<

Regardless of the form of the two hypotheses, the ____ populationparameter under consideration must be either in the set specified byH0

or in the set specified by þ __ _______!!Ha

Page 5: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

5

’To guarantee there is no _______, one convention is to create

_____________ hypotheses: HHa

0 00

::µµ=≠

(iii) The hypothesis is always specified about the __________ parameter, and not about the ______ statistic.

H correct formH X incorrect form

0

0

2525

:: !µ = ⇐

= ⇐

One and Two-Sided _____

’If the null hypothesis is ______, ( ), then the alternative H0 0: µ =

hypothesis may specify value(s) for the _________parameter that are

entirely above ( ), entirely below, ( ) or on Ha: µ > 0 Ha: µ < 0

other sides of the value specified by the ____ hypothesis,

( ). Ha: µ ≠ 0

Page 6: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

6

Notation:

___-sided Test: A statistical test where specifies that the Ha population parameter lies either ________ ‘above’ or ‘below’ the value specified in the ____ hypothesis.

Example: ___ sided test, since specifies that µ lies on HHa

0 77

::µµ=

> Ha

one particular side of 7.

___-Sided Test: A statistical test where the (alternative Ha hypothesis) specifies that the parameter can lie on

______ side of the value indicated by (null H0

hypothesis).

Example: ___-sided test since specifies that µ lies on HHa

0 77

::µµ=≠

Ha

______ side of _.

Page 7: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

7

Decision Problem

p We must use ______ information to determine whether to reject or not reject the _____hypothesis.

ö Hence, we must deal with the “uncertainty” associated with using _______ to test hypotheses pertaining to __________ parameters.

ö We again are faced with using ______ information to say something about the unknown __________ parameter

ö “_______ of _____”.

” In hypothesis testing, the usual procedure to solving these decisions is to: (1) initially assume that the ____ hypothesis is_____; (2) establish a probability _______ criteria; (3) take a ______; and (4) then employ ___________ based ________criteria.

ö We can then decide whether there is sufficient evidence to _____ the ____.

p The probability value which we base our conclusion, that the null is _____, is extremely important.

p Moreover, since the decisions to accept or reject the ____ are probability based, there are “chances of error” in these decisions.

Page 8: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

8

Two Types of Potential Error

Based on the sample result:(1) Type _ Error: ______ the null hypothesis when the null is

actually ____.

(2) Type __ Error: _____t the null hypothesis when the null is actually _____.

TrueSituation: is True is FalseH0 H0

Action:

Accept H0

Reject H0

Notes: Type I and Type II errors are ___________ probabilities.

1)Type I Error:

ö Conditional on being ____.H0

ö The probability of Type I error is denoted by “α”.ö “α” is referred to as the level of significance or____ of the

test.

“α” = P(Type I error) = P(______ | is ____).H0 H0

öThe level of ____________ of a statistical test is comparable to the probability of an error, also referred to as ‘α’, as discussed in Topic 2.

Page 9: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

9

The value of “(1-α)” is referred to as the __________ level and represents the complement of P(Type I Error).

(1-α) = Confidence level = 1-P(Type _ Error) = P(______ | is ____).H0 H0

Example: Set α = 10%; We will ______ when it is ____, in 10% of H0

the samples.

2) Type II Error:

”The probability of a Type II error is denoted by ‘β’.

β = P(Type II error) = P(______ | is _____).H0 H0

p The complement of this probability is known as the ______of a statistical test.,It indicates the ability of the test to correctly recognize that the null hypothesis is _____ and should be ________. (1-β) = (______) = P(______ | is _____).H0 H0

=1- P(______ | is _____).H0 H0

” The researcher always wants to create a test that will yield a power

close to _. I.e., ‘β’ close to ____, when is false. H0

Page 10: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

10

,Thus, we have the decision problem now, in terms of probability:

Decision Problem:

TrueSituation: is True is FalseH0 H0

Action:

Accept H0

Reject H0

Sum 1 1

Remarks:y We want a test for which both α and β are ____. (i.e. the probability of the two errors are low).y The probability of each decision outcome is a conditional probability.

y Elements in each column sum to one, because the column events are complements.yα and β need not add to 1, since their probabilities are not complementary.

y α and β are not ___________ of each other. öWhen α is lowered, β normally _________ if the sample size

remains the same.

Page 11: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

11

yα and β are not _________ of the sample size, n. öIf n increases, ____ α and β ________, since we use more

information about the population and potentially reduce the sampling error.

ö Researchers must decide between the higher cost of sampling to ________ the sample size, and the potential sampling error and size of α and β.

y In classical hypothesis testing, we set α and try to design tests such that β is as small as possible.

Page 12: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

12

Section 10.2 The Standard Format of Hypothesis Testing

pThe general procedure for testing hypotheses follows 5 STEPS:

STEP 1: State the ____ and ___________ hypotheses. i.e. Formulate the and .H0 Ha“ The form of the test will depend on both and .H0 Ha

Example:

HH Clearly specify the two conflicting hypothesesa

0 100100

:: .µµ=≠

, The two hypotheses must be ________ _________.

, The___ value of the population parameter must be included in one of these hypotheses.

STEP 2: Determine the Test _________ Used To Test the _____Hypothesis.

Example: In testing:

HH We could use X to do the testa

0 100100

:: .µµ=≠

Page 13: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

13

In this example, we must decide whether is true or false byH0

determining if is “_____ enough” to 100.X, If is far above or below 100, this will lead us to _____t the X

____.

, If the value of is slightly below or above 100, we will fail X to reject the ____, and we conclude that is true.H0

So: “ How do we decide what is far away and what is slightly

above/below the null hypothesis?”

¸We need to use probability — specifically the _______ distribution of the test statistic — to determine whether to reject or not reject .H0

¸To obtain a sampling distribution which is known, we _________ X to the standard normal distribution form:

, ZX

n=

− µσ

0

where ö ~N(µ,σ2/n), X ö σ is known, and ö µ0 is the value of µ under the null. (Here µ0=100)

¸ The random variable, Z, is the test _________.

Page 14: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

14

¸ The ____ statistic is a random variable employed to determine whether a specific sample result falls in one of the hypotheses being tested.

¸ The test statistic (1) must have its _._._. known under the condition that is ____;H0

(2) must contain the _________ being tested; and

(3) all of its remaining terms musts be _____ or calculable from the sample.

So, , ZX

n=

− µσ

0

(1) contains µ

(2) we know the distribution of Z ~N(0,1)

(3) in our example n and σ are assumed known and would be specified, and

(4) can be determined from the sample.X

Page 15: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

15

STEP 3: Determine the “________ Region(s)” of the Test

¸ Before the sample is taken, it is important to specify which values of the test statistic will lead to the _______of , (called the _______ H0

region,) and which values will lead to the acceptance of .H0

¸ The Critical Region is the region of test statistic values over which we believe to be _____, leading to a _________ of the null H0

hypothesis.

¸ The Acceptance Region is the complement of the rejection region. It is the region of test statistic values over which we believe the null to be ____: acceptance of the null.

¸ The ________ value is the value of the test statistic which separates the ________ region and __________ region.

The Acceptance and Critical Regions for H0

(Two-sided Alternative)

Acceptance Region

(Accept )H0 _______ Region

(Reject )H0

_______Region

(Reject )H0

Large Negative Z

Large Positive Z 0

Z values close to zero._______ Value _______

Value

Page 16: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

16

Question: “How do we determine the exact location of the ________ Values?”

¸Since the sampling distribution gives values of statistics and associated probabilities, we can use the sampling distribution of the ____ statistic, assuming is ____, to evaluate the probability of H0

observing different values of the test statistic under the null hypothesis.

¸Then we can derive _______values which depend on the level of ____ of Type I errors (α) and Type II errors (β).

Note:¸__________ the size of α will ________ the size of β.

¸Traditionally, the value of α is set and then we choose the critical region that yields the ________ value of β.

¸The value of α is an indicator of the degree of importance that a researcher attaches to the consequences of __________ rejecting the null .H0

¸ Usually in Social Sciences, we use a level of significance of α=0.__. I.e. we are willing to accept a _% chance of being wrong when we reject . In contrast, hard sciences, like H0

pharmacology, α is set much lower , α=0.000_ or α=0.0000_, indicating a greater concern of incorrectly rejecting .H0

Page 17: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

17

Example: Continuation:

HHa

0 100100

::µµ=≠

, Suppose the researcher desires a level of significance of α=0.05. I.e., no more than a 5% chance of committing a Type I error.

, The researcher wants the critical region to cut off 5% of the appropriate p.d.f. , Z- distribution.

, When the null hypothesis is ___-sided, the optimal critical region will cut off ___ of the area in each tail. (Same procedure as construction of a __________ interval.)

, If α=0.05, then α/2 =0.___. The Z-values that correspond to α/2=0.025 in each tail are ±____.

Figure 3.1

0

Acceptance Region

(Accept )H0

Critical Region

(Reject )H0

Critical Region

(Reject )H0

-1.96 1.96

Page 18: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

18

¢ The above illustration expresses the critical value (±____) in terms of the Z-distribution. i.e. We standardize , so we could determine the _______values for X a known probability distribution.

¢ We can also express the critical values in terms of __.

¢ It is simple to transform a Z ________ value into its__ critical value format by solving for :X

ZX

nOur test statistic

rearranging

X Zn

*

*

( )

:

=−

=

+

µσ

σµ

0

0

µ0 +

+Z

µ0

These 2 illustrations are equivalent means of depicting the critical regions.

+Z

µ 0

Page 19: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

19

STEP 4: Take Your Sample and Calculate the Value of the ____ Statistic

Recap: 1) Specified ____hypotheses: and .H0 Ha 2) Determined the appropriate ____ _________. 3) Find the ________ region(s).

Now we must determine if the sample result lands in the acceptance orrejection regions.

¸ If we use _-values to define the ________values, we must standardize the sample results into _-values.

,This is called determining the “________-Z” value: Zc

ZX

n

Equationc =−

⇐( )

σ0 101

Notation:Z* Ô ________ Z-value

Zc Ô ________ Z value

t* Ô Critical t-value

tc Ô Computed t-value

Page 20: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

20

STEP 5: Compare the Calculated Test Statistic With the Critical Value and Make the Statistical Decision

Decision Rule:

p If Zc value of the test statistic falls ______ the critical region, thenthe belief that is ____, is ________. H0

p If Zc value of the test statistic falls ______ the acceptanceregion, is ________.H0

Remarks:

The final decision depends on:

% the particular ______% α, the level of significance/size of the test% form of the alternative hypothesis: Ha

Example: Two-Sided TestLet Xi ~N(µ,σ2)We want to test the belief that µ=$50.

HHa

0 5050

::µµ=≠

We decide that we will test at the _% significance level α=0.05. Suppose that n=___; σ2=__; and =48.5.XQuestion: Is the value of (=48.5) “close enough” to the value of µXunder the null?

Page 21: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

21

The sampling distribution ~ N(µ, σ2/n) =X ~ N(50, __/___=0.25) if is true.H0

P( | is true)X H0

Low 50 UpperX X

α=5%P(Reject | is true) = 0.05.H0 H0

P( < *Low or > *

upper | is true)=0.05X X X X H0

, 2P( > *upper | is true)=0.05X X H0

, 2P( > *upper |µ=50)= 0.__X X

, P( > *upper |µ=50)= 0.___X X

, P(Z > Z*)=0.___

Where Z* is the critical value for Z.

Page 22: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

22

From the standard normal table we get:Z*upper =_.__Z*Low =-_.__

We reject if either: Zc >____H0

Zc<-____

ZX

nc =

−=

−= = −

µσ

0 485 506

122 15 3

( . )( . )

Since Zc =-3 which is less than -____, we _____t at 5% level ofH0

significance.

-3 -1.96 0 1.96 3

Notice we split the significance level into 2 parts — one relating toeach condition under which we would reject H0

Page 23: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

23

One-Sided Tests

For these tests, the only difference from the two-sided tests is that theone-sided tests have a one-sided ___________ hypothesis, such that wehave a single ________ rejection region.

α=probability in upper tail.

α=probability in lower tail.

Page 24: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

24

Example:

Let Xi ~N(µ,σ2 =36)Test to determine if the population mean is equal to 50 (µ=50)

HH One sided null hypothesisa

0 5050

::

.µµ=

⇐ −

<

Suppose n=___; = 48.5; Test at 5% significance α =0.05.X“Is close enough to the value of µ under the null?” X ~ N(µ, σ2/n) =X ~ N(50, 36/___=0.25) if is true.H0

50X *

Rejection region Acceptance region

is the critical value for .X * X

Page 25: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

25

Standardize :X

ZX

nc =

−=

−= − = −

µσ

0 485 506

122 15 3

( . )( . )

Zc ~N(0,1)

× ________ valueZα = = −0 05 1645.* .

Since, , ______ the null: The population mean does not Zc < Z =0.05*α

equal 50.

Z-3 -1.645 0

Page 26: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

26

In terms of the original units:

Critical value

X Zn

X

X

*

*

*

.

. .

= −

+

= −

+

= − + =

α

σµ

16456

1250

08225 50 491775

Calculated value

X Zn

X

X

c c

c

c

= −

+

= −

+

= − + =

σµ

306

1250

150 50 485

.

. .Since 48.5 is less than and left of 49.1775, we ______ the null.

X

48.5 49.177 50

Page 27: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

27

What if we change the level of ____________?Test at 1% significance α =0.05.

“Is close enough to the value of µ under the null?”X

ZX

nc =

−=

−= − = −

µσ

0 485 506

122 15 3

( . )( . )

Zc ~N(0,1)

− = −=Zα 0 01 2 326.* .

We still ______ the null.

******************************************************“What if we change the level of significance again?”:

Test at 0.1% significance: α =0.001.

“Is close enough to the value of µ under the null?”X

ZX

nc =

−=

−= − = −

µσ

0 485 506

122 15 3

( . )( . )

Zc ~N(0,1)

− = −=Zα 0 001 308.* .

We cannot ______ the null at the α =0.001 level.

Page 28: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

28

The P-Value

˜ Test results can vary depending on α, the level of significance.˜ It is not ________ for researchers to avoid specifying α before seeing the data, and instead reporting a ___________ that depends on the computed value of the test statistic, say Zc.

“Changing the previous example such that =__.X Let Xi ~N(µ,σ2 =__)HH One sided null hypothesisa

0 5050

:: .µµ=

⇐ −<

Suppose n=___; =___.; Test at 5% significance α =0.05.XThe computed value is now:

ZX

nc =

−=

−= − = −

µσ

0 49 506

122 1 2

( )( )

We usually compare Zc with the ________ values to decide whether to rejector not reject the ____.

− = −=Zα 0 01 2 326.* .

Zα = = −0 05 1645.* .

Zα = = −0 10 128.* .

Zc is outside the 5% and 10% acceptance region but inside the 1% acceptanceregion.

Page 29: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

29

The _-value equals the ___________ that the random variable Z wouldtake on a value as _______ as Zc, given that the ___ hypothesis is ____.

( )( )

P value P Z Z

P ZP Z

c− = ≤

= ≤ −= − ≥= −=

(( )

..

21 21 0 97720 0228

“A sample mean of __ or _____ will occur only _.__% of the time when the true mean is __. Hence, the true mean may not actually be 50.”

Decision Rule:

¸ If α is ______ than _-value, ______ H0.¸ If α is _____ than _-value, ______ H0.

The above example relates to a one-sided test about µ, when Ha is one-sided on the _____ side:

( )P value P Z Zc− = ≤

When Ha is one-sided on the upper side:

( )P value P Z Zc− = ≥

Page 30: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

30

¸ If the __________ hypothesis is ___-sided, the one-sided probability must be _______ to obtain the p-value:

( )( )

P value P Z Z

P ZP Z

c− = ≤

= ≤ −= − ≥= −= =

2

2 22 1 22 1 0 97722 0 0228 0 0456

*

* (*( ( ))( . )*( . ) .

So, if µ=50, we would observe a sample mean with the difference fromthe mean higher than ___, (50-49), either above or below µ, in 4.56% ofthe samples.

Relating back to the example above, _____ H0 at the α=0.05, but _____H0 at α =0.01.

Given that we have a 2-sided alternative:

-2.575 -2 -1.96 -1.645 0 1.645 1.96 2 2.575

At α=5% , Z*=____At α= 10%, Z*=_____At α=1% , Z*=_____

Page 31: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

31

For any test, the p-values can be obtained (at least approximately orwithin a range) from interpolation within a probability table such as Z-table, t-table, etc. Exact p-values can be found using EViews.

Example: Your research group is hired as consultants to a major financial institution in _______ who plan to market a new financial product, targeting people living in major urban areas. In a previous census 5 years ago, the mean income of people 25-40 years old was ________ distributed with a mean of $______ per year, and a standard deviation of $_,___. Within wage gaps closing due to union and pay equity, it is believed that the ____ income has _________. Your team takes a random sample of __ incomes and finds =$43,500. X Report a p-value, assuming the standard deviation has not changed. Would you accept or reject the null hypothesis at the 5% significance level?

(1) Establish the Hypotheses:

HHa

0 42: (in thousands):µµ=> 42 (one - sided)

(2) Test Statistic:

ZX

nc =

( )µ

σ0

(3) Determine critical regions: Since α=0.05 and Ha is one sided on the high side, the critical value is Zα=Z0.05=_____.

Page 32: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

32

(4)Calculate the test statistic value and report a p-value:

ZX

nc =

−=

−= =

µσ

0 435 428

75

150 924

1623( . ) .

..

Zc =1.623P(Z >1.623) =1-P(Z<1.623)=1-0.9474=0.0526P-value= ______

(5) Decision rule:

Since the p-value is _______ than 0.05, we _____ reject the ___. P-value falls ______ the acceptance region. The ____ income has ___ changed.

0 1.623 1.645

Page 33: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

33

Section 10.3 Testing Hypotheses About µ When σ2 is _________

˜In general, there are two types of tests involving __.

,_ assumed to be known: use of the _-distribution is appropriate.

,_ assumed to be unknown: _-distribution appropriate test statistic (although Z-distribution is OK if n > 30.

˜If we need to solve problems involving when σ is _______, use X the t-distribution if the parent population is ______.

˜ So, if µ0 is the value of µ specified by the ____hypothesis, then when σ is _______ and the population is ______, the appropriate test statistic for tests on µ is:

tX

Standardize X and form t statisticn( ) __

− =−

⇐ −10µ

Page 34: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

34

Example: In ____, the average _____-weight was ___ lbs. We believe that newborn birth-weights are _____now than they were 25 years ago. We sample of __ such weights and find that

=____lbs. with s2=4 lbs2. XUsing a significance level of _%: ,[ ]α = =P Reject H H is true0 0 0 05.

test to see if birth-weights have _________.

(1)Determine the hypotheses:

Test: HHa

0: _ _ _:µµ

=>_ _ _

(2) Test Statistic: tXsn

n( )− =−

10µ

(3) Determine Critical Region:

t t

X

critical = =

=

+ =

0 05 24

25

7 5

. ,*

*

_ _ _ _ _

_ _ _ _ . _ _ _ _ _

(4) Calculate the test statistic and p-value:

tXsn

t valuec =−

=−

= ⇐ −µ0 8 2 7 5

25

. ._ _ _ _ _ _ _ _ _ _ _ _ _ _ _

Page 35: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

35

t

X

calculated

c

=

=

+ =

_ _ _ _

. . .17525

7 5 8 2

(5) Since t* is ____ than tc and the calculated value is _______ the acceptance region, we ______ the null to be true, and conclude that birth-weights have _________.

Or since > , we ______ the null and conclude that birth-Xc X *

weights have _________.

t24

X

Notice at the 1% level, t*=2.___, and at the 2.5% level, t*=2.___We would ______ the null that birth-weights have not changed intwenty-five years.

P-Value Using EViews

Page 36: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

36

Tests: ( )H versus HP value P t

a0 7 5 7 5175

: . : ..

µ µ=− = ≥

>

EViews Command Code:scalar t=1.75show 1-@ctdist(t,24)

************************************

*******************Use the @ctdist(t, n-1) command to get p-value;*Note the p-value depends on the alternative hypothesis*that is specified.*******************************************************

|_*t is the calculated value.

Page 37: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

37

|_distrib t/type=t df=24 T DISTRIBUTION DF= 24.000 VARIANCE= 1.0909 H= 1.0000 DATA PDF CDF 1-CDF T ROW 1 1.7500 0.87989E-01 0.95355 0.0464481

( )P value P t CDF− = ≥ = − =175 1 0 04644. .

If we had a two sided ___________ hypothesis:

HHa

0 7 57 5

: .: .µµ

=≠

( )

( )

P value P tP t P tP t CDF

− == ≥ + ≤ −= ≥ = −= =

> 175175 175

2 175 2 12 0 04644 0 09288

.( . ) ( . )( . ) *( )*( . ) .

p Cannot ________ the null at the _% level, but ______ at the __% significance level.

Page 38: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

38

Section 10.4Measuring β and the _______of A Test

p Up to this point, we have not calculated the value of β: P(Type __ error),

primarily because the alternative hypothesis has been _________ in format, and hence there is no one value specified for µ that makes H0 false.Example: H0: µ=100 Ha: µ…100

p A Type I error, incorrectly _________ H0, is α, since it will occur when µ=100.

p A Type II error, incorrectly _________ H0, can occur for any value of µ not equal to 100.

¸The probability of incorrectly accepting H0:µ=100, is much higher when the true value of µ=99.9 than when the true value is µ=125.

, And, the value of β is _________ for these 2 situations.

p So, one can calculate the values of β for other values of µ which may be possible under the alternative hypothesis.

p These different probabilities of β that occur when Ha is _________ can be summarized in either _____ format, by graphical illustration, or described by a functional relationship.

p Often a _____ curve illustrates the value of (1-β) — the _____ function — which depicts the ability or power of the test to correctly reject a _____ null hypothesis.

Page 39: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

39

Example: Illustration of β Error Calculation:

Suppose Xi~N(µ=__,σ2= __); n=144 ; α=0.05 Zα/2=____.

Step 1: Determine your __________:

H0: µ = __Ha: µ …__

Step 2: Test _________:

ZX

n=

− µσ

0

A Type I error occurs when we believe µ… __, when the null is actually____: ,H0: µ=___.

A Type II error occurs when we believe µ=__, when the null is actually_____: , Ha: µ…__.

Step 3: ________ ______:

µσ

α0 2 1966

120 98± = ± = ±Z

n*

/ _ _ ( . ) _ _ .

, Acceptance region is from __.__ to __.__

Page 40: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

40

How Do We Calculate β?

¸ Since β is a conditional probability that depends on the ____ value of µ, we will assume that µ= __._.

β=P( Accept H0:µ0=__*µ=__._)

We know that H0 will be accepted whenever X lies between __.__ and__.__. So,

β=P(49.02 # # 50.98 *µ=__._).X¸ Using the same Zα/2 and test statistic, we transform this problem into the standardized normal format, but allow µ = __._, σ= _, and n= 144:

[ ]

P X PX

n

P Z

P ZF F

( . _ _ ._ _ ). _ _ ._ _ _ ._ _ _ _ ._

.

...

. .( ._ _ ) ( _ ._ _ )

. . ._ _

49 0249 02

612

612

14805

0 4805

2 96 0 960

08315 0015 0

≤ ≤ =−

≤−

≤−

=−

≤ ≤

= − ≤ ≤= − −= − =

µσ

' P(Type II error) =0.__Interpretation: When µ=__._, we will _________ accept H0:µ0=__ as being true 83% of the time using our test procedure. + The power of this test is (1- β)= 1-0.__= 0.17.

Page 41: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

41

This test will ________ recognize this _____ null hypothesis 17% of thetime when µ= __._.

Remark: This is only one possible result using µ= ____.

What about when µ=__._?

[ ]

P X PX

n

P Z

P ZF F

( . . ). _ _ ._ . _ _ _

.

...

. .(_ _ _ _ ) ( . )

. ( . ) ._ _

49 02 50 9849 02

612

50 986

12

0 4805

14805

0 96 2 960 96

0 9985 1 08315 0

≤ ≤ =−

≤−

≤−

=−

≤ ≤

= − ≤ ≤= − −= − − =

µσ

Symmetry! ˜The power of this test is (1- β)= 1-0.__= 0.__. This test will correctly recognize this ____e null hypothesis __% of the time when µ= 49.5.What if µ= __._

Page 42: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

42

[ ]

P X PX

n

P Z

P ZF F

( . . ). _ _ ._ . _ _ ._

.

...

. .( . ) ( _ _ _ _ )

._ _ _ _ . ._ _ _ _

49 02 50 9849 02

612

50 986

12

05805

13805

116 2 762 76

0 0123 0

≤ ≤ =−

≤−

≤−

=−

≤ ≤

= − ≤ ≤= − −= − =

µσ

P(Type II error) = 0.____

˜When µ= __._, we will ___________ accept H0:µ= __ as being true __.__% of the time.,The power 1-β =0.____.¸We will correctly recognize this false null hypothesis 12.5% of the time when µ=____.

What about when µ=_____?

[ ]

P X PX

nP ZF F

( . . ). _ _ _ _ _ . _ _ _ _ _

. .( . ) ( . )

. ( . )

._ _ _ _ . ._ _ _ _

49 02 50 9849 02

612

50 986

12

0 46 346346 0 46

0 9997 1 0 67720 0 3228 0

≤ ≤ =−

≤−

≤−

= − ≤ ≤= − −= − −= − =

µσ

P(Type II error)= 0.6769

Page 43: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

43

What about when µ=49.1?

[ ]

P X PX

nP ZF F

( . . ). . . .

. .( . ) ( . )( . ). .

49 02 50 9849 02 4910

612

50 98 49106

12

016 376376 016

1 1 056361 0 4264 05635

≤ ≤ =−

≤−

≤−

= − ≤ ≤= − −= − −= − =

µσ

P(Type II error)= 0.5635

What about when µ=48.9?

[ ]

P X PX

nP ZF F

( . . ). . . .

. .( . ) ( . )

. ).

49 02 50 9849 02 48 9

612

50 98 48 96

12

0 24 416416 0 24

1 059480 4052

≤ ≤ =−

≤−

≤−

= − ≤ ≤= −= −=

µσ

P(Type II error)= 0.4052

Page 44: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

44

What about when µ=48?

[ ]

P X PX

nP ZF F

( . . ). .

. .( . ) ( . )

. .

49 02 50 9849 02 48

612

50 98 486

12

2 04 596596 2 04

1 0 9793 0 0207

≤ ≤ =−

≤−

≤−

= ≤ ≤= − −= − =

µσ

P(Type II error)= 0.0207 Table of Results:

µ β Power=(1-β)

0.9793

0.5948

0.4365

0.3231

0.1700

0.1259

0.0500

0.1259

0.1700

0.3231

0.4365

0.5948

0.9793

Page 45: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

45

¸The most powerful tests are ones with the _______ ascending power. ¸These are tests that quickly recognize a ____ null hypothesis for ____ differences between µ0, the hypothesized value of the parameter and µ, the true value.

The Trade-Off Between α and β

˜ When the sample size, n, is _____, α and β have an _______ relationship. I.e. Cannot decrease α without __________ β.

Illustration: repeat the example from above, but this time α= 10% instead of α = 5%.

Since α has increased, the _________ region will be _______ and β willbe _____ than before. Hence the _____ of the test will increase.

Z0.10 = ± 1.___ Ô Critical values Hypotheses: H0: µ0=__ Ha: µ0…__

Critical and Acceptance Regions:

X Zn

* */ _ _ ( . ) _ _ .= ± = ± = ±µ

σα0 2 1645

612

08225

Critical Values are __.__ and __.__, so the acceptance region is from49.18 to __.__.

So,

Page 46: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

46

β= P(Accept H0:µ0=__| H0 is _____) = P(Accept H0:µ0=__| µ=__._)

Since H0 is accepted whenever lies between 49.18 and 50.82:X

β = P(49.18 # # 50.83 | µ=__._)X

[ ]

P X PX

nP ZF F

( . . ). _ _ ._ . _ _ ._

. .( . ( . )

._ _ _ _ ( ._ _ _ _ )

._ _ _ _

4918 50824918

612

50826

12

2 64 0 640 64 2 64

0 1 00

≤ ≤ =−

≤−

≤−

= ≤ ≤= − −= − −=

µσ

β=0.____ (which is less than 0.____).

This means that when µ=__._, we will ___________ accept H0:µ0=__ asbeing true about 73% of the time. The power of this test is (1-β)=__%.

¸This test will correctly reject the false null 27% of the time when µ is actually __._.

Note: Trade-off is not one-to-one.

( Increasing α resulted in a _____ β and a higher power.

¸Recall, in classical testing we fix the P(Type I Error) and attempt to

Page 47: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

47

construct a test that minimizes P(Type II error) to maximize _____:

Definition: The “Power Curve” associated with a test plots power against the value of the _________ under the test.

A) H0:µ=µ0 versus Ha:µ>µ0

Power=(1- β)

1P reject H( ?)0 µ =

µ0 µ* H0 true H0 is false

P reject H P reject H H is true( ) ( )0 0 0 0µ µ α= = =

Page 48: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

48

B) H0:µ=µ0 versus Ha:µ<µ0

Power=(1- β) 1

P reject H( ?)0 µ =

µ* µ0 H0 is false H0 trueP(Reject H0|µ=µ*)=Power at µ*=µ.

C) H0:µ=µ0 versus Ha: µ…µ0 Power

1.0

(1-α)

0.05

}P(reject H0 when H0 is true)

P reject H P reject H H is true( ) ( )0 0 0 0µ µ α= = =

Page 49: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

49

Definition: An ________test is one such that Power $ Size everywhere: (1-β) $ _.Definition: A __________ test is one whose power approaches 1 as the sample size goes to ________.

Power as n⇒ ⇒ ∞1 .

Meaning, we will always reject the ____when the null is _____, assample size is large.

¸We are using unbiased and __________ test here.

Decreasing α and β by Increasing the Sample Size (n)

¸The former examples fixed the sample size in advance.

¸If _ changes, the size of α and β may be changed, since the size of n affects the __________ of the underlying probability distribution and the location of the acceptance and critical regions.

¸When n is _________, the test usually becomes more sensitive in distinguishing H0 and Ha since the acceptance region is _________.

, α and β ________.

Page 50: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

50

Example: H0: µ=__ Ha: µ…__

n=625σ=6α= 0.05Zα/2=±____

Critical and Acceptance Regions:

X Zn

* */ _ _ (_ _ _ _ ) _ _ .= ± = ± = ±µ

σα0 2

625

0 4704

Acceptance region is between 49.53 to 50.47.

What if µ=__._?

[ ]

P X PX

n

P Z

P ZF F

( . . ). _ _ ._ . _ _ ._

..

..

. .( ._ _ _ _ ) ( )

.

49 53 50 4749 53

625

50 476

25

0 970 24

0 030 24

4 04 0131 0 1 1

0 4483

≤ ≤ =−

≤−

≤−

=−

≤ ≤−

= − ≤ ≤ −= − − −=

µσ

β=0.4483(1-β)=0.5517 Ô_____ has increased.

Page 51: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

51

Section 10.7 One Sample Test On σ2

Test On A Population Variance σ2

¸Recall ~ , is a χ2 random variable with:( )n s− 1 2

2σχ ( )n−1

2

n = sample sizeσ2 = population variances2 = sample varianceParent population is _______ and (n-1)=ν= degrees of freedom.

”We can use this random variable to test hypotheses about the _______ population variance σ2 based on the sample estimator s2.

”The appropriate ___statistic for hypotheses about σ2 for (n-1) degrees of freedom is:

Equation 10.5χ ( )

( )__n

n− =

−1

22

02

1

where is the ___________value of σ2 when H0 is ___ and s2 is theσ 02

sample variance:

.sn

X Xii

n2 2

1

11

=−

−=∑( )

Page 52: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

52

Example: Suppose Xi ~ N(__, σ2)α =0.__n=__s2=__

Step 1: H0: σ2=__ Ha: σ2<__ Ô(One-sided alternative in the lower direction)

”We will _____ H0 for small values of the test statistic, and all ‘___’ is in the ____-hand tail.

Step 2: Test statistic: χσ( )

( )n

n s− =

−1

22

02

1

Step 3: ________ Values: : P( < )= 0.05.χ 2* χ242 χ 2*

Using the Chi-square table: = __._.χ 2*

Step 4: Calculate the test statistic: for this sample:χ c2

χ c2 600

2142=

= =

(_ _ )(_ _ )_ _ _ _

. .

Page 53: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

53

Step 5: Decision Rule:

(i) Reject H0 if is less than __._: < ______ the null.χ c2 χ c

2 χ242*

(ii) ______ H0 if is greater than __._: > do not reject χ c2 χ c

2 χ242*

the ____.

oSince > , we ______ H0. χ c2 χ24

2*

We cannot reject H0 that σ2=__ at the _% significance level.

P( |H0 true)χ242

χ242

12 13.8 21.42

Page 54: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

54

Example: Two Sided Test

Suppose Xi ~N(__, σ2)α =0.__n=__s2=__

Step 1: H0: σ2=__ Ha: σ2 …__

” So ______ H0 if the test statistic is “too _____” or “too _____.”

Step 2: Test statistic: χ σ( )

( )n

n s− =

−1

22

02

1

Step 3: Critical Values: Must split _% rejection region into 2 tails: __% in each ____.

( )( )

χ

χ

χ χ χ χ

χ χ χ χ

Upper

Lower

n Upper n Lower

Upper Lower

critical upper tail value

critical lower tail valueThen

P or H is true

P or H is true

2

2

12 2

12 2

0

242 2

242 2

0

0

0

=

=

=

=

− −

.

.:

._ _

._ _

> <

> <

From the Table VII: χχLower

Upper

2

2

=

=

_ _ _ __ _ _ _

Page 55: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

55

Step 4: Compute the test statistic:

χ c2 24

2142=

= =

( )(_ _ )_ _

_ _ __ _

. .

Step 5: Decision rule:

(i) _____ H0 if calculated value of test statistic, , for this sample is χ c2

less than ____ or greater than ____.

(ii) ______ H0 if calculated value of the test statistic, , is between χ c2

____ and ____.

ª Since =_____, which falls into the __________ region, we χ c2

cannot ______ the hypothesis that σ2 =__ against the 2-sided alternative at the __% level of significance.

12.4 39.4 χν2

χ lower2 χ c

2 χupper2

Page 56: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

56

Chapter 11: Hypothesis Testing: Multi-Sample TestsSection 11.1-3Introduction

pIn Chapter 10 we tested hypotheses about the population parameters from a _______ sample.pThis chapter will examine hypothesis testing when two or more _______ are drawn and we want to determine if these samples originate from the ____ population.

Example: Are their population _____ or variances the same? If they are, then the same population generated the various samples.

Typical Examples:

1) Testing the different treatment of two groups for ____ development: p _______ Group p_________ Group

2) Tests on income ________: Is the mean _____of university graduates in Victoria and _______ the same?

3) Is the mean salary of ______ employees the same as ____ employees in the __________ industry?

4) Is the male unemployment rate in _________ the same as the female unemployment rate?

pWe will explore two primary cases:

Page 57: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

57

I) Testing equality of two means when are both _____.( )σ σ12

22and

II) Testing ________ of two means when are both ( )σ σ12

22and

_______.

Case I: Testing Equality of Two _____

(Population variances are _____)( )σ σ12

22and

Let µ1 = mean of population 1.Let µ2 = mean of population 2.

If we restrict the null hypothesis to be _______in format, the commonform of H0 is:

H0:µ1 - µ2 =___ or

H0:µ1 = µ2

These two means are _____.

The alternative hypothesis could then be either:

Ha: µ1 - µ2 >0 Ôµ1 _______ µ2or

Ha: µ1 - µ2< 0 Ôµ2 _______ µ1or

Ha: µ1 … µ2 Ôµ1 not equal to µ2

pSuppose a simple random sample of size n1 is taken from population

Page 58: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

58

1 and . Also, suppose a simple random sample of size X i1 12~ N( 1µ σ, )

n2 is drawn from population 2 and .X i2 22~ N( 2µ σ, )

pDetermine the Test Statistic:

Recall, when we standardize the Z-variable is:

(11.1)Z =−Point _ _ _ _ _ _ _ _ Hypothesized _ _ _ _

Standard _ _ _ _ _ of point estimator

The “best” ________ point estimator of (µ1 - µ2) is (_______), where:

= ____ of sample 1( )X1

= ____ of sample 2( )X2

The standard error of (______), denoted is:( )σ X X1 2−

V X Xn nX X( )1 2

12

1

22

21 2

− = = +−σσ σ

Note: Assuming independence:V V V

V V

n n

V Xn

(_ _ ) (_ ) ( _ )(_ ) (_ _ ) (_ )

( ) .

1 2 1 2

12

2

12

1

22

22

− = + −

= +

= +

=

σ σ

σsince

pIf and are normal, is ______. (It is a linear X1 X2 ( )X X1 2−

Page 59: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

59

function of normal random variables.)

Hence,(________)~ N n n( ), .µ µ

σ σ1 2

12

1

22

2− +

Thus, we can derive the test _______ for finding the 2-sample computedZ-value for H0:µ1 - µ2 =__:

~ N(0, 1).Z =(_ _ ) ( )1 2 1 2

12

1

22

2

− − −

+

µ µσ σn n

pThe ________ values are determined the same way as in the former sections:

If α=0.__, then α/2=0.___Z*=±____ will be the critical values.

pCompute the test statistic:

Z

X X

n n

C =− − −

+

( ) (_ _ ).1 2 1 2

12

1

22

2

σ σ

pDecision Rule :Reject H0 if: the calculated values of Z are _______ than the ‘+’

Page 60: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

60

critical value or ____ than the ‘-’ _______ value.Reject H0: Z* < __ or -Z* > __.

Cannot reject otherwise.

Example: Determine if the length of _____to find employment in their field of study of university _________ in Ottawa (µ1) and ________ (µ2) is the same:

pTest Hypotheses: H0: µ1 - µ2 = __ }| __ = __

Ha: µ1 - µ2 … __ }| __ … __

α=0.__=_% | Z*=±_____

Let: n X weeks weeksn X weeks weeks

1 1 12 2

2 2 22 2

122 3

= = =

= = =

_ _; _ _ _ ; . ._ _ ; _ _ _ ; . .

σσ

pCompute Z Value:

ZC =− −

+

=−+

=−

=−

= −

(_ ._ _ ._ ) ( ). .

.. .

..

..

_ ._ _

01250

2 375

190 024 0 03067

190 0546

190 2338

Since -Z* >Zc, we ______ H0.

pInterpretation:

Page 61: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

61

We believe the _______ amount of _____these graduates search for jobsin their field of study in each city is statistically _________.

pUsing the P-value:

p-value =2P(Z#-____) = 2(1-P(Z # ____)) =(2)(1-_) =2(0.____) =_____

Since ____ is ____ than 0.__, we ______ the null hypothesis.

Page 62: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

62

Case II:Difference Between Two Means

(Variances Are _______ But Assumed _____)( )σ σ12

22and

p The two-sample Z test is appropriate only when both

are _____.( )σ σ12

22and

p When the _________ are unknown, the _-distribution can be used to test the two means if both parent populations are ______.

p If the parent populations are not normal, but n1 and n2 are _____ (ni $15), the _-distribution can be used.

p The other difference from the last section is the assumption that

are the ____ population variance; Both ( )σ σ12

22and

must represent two estimates of the ____ population s s12

22and

variance.

Example: Suppose X ~1i N n sample size( , ); .µ σ1 12

1 =

X N n1 112

1~ ( , )µ σ

X ~2i N n sample size( , ); .µ σ2 22

2 =

X N n2 222

2

~ ( , )µ σ

Page 63: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

63

We want to test:H0: µ1 - µ2 = __ }| __ = __

Ha: µ1 - µ2 … __ }| __ … __

Test Statistic:

Use the t-distribution since X1 and X2 are normal and variances are_______ but _____:

Form:

( )

( )

sn

X X

sn

X X

ii

n

ii

n

12

11 1

1

2

22

22 2

1

2

11

11

1

2

=−

=−

=

=

r If are not _____, which estimate of s s12

22and

the population ________ of σ2 do we use?

Use a ________ average of the two estimators.

,This will produce a more ________ and reliable estimate of the population variance than using one or the other.

pUse ‘________’ degrees of freedom as the ______ relative to the total number of degrees of freedom.

Page 64: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

64

Let: s2= weighted average.

s s s

sn s n s

2 1

1 212 2

1 222

2 1 12

2 22

1 2

12

12

1 12

=−

+ −∗ +

−+ −

=− + −

+ −

(_ )(_ _ )

(_ )(_ _ )

*

( ) ( )(_ _ )

p Now: ~ NormalX X1 2−

E X XandV X X V V by

n n

n n

( )

( ) (_ _ ) (_ _ ) _ _ _ _ _ _ _ _ _ _ _ _

1 2 1 2

1 2 1 2

12

1

22

2

2

1 2

1 1

− = −

− = + ⇐

= +

= +

µ µ

σ σ

σ

since σ2 denotes the two equal variances: .σ σ12

22 2= =_

Hence, the test statistic is: .( ) ( )

t

n n

=− − −

+

_ _

_

1 2 1 2

2

1 2

1 1

µ µ

p Using s2 as the estimator of σ2, we get the test statistic:

Page 65: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

65

( ) ( )( )t

n n

=− − −

+

+ −

_ _

_

1 2 1 2

2

1 2

21 1

µ µ~ t

n n1 2

which is the two sample t-test statistic.

Example: Test the difference in average ________ paid to _____ in _______ and Vancouver, assuming salaries are normally distributed. Test at the α=0.__ level of significance.

_ _ _ _ _ _ _nXs

1

1

12

10000000

==

=

$_ _ ,$_ _ _ ,

VancouvernXs

2

2

22

9500

000

==

=

$39,$_ _ _ ,

Test:

H0: µ1 - µ2 = __ }| __ = __ Average ________ are the same.

Ha: µ1 - µ2 …__ }| __ … __ Average ______ are not the same.

,The degrees of freedom = __ + 9 - 2= __.

Page 66: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

66

,t__ ~ under the null hypothesis. ,The ________ value from a two-sided alternative is: t*=±____

Computed Value:

( ) ( )

( ) ( )

tX X

n n

s

C =− − −

+

− −

+

1 2 1 2

2

1 2

2

1 1

000 39 500 0

110

19

µ µ

_

_ _ , ,=

Must determine s2 :

sn s n s2 1 1

22 2

2

1 2

1 12

1 000 9 1 0009 2

000 8 00017

1 350 000 1 240 000 000

=− + −

+ −

=− + −

+ −

=+

=+

= =

( ) ( )(_ _ )

(_ _ )(_ _ _ , ) ( )(_ _ _ , )(_ _ )

(_ )(_ _ _ , ) ( )(_ _ _ , )( )

, , , ,_ _

_ ,_ _ _ ,_ _

_ _ _ ,_ _ _ ._ _

So, calculating tc:

Page 67: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

67

( ) ( )

( ) ( )

( )

tX X

sn n

s

C =− − −

+

− −

+

=−

+

= =

1 2 1 2

2

1 2

2

1 1

000 39 500 0

110

19

000 39 500

352 941

1019

1500179 29

µ µ

=_ _ , ,

_ _ , ,

_ _ _ , .

._ ._ _ _

t*= ± _.__ and tc= _.___

,Reject H0 if tc is _______ than _.__ or less than -_.___

p We clearly ______ the null. p The average salaries for women are statistically _______in each city.p

Page 68: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

68

Section 11.6 Two Sample Tests for Population _________

˜Suppose we have random variables from two ___________:

X NX N

i

i

1 1 12

2 2 22

~~

( , )( , )µ σµ σ

%If the two populations have _____ variances , then ( )σ σ12

22=

the populations are _____________ ( “homo” = same, “scedastic” = variability) as opposed to _______________ (different variability).

Why would we be interested in whether populations are____scedastic or ______scedastic?

%To answer questions like:

(1) Is the variability in ____ worked different between the United Statesand ______?(2) Is the variability of the ____________ rate in Canada over the last year the _____as in the United States?(3) Is the variability of ___ prices in Victoria and ______ the same?

%To test if the ___________ is the same across populations, the null hypothesis is usually in the form:

Ô the variability is the ____ across these two H0 12

22:_ _=

populations.

Page 69: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

69

%The alternative hypothesis can be any of the following:

Ô the variability is ___ the ____ across these two Ha:σ σ12

22≠

populations.

Ô the variability in population 2 is _______ than the Ha:σ σ12

22<

variability in population 1.

Ô the __________in population 1 is greater than the Ha:σ σ12

22>

variability in population 2.

%The form of the alternative hypothesis will affect the _______ values and p-values, but ___ your approach to testing hypotheses.

Illustration: ___-sided Alternative Hypothesis

Suppose or H0 12

22:σ σ= H0 1

222: / _ _σ σ =

or Ha:σ σ12

22≠ Ha : / _ _σ σ1

222 ≠

%Since the population variance is estimated using the sample variance, we can base the test of H0 on the _____ of ______ variances

:( )S and S12

22

Page 70: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

70

Recall

Sn

X X

Sn

X X

ii

n

ii

n

12

11 1

2

1

22

22 2

2

1

11

11

1

2

=−

=−

=

=

( )

( )

The sampling distribution for testing hypotheses about two ________ iscalled the _-distribution.

Hence the test statistic is: F = __ .1

2

22

The _-distribution tests whether , by taking the _____ ofσ σ12

22=

the two sample variances, .( )S and S12

22

%If H0 is ____, we would expect ‘_’ to be close to _.

%Thus, the more _ differs from _, the more likely we will _____ H0 for the alternative Ha.

˜To test H0 using , we need to know the distribution of the _

_12

22

test statistic under the null.

It can be shown that if H0 is true:

SS

where n rees of freedom for samplen rees of freedom for sample

12

22

1 1

2 2

1 2

1 11 2

~_ν ν

νν

,

: deg .deg .

= − == − =

Page 71: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

71

Things To Note:

1) An _ random variable has ___ different degrees of freedom associated with it.

___

( )

( )= =

−−

−−

=

=

∑12

22

11 1

2

1

22 2

2

1

11

11

1

2

nX X

nX X

ii

n

ii

n

_-values contains a random variable with (n1-1)=ν1 d.f. in the numeratorand (n2-1)= ν2 in the denominator.

2) Since the test statistic is a _____ of two numbers which are $ 0, an _ random variable is always $__.

3) _- distribution is __________ skewed.

Looks like a ___-square distribution because the _-distribution is theratio of 2 independent _2 distributions.

4) Use _-tables to find critical values associated with a given ν1 and ν2.

Page 72: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

72

Example: Let ν1 =_ and ν2=___. We want the 5% critical _ value: Want F* such that P(F5,10$F*)=0.05.(Notice that the tables are only for _% and _% upper tail regions.)

f (F5,10)

5%

F*From Table VIII (a) in H&M, page 896, we have

F*=____ þ P( F5,10$____)=0.05.

Example: Let ν1 = _ and ν2= _. We want the 1% critical F value: Want F* such that P(F3,8$F*)=0.01. From Table VIII (b) in H&M, page 898, we have

F*= þ P( F3,8$____)=0.01.

So how do we determine the _____ tail critical values,since this is not a symmetrical distribution?

Page 73: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

73

Low Side F-Test

A lower-tail critical value can be found by the following:

FFLower critical value

critical value, ,

_____ , ,ν ν

ν ν1 2

2 1

1=

“A _____-tail critical value of F can always be found byreversing the _______ of freedom of the numerator and thedenominator, determining the corresponding value in the_____ tail of the F-distribution, and then taking thereciprocal of this number.”

Suppose we want F* such that P(F3,12 #F*)= 0.05:

f (F3,12)

F3,12 0.___ 8.74Since where F*Upper, 12,3 is the F-value such thatF =

1F

(Lower 3,12)*

(Upper 12, 3)*

P(F 12,3 $ F* upper, 12,3)=0.05. Value þ _.__

Page 74: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

74

Hence:

F =1

F=

1_ ._ _

= 0._ _ _

P(F

(Lower 3,12)*

(Upper 12, 3)*

3,12 ≤ =0 0 05._ _ _ ) .

0.___ 8.74

4) It does ___ matter if your ____ statistic is :

SS

SS

SS F

22

12

12

22

22

12

2 1

instead of

Under the null is distributed

.

.,ν ν

Simply put the d.f. in the same order as the _____ of the variances.

Page 75: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

75

Returning to the two-sided alternative test: versus .H0 1

222:σ σ= Ha:σ σ1

222≠

Let α= 0.02. P(Reject H0|H0 is true) =0.02.

For a 2-sided alternative put 0.01 (α/2=0.01) in each tail and find thecritical values of F with ν1 and ν2 degrees of freedom.

f (Fv1,v2)

1% 1%

Fv1,v2 F*low F*up

¸Finally calculate the value of the test statistic: , and see if it S

S12

22

falls in the critical region or the acceptance region.

Decision Rule

(I) ______ H0 if: F or FC C< F > Flower*

upper* .

(II) _____t H0 if: F Flower*

upper*≤ ≤FC .

Page 76: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

76

Example: Suppose you wish to test the variability in ______ times between Vancouver and _______ and between _______ and Vancouver. A sample of __ flights is taken for the route originating in Vancouver (n1 =__) and a sample of __ flights is taken for the route originating in _______ is taken (n2=__). We find that and S1

2 225= minutes

Assume that the flight times are S22 216= minutes .

normally distributed. Let α=0.10.

Test versus .H0 12

22:σ σ= Ha:σ σ1

222≠

Test Statistic is:

_ ,

: _ _ ._ _ .

,=

= − == − =

SS under the null

where n degrees of freedom for sample 1n degrees of freedom for sample

12

22

1 1

2 2

1 2

11 2

~ Fν ν

νν

P(Reject H0 | H0 is true )=0.10.

So, [ ] [ ]P F F P F Flower upper≤ = ≥ =, ,*

, ,* .15 20 15 20 0 05

From the cumulative F-distribution table (α=0.05) :

F FFupper lower

upper, ,

*, ,_

*

, ,*_ _ _ _

_ _ _ _._ _ _ _ .15 20 15

20 15

1 10= = = =and

Calculating FC:

Page 77: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

77

FSSC = = =1

2

22

2516

15625. .

Since Fc is 1.5625, which lies _______ 0.___ and _.__, we cannot _____the null and conclude the two samples are from a population with the____ variability.

We could also use:

F SS under the null

where n degrees of freedom for sample 1n degrees of freedom for sample

=

= − == − =

22

12

1 2

2 1

2 1

1 201 15 2

~ Fν ν

νν

, ,

: ..

So, [ ] [ ]P F F P F Flower upper≤ = ≥ =, ,*

, ,* .20 15 20 15 0 05

From the cumulative F-distribution table (α=0.05) :

Page 78: Topic 3: Hypothesis Testingweb.uvic.ca/~bettyj/246/topic3web.pdf · Topic 3: Hypothesis Testing Section 10.1 Introduction # Within Topics 1 and 2, we employed point and interval estimation

78

F and FFupper lower

upper, ,

*, ,

*

, ,*_ _ _ _

_ _ _ _._ _ _ .20 15 20 15

15 20

1 10= = = =

Calculating Fc:

FSSC = = =2

2

12

1625

0 64. .

Since Fc is 0.64, which lies _______0.___ and 2.__, we cannot ____ thenull and conclude the two samples are from a population with the ____variability.