3301 topic 9 hypothesis testing

HYPOTHESIS TESTING

Steps in Hypothesis Testing - 1• State the hypothesis to be tested in terms of Ho and

• State the significance level of the test ()• Take sample of known size (n) and compute

parameter estimate (e.g. )• Determine the test statistic - a statistic based on

the sample and whose distribution is known– e.g. for hypothesis about a mean () the statistic may

be z or t

Steps in Hypothesis Testing - 2

• Determine the criterion for rejecting the null hypothesis (region of rejection, R.R., or decision rule)

• Make a decision based on the criterion

• Make a conclusion

Test on a Single , Variance Known: Hypothesis

• See example 10.4

• We would like to test the research hypothesis that the mean human life span is not 70 yr. Ho and Ha are:

Ho: = 70

Ha: 70• Note: Ho may be written always with an equality (=) sign

Test on a Single , Variance Know: Test Statistics

• We sample death records from 100 persons and found yr

• Assume true s.d. is = 8.9 yr

• The appropriate test statistic (T.S) is Z, where

and o is value of the hypothesized mean in Ho

Test on a Single , Variance Known:Region of Rejection (R.R.) - 1

• If Ho is true we would expect the sample mean, , to be equal or close to o (o = 70)

– that is, we would expect Z to be equal to or close to 0 (zero)

• The probability of obtaining a large or small Z-value is therefore low if Ho is true

– that is, we would reject Ho if the Z-value is large (small) and conclude that > 70 or < 70

Test on a Single , Variance Known: Region of Rejection (R.R.) - 2

• How large the Z-value should be for rejection is arbitrarily set by a reasonably low error of falsely rejecting Ho

• This error (Type I error) should have a small probability () of occurring

is usually set at .01 or .05 and is called

the significance level of the test

Test on a Single , Variance Known: Types I and II errors

= Pr (Type I error) = Pr (rejecting Ho|Ho is true)

• β = Pr (Type II error)

• β = Pr (not rejecting Ho|Ho is false)

• Power of the test =1- β

Test on a Single , Variance Known:Region of Rejection (R.R.) - 4

• Ha: 70 is called a 2-sided alternative

• This means that we can reject Ho if Z falls in either the left or right side R.R.

• So each R.R. has area = , not 2

Test on a Single , Variance Known: Region of Rejection (R.R.) - 5

• To decide if Ho should be rejected we must determine if the calculated Z (from ) is greater than or less than -

• There are two ways of writing this:

• R.R.: Reject Ho if | Z | > Zα/2

• R.R.: Reject Ho if or

Test on a Single , Variance Known:All Steps

• Ho (Null Hypothesis)

• Ha (Alternative Hypothesis)

-value

• T.S. (Test Statistic)

• Region of Rejection (R.R.)

• Decision

• Conclusion– Note:

• Ha may also be written as H1 and

• R.R. may also be called Decision Rule (D.R.)

Test on a Single , Variance Known: Example - 2

• Ho: = 70

• Ha: 70

• Select = .05 arbitrarily (So

• T.S.:

)96.1025.2

02.2100/9.8

708.71

Test on a Single , Variance Known: Example - 2

• Decision: We reject Ho, since Z(=2.02) > Z.025(=1.96)

• Conclusion: No evidence that the true mean life span is 70 yr

• Note: = .05 means that there is a 5% chance of falsely rejecting Ho

Test on a Single , Variance Known: Right-Sided Test

• Suppose we are sure that can’ t be < 70 and therefore test only > 0; that is

Ho: = 70

Ha: > 70

• This is called a 1-sided right-tail test• For this 1-sided test R.R. is different

R.R.: Reject Ho if Z > Z

• All the error probability () is now in the right tail of the distribution

Test on a Single : Variance Known - Right-Sided Test: Example

• Ho: = 70 or ≤ 70

• Ha: > 70

• T.S.:

• R.R.: reject Ho if Z > Z

• Decision: We reject Ho since Z(=2.02) > Z.05(=1.645)

• Conclusion: No evidence that the true mean is less than or equal to 70 yr.

– Note: For a 1-sided test Z = Z.05 = 1.645

Test on a Single : Variance Known - Left-Sided Test

• Suppose we wanted to test Ha: < 70; then

• Ho: = 70 or ≥70

• Ha: < 70

• T.S.: 02.2/

Test on a Single : Variance Known - Left-Sided Test

• R.R.: Reject Ho if Z < Z

• Decision: We do not reject Ho since Z(=2.02) not < Z.05(= -1.645)

• Conclusion: No evidence that the true mean is less than 70– Note: (- Z = -1.645)

Hypothesis Tests: General Comments

• The R.R. depends on Ha and -value

• The conclusion is related to Ha

• We say ‘Do not reject H0 ’ instead of ‘Accept H0’ because our Type II error may be large. If we computed the Type II error probability () and found it to be small then we can say ‘Accept H0’

Test on a Single : 2 Unknown

• If is unknown, we use its estimate, s, from the sample

• The test statistic becomes

Because of s this statistic is not

Z-distributed. It has a student-t distribution with (n-1) degrees of freedom

Test on a Single : 2 Unknown - Example

• We wish to test a research (alternative) hypothesis that the mean energy consumption per home is less than 46 kilowatt-hours ( < 46)

• The energy consumption, X, is ND

• A sample of n = 12 homes gave = 42 and s = 11.19 kilowatt-hours

Test on a Single : Variance Unknown - Steps

• Ho: = 46 or ≥ 40

• Ha: < 46

• T.S.:

• R.R.: Reject Ho if t < t,n-1

• Decision: Since t (= -1.16) is not < t,n-1 (-1.796) we do not reject Ho

16.112/9.11

Test on a Single : Variance Unknown - Steps

• Conclusion: No evidence that the true average energy expenditure is less than 46 kw-hr

• Note: Because we are performing a

1-sided test (see Ha) all the error probability is in one region (the left) of the distribution; hence we use t , not t/2

Test on a Single : Variance Unknown - Size of n

• For the above t-test ( unknown) we may use the Z-test as an approximation when n 30. This is a consequence of the Central Limit Theorem

• The T.S. for the Z-approximation is

where the s-value from the large sample size (n 30) is written as

C.I. Approach to Hypothesis Testing

• We can test Ho: = o vs Ho: o using a C.I.

• We replace the usual test static with the (1)% C.I.

• Our Decision Rule (R.R.) is to reject Ho if the C.I. does not contains zero

C.I. Approach to Hypothesis Testing: Example

• For the earlier 2-sided example

• Ho: = 70

• Ha: 70 = .05

• T.S.: 100

9.896.18.71

)54.72,06.70(

C.I. Approach to Hypothesis Testing: Example

• D.R.: Reject Ho if the C.I. does not contain the hypothesized value

• Dec.: Since 70.06 72.54 does not contain 70.0, we reject Ho

• Conclusion: No evidence that the true mean is equal to 70

Test on the Difference in Two Means (1 2)

• Consider two independent normal populations. A test on the two means (e.g. 1 2) uses either of 3 test statistics depending on what we know about the true (population) variance, and . The 3 conditions are:– 1. and are known

– 2. and unknown, but

– 3. and unknown, but

222222

Test on the Difference in Two Means (1 2)

Test on (1 2) When And Are Known - 1

• Consider a 2-sided test

• Ho: 1 2 = do (do is a specified value)

• Ha: 1 2 do

• T.S.:

(Note: do = (1 2)0

Test on (1 2) When And Are Known - 2

• Recall that there can be 3 different Ha s and each has a different R.R.

• 1. Ha: 1 2 do

R.R.: Reject Ho if or

• 2. Ha: 1 2 > do

R.R.: Reject Ho if

• 3. Ha: 1 2 < do

R.R.: Reject Ho if

D.R. Rej ,if

0: 21210 orH

0: 2121 uuoruH a

07681:..

0H 025.0ZZ

Dec. Rej ,since 5.85 >1.96 Conclusion. No evidence that the two population means are equal.

Difference in Means Hypothesis Test: σ’s known – Problem 10.30 page 358

Test on (1 2) When And Are Known - Example

• Work example 10.6, page 314

Test on (1 2) When and are Unknown

and - 1

• For unknown we must use the sample estimates and

• When , the common variance is

• We must use the sample estimate of 2 (i.e. )

• This is computed as a pooled variance from the two samples

and - 2

• Since the are unknown the test statistic is the t-statistic not the Z-statistic

)1()1(

nSnSS p

and - 3• Consider a 2-sided test

• Ho: 1 2 = do

• Ha: 1 2 do

• T.S.:

2, 212 nntt 2, 212 nntt

and - 4• The T.S. (t) was formed by replacing

and of the known variance formula by

• The d.f. for t is the sum of the d.f.s for the 2 samples; i.e. (n11)+(n21) = n1+n22

• One sided tests are done in the usual way by modifying Ha and R.R.

P34610.6

and - 1• Again use and but do not pool

them since they are not from a common population variance

• The T.S. is formed by replacing and

in the known variance test by and

• T.S.:

221s 2

and - 2

• The degree of freedom for this t-distribution

is not (n1+n22). It is (pronounced ‘new’), where

and - 3

• Ho: 1 2 = do

• Ha: 1 2 do

• T.S.:

,2tt ,2

Paired t-test: Concepts - 1• Data are collected in pairs

– e.g. strengths of right (x1i) and left (x2i) hands in people

• The two sets of data are not independent

• The objective of the test is to compare the effects of the two conditions that comprise the pairs– e.g. to determine if the right hand is stronger than

the left hand– e.g. to determine if people’s average weight before

dieting for a week is greater than after dieting

Paired t-test: Concepts - 2

• Instead of testing (1 2) we compute the difference (x1i x2i) for each pair and then test the mean of the difference, d

• The differences are therefore tested as if they are one sample of observations

• This test is therefore like the one sample (t-test) on a single with 2 unknown

Paired t-test: Steps

• Ho: d = do

• Ha: d do

• T.S.:

1,2 ntt 1,2 ntt

Paired t-test: Example - Data

• See example 10.7, page 316

• We want to test whether androgen level in deer blood after 30 minutes of capture is different from the level at time of capture

• Data on 15 deers at time of capture (x2) and 30 minutes after capture (x1) are taken

• (x2i x1i) = di are computed for i = 1 to 15

• and sd = 18.474848.9d

Paired t-test: Example - Test

• Ho: d = 0

• Ha: d 0

= .05• T.S.:

• R.R.: Reject Ho if

• t (=2.06) > t.025,14 or t < t.025,14

06.215/474.18

0848.90

Paired t-test: Example- Test

• Decision: Since • 2.06 not > 2.145 (t.025,14) and

- 2.06 not < -2.145

we do not reject Ho

• Conclusion: The androgen levels at capture and after 30 minutes are not significantly different

PAIRED T-Test(example 2)

• Q: Weight loss programn=10 participants

Determine if the program was effective

Find di = x1-x2

X1 Starting Weight

210 240 150 190 200 150 180 185

X2 Ending Weight

200 180 155 200 150 140 170 168

PAIRED T-TEST (eg. 2)X1 Starting

Weight210 240 150 190 200 150 180 185

X2 Ending Weight

200 180 155 200 150 140 170 168

di Weight lost

10 60 -5 -10 50 10 10 17

Find , find Sd, like solving for , Sx

H0: µd = 0Ha: µd ≠ 0

T.S. t=

Continue as for any other t-test

Test on : Example

• We would like to test if the variances from two normal populations are equal

• Samples of size n1 = 21 and n2 = 25 gave

84.3s21 54.4s2

Test on : Example

• Ho: or

• Ha: or

• T.S.:

85.054.4/84.3s

Test on : Example

• D.R.: Reject Ho if F(=0.85) > F.025,20,24(=2.33)

or F < F.975,20,24(=.415)

• Decision: Since 0.85 not > 2.33

and 0.85 not < .415

we do not reject Ho

• Conclusion: The variance of the two populations are not significantly different

Test on : Comments

• is the F value with area to its right

• Table for F-distribution are usually given for small -values; e.g. 0.10, 0.05, 0.01, etc.

• So if we need, say, F.95,10,12 (i.e. )

then we use

21 F21

12,10,1 2F

10,12,05.12,10,95.

Large Sample Test on a Proportion (p)

• Recall:

• For large samples

• An approximate test statistic is therefore

• Since p is unknown it cannot be used in the Z-statistic. Instead we substitute the hypothesized value (po)

xp pp 2

ˆˆ ,,ˆ

),(~ˆnpqpNDp

Large Sample Test on p - Steps

• Ho: p = po

• Ha: p po

• T.S.:

• R.R.: Reject Ho if or 2ZZ

Large Sample Test on (p1 p2)

• For 2 sample proportions and

• When samples are large

• A suitable T.S. is therefore

1p̂ 2p̂

21ˆˆ 21pppp

112ˆˆ 21 n

NDpp ~)ˆˆ( 11

)()ˆˆ( 2121

oppppZ

Large Sample Test on (p1 p2)

• Since p1, p2 , q1 and q2 are unknown we substitute the sample values but we pool the samples to get a pooled and p̂ q̂

21ˆnn

Large Sample Test on (p1 p2): Steps

• Ho: (p1p2) = (p1p2)o

• Ha: (p1p2) (p1p2)o

where (p1p2) is a hypothesised value

• T.S.:

• R.R.: Reject Ho if or )(ˆˆ

Test on a Single Variance ( 2)

• The objective is to test if 2 is of a certain value ( )

• We estimate 2 from a sample (S2)

• Since the distribution of S2 is unknown, we convert it to 2

where the 2 has (n-1) degrees of freedom

222 /)1( oSn

Test on a Single Variance ( 2): Steps

• Ho:

• Ha:

where is a specified value• T.S.:

22o 22o

222 /)1( oSn 2

2 n 21,1

Test on 2: Comments

2 is always positive, so the left-tail critical region is

• for = .05

• is the 2 value with area .975 to its right

• This 2 test is for a single population variance. For comparing 2 population variances, we use the F-test

Test on Two Variances ( )

• The objective is to compare the variances of two populations

• If the two populations are normal then

• If we assume (i.e. ) as in Ho, then and can be used as a test statistic for this test

Test on Two Proportions ( )- Steps

• Ho:

• Ha:

• T.S.:

21 )/(/

1,1, 212 nnFF

1,1,1 212 nnFF

Test of Independence0 lights 1 light 2 lights 3 lights 4 lights Sums

Morning 4 6 20 65 64 159

Afternoon 8 12 20 75 72 187

Night 10 8 10 16 10 54

Sums 22 26 50 156 146 400The table indicates the number of times a certain amount of traffic lights were stopped at (0,1,2,3, or 4) by a vehicle on a workday within a small city. The data are for 400 vehicles. Let L = the number of lights stopped at by a vehicle. The 2 factors (variables) are ‘number of lights’ and ‘time’. The question: Does the number of lights stopped at depend on the time.

Calculations: The above are the observed frequencies (Oi). Find the expected frequencies (Ei) for each cell from Ei=ni*nj/n; e.g. For the first cell Ei = 22*159/400.

Now compare Ei and Oi using the Chi-square statistic as done in goodness of fit tests.

95.145.1 x

45.295.1 x

95.245.2 x

45.395.2 x

95.345.3 x

45.495.3 x

95.445.4 x

Given the following frequency distribution for the battery life data determine if battery life (X) follows a normal distribution.

GOODNESS OF FIT HYPOTHESIS TEST

Calculation of pi

Calculate pi as follows:

P(1.45<x<1.95)

=P( <Z< ) = 0.0125

If at least 80% of the are not 5 then combine classes.

2 0.0125 0.5

1 0.0525 2.1

4 - 5.9

15 - 10.3

10 - 10.7

5 - 7.0

3 - 3.5

95.145.1 x

45.295.1 x

95.245.2 x

45.395.2 x

95.345.3 x

45.495.3 x

95.445.4 x

ii Of ipiE iii EEO /2

5.8/5.87 2

3.10/3.1015 2

7.10/7.1010 2

5.10/5.108 2

05.322

Goodness of Fit Hypothesis Test

Reject if (k-p-1=4-2-1=1)

Decision: Since 3.05 < 3.841 do not reject

Conclusion: No evidence that data comes from a normal distribution with mean = 3.5 and variance =

ixH :0 22 7.0;5.3~ ND

NDnotxH ia ~:

05.3/:.. 22 iii EEOST :..RD

2 pkoH

3301 topic 9 hypothesis testing

Documents

topic 17 simple hypotheses - university of...

model: 3301 -...

ap statistics topic 7 chi squared tests hypothesis tests for...

r and stats - pdcb topic hypothesis testing: parametric...

topic name-concept of hypothesis

mkt 3301 ch 1 pp

syllabus welcome to physics 3301 phys 3301, modern physics

solution 59670116 3301

session 7 introduction to research and evaluation topic 1:...

differences among the topic, problem, purpose, and questions...

topic 8 hypothesis testing mathematics & statistics...

hypothesis testing hypothesis testing topic 11. hypothesis...

topic 7 - hypothesis tests based on a single sample sampling...

personal finance 3301-d08

sfc it 3301 - lecture 4

waterlog h-3301/3311

1 topic 1.4.1 mathematical proofs. 2 topic 1.4.1...

read 3301 word study

topic hypothesis in process ability theory di biase hinger...

mgt 3301 - ge presentation