review z test ci 1

37
1 Week 7: Hypothesis Testing: z- test

Upload: shoffma5

Post on 06-May-2015

8.714 views

Category:

Business


3 download

TRANSCRIPT

Page 1: Review Z Test Ci 1

1

Week 7:

Hypothesis Testing: z-test

Page 2: Review Z Test Ci 1

2

Review

We have seen descriptive statistics, statistical procedures to summarize data and describe distributions and individual scores.

We also need inferential statistics, techniques that can help us to test hypotheses, to generalize results to a given population, and to estimate the parameters of a population.

Page 3: Review Z Test Ci 1

3

Statistics and Parameters

Statistics are the facts on which we draw conclusions about parameters.

Sample statistics are only estimates of the population parameters.

We need to know how the sample we have taken compares with other samples that might be taken from the population.

Page 4: Review Z Test Ci 1

4

Sampling distribution of the mean

Sampling distribution of the mean is the distribution of sample means for all samples of a given size randomly selected from the population.

If we had generated all possible samples, the mean of the possible values of sample mean would agree exactly with μ.

Page 5: Review Z Test Ci 1

5

Central Limit Theorem for sample mean

When n (sample size) is large, the sampling distribution of sample means will be approximately normal (with the approximation becoming more precise as n increases).

When the population distribution is normal, the sampling distribution of sample means is exactly normal for any sample size n.

Page 6: Review Z Test Ci 1

6

Example:

What is the probability of getting a sample mean of 104 or higher on an IQ test (μ = 100, σ = 15) with a random sample of n = 36?

Calculate the standard error for samples of n = 36:

(σx) = σ = 15 = 15 = 2.5

√n √36 6

Page 7: Review Z Test Ci 1

7

Draw the distribution of the sample means:

92.5 95 97.5 100 102.5 105 107.5

The probability of getting a sample mean of 107.5 is .0013.

The probability of getting a sample mean of 104 or higher is ____.

Page 8: Review Z Test Ci 1

8

We would need to compute a z score to figure out what the probability of getting a sample mean of 104 or higher: Compute a z score:

Z = 104 - 100 = 4 = 1.60 2.5 2.5

0.0548 probability of getting a sample mean of 104.

Page 9: Review Z Test Ci 1

9

Steps of Hypothesis Testing

1. State the hypotheses.2. Set the criterion for rejecting Ho (null hypothesis).3. Compute the test statistic.4. Compare test statistic to criterion set.

* Create a confidence interval and interpret.5. Decide about null Ho.

Page 10: Review Z Test Ci 1

10

1. Stating the Hypotheses

Null hypothesis (Ho): The hypothesis to be tested. Usual states “no difference between groups” or “no relationship between variables.” Any difference or relationship you observe is most likely due to chance.

Research/Alternative hypothesis (H1): Conjecture about one or more population parameters. Can be non-directional or directional.

Page 11: Review Z Test Ci 1

11

Directionality of the test

A directional alternative hypothesis states that the parameter is statistically significantly greater than or less than the hypothesized value.

A non-directional alternative hypothesis merely states that the parameter is statistically significantly different from (not equal to) the hypothesized value.

Page 12: Review Z Test Ci 1

12

Examples:

Non-directional hypothesis: There is a statistically significant difference in

males and females on math achievement test scores.

How can we rephrase this to be a directional hypothesis?

Page 13: Review Z Test Ci 1

13

Errors in Inferential Statistics

Type I error: Reject Ho when it’s true. Type II error: Fail to reject Ho when it’s false. We’ll discuss Type I error and Type II error when

we look at statistical power next week.

Page 14: Review Z Test Ci 1

14

2. Setting the Criterion for Rejecting Ho

Level of significance (α level): the probability of making a Type I error when testing a null hypothesis.

Establish α level before collecting data. Alpha levels: .10,.05,.01,.001….

In deciding to reject the hypothesis at one of these levels, the researcher knows that the decision to reject the hypothesis may be incorrect 1%, 5%, or 10% of the time, respectively.

Page 15: Review Z Test Ci 1

15

Rejection Region

The level of significance represents a proportion of area in a sampling distribution that equals the probability of rejecting the Ho if it is true.

Rejection region refers to the area of the sampling distribution that represents values for the sample means that are improbable (possible but highly unlikely) if the Ho is true.

The critical values of the test statistic are those values in the sampling distribution that represent the beginning of the region of rejection.

The directionality of the test (one-tailed or two tailed) affects the critical values.

Page 16: Review Z Test Ci 1

16

See pages 181 – 188 in text for examples depicting the relationship between the critical value and rejection region.

Page 17: Review Z Test Ci 1

17

3. Computing test statistic to compute P-value P value: The probability of obtaining a value of the

test statistic that is as likely or more likely to reject Ho as the actual observed value of the test statistic.

P value Ranges from 0 to 1. Answers: If the populations really have the same

mean overall, what is the probability that random sampling would lead to a difference between sample means as large (or larger) than you observed?

Do we want a large or small p value?

Page 18: Review Z Test Ci 1

18

Interpreting P-value

“The probability is greater than .05 that the observed sample mean would have occurred by chance if the null hypothesis were true” (p. 195). The difference between the sample mean and the

hypothesized value of the population is sufficient (or not sufficient) enough to attribute it to anything other than sampling error.

Page 19: Review Z Test Ci 1

19

4. Compare P value to preset threshold value: Create confidence interval “Interval Estimate”

Confidence Interval: Provides an indication of how accurately the sample mean estimates population mean.

An interval of possible values for population mean in place of using just a single value of sample mean.

Confidence Coefficient = 1-α When α=.05, a 95% confidence interval indicates

that 95% of the time in repeated sampling, intervals calculated will contain the population mean (true parameter).

Page 20: Review Z Test Ci 1

20

5. Make Decision about Ho

1. If P value is less than the threshold, state that you “reject the null hypothesis” and conclude that the difference is “statistically significant.”

2. If P value is greater than the threshold, state that you “fail to reject the null hypothesis” and conclude that the difference is “not statistically significant.” The difference is within distribution of sampling fluctuation.

Page 21: Review Z Test Ci 1

21

Revisiting Hypothesis Testing Steps:

1. State hypotheses.2. Set criterion.3. Compute appropriate test statistic to compute the

P value.4. Compare the P value to the preset threshold

value.5. Make decision.

Page 22: Review Z Test Ci 1

22

Z-test

Determine the magnitude of the difference between sample mean and population mean.

The population mean μ and its standard deviation σ are known.

Page 23: Review Z Test Ci 1

23

Z-test Example 1: non-directional

Page 24: Review Z Test Ci 1

24

1. State hypotheses:

Ho: Children who learn whole language approach do not statistically significantly differ from the average child in word recognition (µ = 75%, σ = 5%).

In symbols: Ho: µ = 75%.

H1: Children who learn whole language approach statistically significantly differ from the average child with respect to word recognition (µ = 75%, σ = 5%).

In symbols: H1: µ ≠ 75%.

Page 25: Review Z Test Ci 1

25

2. Set Criterion for Decision

α = 0.05, thus the critical values (C.V.) are ± 1.96.

Rejection region Rejection region

Page 26: Review Z Test Ci 1

26

3. Collect data and compute test statistic

Sample mean= 78%

Population mean = 75%

σ = 5%

n = 50

Z = .78 - .75 = 0.03 = 4.24

.05 /√50 .05 /√50 n

μ−

This is the test statistic which is a z –score

(unit: standard deviation)

Page 27: Review Z Test Ci 1

27

4. Compare P value to criterion set.

Critical values are determined by alpha

levels and indicate where your rejection region

begins

4.24

Plot test statistic to see if in rejection region

If the test statistic is in the rejection region, reject the null hypothesis. In other words, if the test statistic exceeds the critical value, reject the null hypothesis.

Page 28: Review Z Test Ci 1

28

5. Make Decision

We reject the null hypothesis and conclude that children who learn the whole language statistically significantly differ from the average child in word recognition, z = 4.24, p < .05.

Page 29: Review Z Test Ci 1

29

Compute confidence interval Alpha set a 0.05. Thus, we build a 95% confidence interval and

the critical values are ± 1.96.

CI95 = μ ± Critical value (σ ) √n

CI95% = .75 ± 1.96 (.05 / √50 ) = .75 ± .01 =

[.74, .76]

95% of all possible means for samples of size 50 in the population will fall between 74% and 76%.

Our sample mean of 78% falls outside of the interval we built.

Standard error

Page 30: Review Z Test Ci 1

30

Z-test Example 2: Directional

Page 31: Review Z Test Ci 1

31

1. State hypotheses:

Ho: Babies born to mothers who smoke do not significantly differ in birth weight than the average baby (µ = 7.5, σ = 1.2).

Ho: µ = 7.5

H1: Babies born to mothers who smoke weigh significantly less than the average baby at birth (µ = 7.5, σ = 1.2).

H1: µ < 7.5.

Page 32: Review Z Test Ci 1

32

2. Set Criterion.

α = 0.05 Since this is a directional test, only look at

one side (tail) of the distribution. What is the z-score corresponding to 0.05?

1.64 + 1.65 / 2 = 1.645. Use - 1.645 since the hypothesized mean is

smaller than the population mean in the H1.

Page 33: Review Z Test Ci 1

33

3. Compute Test Statistic.

Sample mean = 7.1

Population mean = 7.5

σ = 1.2

n = 50

Z = = 7.1 – 7.5 = -.40 = -2.36

1.2 /√50 1.2 /√50 n

μ−

This is the test statistic which is a z –score

(unit: standard deviation)

Page 34: Review Z Test Ci 1

34

4. Compare P value to criterion set.

-2.36

Page 35: Review Z Test Ci 1

35

5. Make Decision.

We reject the null hypothesis and conclude that babies born to mothers who smoke weigh statistically significantly less than the average baby at birth, z = -2.36, p < .05.

Page 36: Review Z Test Ci 1

36

Compute confidence interval:

CI95 = μ ± Critical value (σ )

√n

CI95 = 7.5 - 1.645 (1.2) = 7.22

√50 Directional test: Any group of 50 babies born to

mothers who smoke who are less than 7.22 pounds are significantly lower than the average baby.

Page 37: Review Z Test Ci 1

37

Recommended note cards:

Steps in hypothesis testing Critical values associated with alpha levels of .05

and .01 (most commonly used) for a one and two-tailed test. Use page 189 in your text as a guide.

Z-test When to use a Z-test Formula for computing a Z-test

Confidence interval Formula Interpretation