review z test ci 1

1

Week 7:

Hypothesis Testing: z-test

2

Review

We have seen descriptive statistics, statistical procedures to summarize data and describe distributions and individual scores.

We also need inferential statistics, techniques that can help us to test hypotheses, to generalize results to a given population, and to estimate the parameters of a population.

3

Statistics and Parameters

Statistics are the facts on which we draw conclusions about parameters.

Sample statistics are only estimates of the population parameters.

We need to know how the sample we have taken compares with other samples that might be taken from the population.

4

Sampling distribution of the mean

Sampling distribution of the mean is the distribution of sample means for all samples of a given size randomly selected from the population.

If we had generated all possible samples, the mean of the possible values of sample mean would agree exactly with μ.

5

Central Limit Theorem for sample mean

When n (sample size) is large, the sampling distribution of sample means will be approximately normal (with the approximation becoming more precise as n increases).

When the population distribution is normal, the sampling distribution of sample means is exactly normal for any sample size n.

6

Example:

What is the probability of getting a sample mean of 104 or higher on an IQ test (μ = 100, σ = 15) with a random sample of n = 36?

Calculate the standard error for samples of n = 36:

(σx) = σ = 15 = 15 = 2.5

√n √36 6

7

Draw the distribution of the sample means:

92.5 95 97.5 100 102.5 105 107.5

The probability of getting a sample mean of 107.5 is .0013.

The probability of getting a sample mean of 104 or higher is ____.

8

We would need to compute a z score to figure out what the probability of getting a sample mean of 104 or higher: Compute a z score:

Z = 104 - 100 = 4 = 1.60 2.5 2.5

0.0548 probability of getting a sample mean of 104.

9

Steps of Hypothesis Testing

1. State the hypotheses.2. Set the criterion for rejecting Ho (null hypothesis).3. Compute the test statistic.4. Compare test statistic to criterion set.

* Create a confidence interval and interpret.5. Decide about null Ho.

10

1. Stating the Hypotheses

Null hypothesis (Ho): The hypothesis to be tested. Usual states “no difference between groups” or “no relationship between variables.” Any difference or relationship you observe is most likely due to chance.

Research/Alternative hypothesis (H1): Conjecture about one or more population parameters. Can be non-directional or directional.

11

Directionality of the test

A directional alternative hypothesis states that the parameter is statistically significantly greater than or less than the hypothesized value.

A non-directional alternative hypothesis merely states that the parameter is statistically significantly different from (not equal to) the hypothesized value.

12

Examples:

Non-directional hypothesis: There is a statistically significant difference in

males and females on math achievement test scores.

How can we rephrase this to be a directional hypothesis?

13

Errors in Inferential Statistics

Type I error: Reject Ho when it’s true. Type II error: Fail to reject Ho when it’s false. We’ll discuss Type I error and Type II error when

we look at statistical power next week.

14

2. Setting the Criterion for Rejecting Ho

Level of significance (α level): the probability of making a Type I error when testing a null hypothesis.

Establish α level before collecting data. Alpha levels: .10,.05,.01,.001….

In deciding to reject the hypothesis at one of these levels, the researcher knows that the decision to reject the hypothesis may be incorrect 1%, 5%, or 10% of the time, respectively.

15

Rejection Region

The level of significance represents a proportion of area in a sampling distribution that equals the probability of rejecting the Ho if it is true.

Rejection region refers to the area of the sampling distribution that represents values for the sample means that are improbable (possible but highly unlikely) if the Ho is true.

The critical values of the test statistic are those values in the sampling distribution that represent the beginning of the region of rejection.

The directionality of the test (one-tailed or two tailed) affects the critical values.

16

See pages 181 – 188 in text for examples depicting the relationship between the critical value and rejection region.

17

3. Computing test statistic to compute P-value P value: The probability of obtaining a value of the

test statistic that is as likely or more likely to reject Ho as the actual observed value of the test statistic.

P value Ranges from 0 to 1. Answers: If the populations really have the same

mean overall, what is the probability that random sampling would lead to a difference between sample means as large (or larger) than you observed?

Do we want a large or small p value?

18

Interpreting P-value

“The probability is greater than .05 that the observed sample mean would have occurred by chance if the null hypothesis were true” (p. 195). The difference between the sample mean and the

hypothesized value of the population is sufficient (or not sufficient) enough to attribute it to anything other than sampling error.

19

4. Compare P value to preset threshold value: Create confidence interval “Interval Estimate”

Confidence Interval: Provides an indication of how accurately the sample mean estimates population mean.

An interval of possible values for population mean in place of using just a single value of sample mean.

Confidence Coefficient = 1-α When α=.05, a 95% confidence interval indicates

that 95% of the time in repeated sampling, intervals calculated will contain the population mean (true parameter).

20

5. Make Decision about Ho

1. If P value is less than the threshold, state that you “reject the null hypothesis” and conclude that the difference is “statistically significant.”

2. If P value is greater than the threshold, state that you “fail to reject the null hypothesis” and conclude that the difference is “not statistically significant.” The difference is within distribution of sampling fluctuation.

21

Revisiting Hypothesis Testing Steps:

1. State hypotheses.2. Set criterion.3. Compute appropriate test statistic to compute the

P value.4. Compare the P value to the preset threshold

value.5. Make decision.

22

Z-test

Determine the magnitude of the difference between sample mean and population mean.

The population mean μ and its standard deviation σ are known.

23

Z-test Example 1: non-directional

24

1. State hypotheses:

Ho: Children who learn whole language approach do not statistically significantly differ from the average child in word recognition (µ = 75%, σ = 5%).

In symbols: Ho: µ = 75%.

H1: Children who learn whole language approach statistically significantly differ from the average child with respect to word recognition (µ = 75%, σ = 5%).

In symbols: H1: µ ≠ 75%.

25

2. Set Criterion for Decision

α = 0.05, thus the critical values (C.V.) are ± 1.96.

Rejection region Rejection region

26

3. Collect data and compute test statistic

Sample mean= 78%

Population mean = 75%

σ = 5%

n = 50

Z = .78 - .75 = 0.03 = 4.24

.05 /√50 .05 /√50 n

xσ

μ−

This is the test statistic which is a z –score

(unit: standard deviation)

27

4. Compare P value to criterion set.

Critical values are determined by alpha

levels and indicate where your rejection region

begins

4.24

Plot test statistic to see if in rejection region

If the test statistic is in the rejection region, reject the null hypothesis. In other words, if the test statistic exceeds the critical value, reject the null hypothesis.

28

5. Make Decision

We reject the null hypothesis and conclude that children who learn the whole language statistically significantly differ from the average child in word recognition, z = 4.24, p < .05.

29

Compute confidence interval Alpha set a 0.05. Thus, we build a 95% confidence interval and

the critical values are ± 1.96.

CI95 = μ ± Critical value (σ ) √n

CI95% = .75 ± 1.96 (.05 / √50 ) = .75 ± .01 =

[.74, .76]

95% of all possible means for samples of size 50 in the population will fall between 74% and 76%.

Our sample mean of 78% falls outside of the interval we built.

Standard error

30

Z-test Example 2: Directional

31

1. State hypotheses:

Ho: Babies born to mothers who smoke do not significantly differ in birth weight than the average baby (µ = 7.5, σ = 1.2).

Ho: µ = 7.5

H1: Babies born to mothers who smoke weigh significantly less than the average baby at birth (µ = 7.5, σ = 1.2).

H1: µ < 7.5.

32

2. Set Criterion.

α = 0.05 Since this is a directional test, only look at

one side (tail) of the distribution. What is the z-score corresponding to 0.05?

1.64 + 1.65 / 2 = 1.645. Use - 1.645 since the hypothesized mean is

smaller than the population mean in the H1.

33

3. Compute Test Statistic.

Sample mean = 7.1

Population mean = 7.5

σ = 1.2

n = 50

Z = = 7.1 – 7.5 = -.40 = -2.36

1.2 /√50 1.2 /√50 n

xσ

μ−

This is the test statistic which is a z –score

(unit: standard deviation)

34

4. Compare P value to criterion set.

-2.36

35

5. Make Decision.

We reject the null hypothesis and conclude that babies born to mothers who smoke weigh statistically significantly less than the average baby at birth, z = -2.36, p < .05.

36

Compute confidence interval:

CI95 = μ ± Critical value (σ )

√n

CI95 = 7.5 - 1.645 (1.2) = 7.22

√50 Directional test: Any group of 50 babies born to

mothers who smoke who are less than 7.22 pounds are significantly lower than the average baby.

37

Recommended note cards:

Steps in hypothesis testing Critical values associated with alpha levels of .05

and .01 (most commonly used) for a one and two-tailed test. Use page 189 in your text as a guide.

Z-test When to use a Z-test Formula for computing a Z-test

Confidence interval Formula Interpretation

review z test ci 1

Business