education 793 class notes decisions, error and power presentation 8

Education 793 Class Notes

Decisions, Error and Power

Presentation 8

2

Review: Chain of reasoning

Population with parameters

Random selection

Sample with statistics

Probability

Underlying distributions of the statistic

Inference

Random selection can take several forms (such as simple, systematic, cluster, or stratified random sampling), and is intended to generate a sample that represents the population.

3

Four Steps

1. State the hypothesisH0 vs. HAlternate

2. Identify your criterion for rejecting H0

Directional or non-directional testOne-tailed or two-tailed

Set alpha level (Prob. incorrectly rejecting H0)

3. Compute test statistic General form: Test = statistic – expected parameter

standard error of statistic

4. Decide about H0

4

Million Dollar Question

• To Reject or Not To Reject?

5

Review: Inferential error

• Type I: Alpha– Rejecting the null hypothesis with the null

hypothesis is really true

• Type II: Beta– Failing to reject the null hypothesis when the null

hypothesis is in fact false

6

Possible outcomes

DecisionIn the population,

H0 is true

In the population,

H0 is false

Reject H0

(H0 false)Type I error

Correct decision

Fail to reject H0

(H0 true)Correct decision

Type II error

7

Considerations

• Basic considerations– What is the statistic of interest?– Is a one-tailed versus two-tailed test appropriate?– What is the nature of the sample?

• Dependent versus independent

• How much power is available?– Power is 1 – – Factors influencing power:

• Increasing the number of observations• Reducing the error in the data

8

Statistical Power

• Power is the probability of correctly rejecting a false null hypothesis. As such, power is defined as:

1 - ß where ß is the Type II error probability

• Two kinds of power analyses:– A priori: Used to identify what the sample size

needs to be to identify a specified effect– Post hoc: Used when the null hypothesis has not

been rejected, and when you want to know the probability that you have committed a Type II error

9

What Power is all about

• Power analysis is about Type II errors, “missed effects” – failing to reject H0: when there really is an effect in the population

• “Power” is the antithesis of “risk of Type II error” – Risk of Type II error = 1 - power – Power = 1 - Risk of Type II error

10

Trade off Between and Power

• Conundrum: As we decrease (Type I Error), Type II Error) increases. The two errors have an inverse relationship. In order to deal with the problem, researchers follow an established set of guidelines. =.05 or .01

• Which error is the most dangerous?

11

Power of a Test

• Example: Ho: =72, H1: <72 =.05, n=36

Review: Given the sample data, a decision is made to reject or not reject the null hypothesis.

=72

zcrit= -1.65

=Type I Error

This distribution is assumed to be true under the Null

12

Power of a Test

• If the null hypothesis is false, then the above distribution is NOT the one we sampled from. We must have sampled from one of the possible alternative distributions, of which there are an infinite number. Recall H1: <72

zcrit= -1.65

=Type I Error

=72

13

Power of a Test

• The steps in computing power require the researcher to set a predetermined difference from the two means (1) that is meaningful (Effect Size).

• With three pieces of information, N, we can estimate the power of a test.

• We will let computer programs do this for us.

14

Factors Affecting Power

• Size of the difference between population means, what the book refers to as 1.– The greater the effect size (1), the greater the

power will be. Common sense, as the difference gets larger, it will be easier to detect.

• Significance level– As the power of a statistical test increases so does

This is the trade off between Type I and Type II Errors.

15

Factors Affecting Power

• Variance– All other factors being equal (, N, ), the

smaller the standard deviation in the population, the greater the power.

• Sample size– By increasing N, we decrease the standard

deviation in the sampling distribution. Hence the power is increased.

16

Power as a function of…

the true difference in

significancelevel

samplesize

17

Next Week

• Chapter 12 p. 333-367 and

Available through JSTOR at http://www.jstor.org/journals/00221546.html

education 793 class notes decisions, error and power presentation 8

Documents