stats chapter 12

52
Chapter 12 Significance Tests in Practice

Upload: richard-ferreria

Post on 05-Dec-2014

2.526 views

Category:

Documents


3 download

DESCRIPTION

 

TRANSCRIPT

Page 1: Stats chapter 12

Chapter 12

Significance Tests in Practice

Page 2: Stats chapter 12

12.1 TESTS ABOUT POPULATION MEAN

Page 3: Stats chapter 12

Student’s t-distribution

• Published in 1908• Used to describe the sampling

distribution when the population std dev is unknown

TEST STATISTIC ( uknown)

1/

n

xt

s n

Page 4: Stats chapter 12

Student’s t- distribution

• Since this is just another significance test:Use PHANTOMS

• Differences:–We are using a t distribution with n-1

degrees of freedom– Use “tcdf(lower, upper, df)”– The t-distribution is not resistant to

outliers when sample size is small (less than 30)

Page 5: Stats chapter 12

Student’s t-distribution

Assumptions• Simple Random Sample• Independence

N > 10n• Normality

The sample must be approx Normal to indicate the Normality of the sampling distribution(1) Histogram: single peak, symmetric

note: slight skew is OK, but must be mentioned(2) Norm probability plot: approx. linear(3) NO OUTLIERS

Page 6: Stats chapter 12

Example 12.2

Tasters use a “sweetness scale” of 1 to 10. Cola is rated before and after a month of storage in high temperature. The differences are shown below. The bigger the difference, the greater the loss of sweetness. Does the data indicate that sweetness was lost after the storage interval?

2.0, 0.4, 0.7, 2.0, -0.4, 2.2, -1.3, 1.2, 1.1, 2.3

Page 7: Stats chapter 12

Example 12.2

Parameter• Let = the population mean

sweetness lost after a month of storage at high temperature

• Let xbar = the sample (n=10) sweetness loss after a month of storage at high temperatures

Page 8: Stats chapter 12

Example 12.2

Hypotheses• H0: = 0

This indicates that there is no sweetness loss

• Ha: > 0This indicates that there is sweetness loss

Page 9: Stats chapter 12

Example 12.2

Assumptions• Simple Random Sample

We are not told that our data represents an SRS. We must check that this sample is an SRS (or acts like an SRS) and proceed.

• IndependenceWe can be assured that the population of cola is greater than 10(10) = 100

Page 10: Stats chapter 12

Example 12.2

Assumptions (cont.)• Normality

Our histogram is single peaked with a slight left skew

The Normal probability plot is approximately linear

There are no outliers

Page 11: Stats chapter 12

Example 12.2

Test Statistic

P-value

Page 12: Stats chapter 12

Example 12.2

Test Statistic

P-value

10, 1.02, 1.1961

10 1 9

n x s

df

Page 13: Stats chapter 12

Example 12.2

Test Statistic

P-value

10, 1.02, 1.1961

10 1 9

n x s

df

9

1.02 0

1.1961/ 10t

Page 14: Stats chapter 12

Example 12.2

Test Statistic

P-value

10, 1.02, 1.1961

10 1 9

n x s

df

9

1.02 0

1.1961/ 10t

9

1.02 0

1.1961/ 102.697

t

Page 15: Stats chapter 12

Example 12.2

Test Statistic

P-value

10, 1.02, 1.1961

10 1 9

n x s

df

9

1.02 0

1.1961/ 10t

9

1.02 0

1.1961/ 102.697

t

9Pval 2.697P t

Page 16: Stats chapter 12

Example 12.2

Test Statistic

P-value

10, 1.02, 1.1961

10 1 9

n x s

df

9

1.02 0

1.1961/ 10t

9

1.02 0

1.1961/ 102.697

t

Page 17: Stats chapter 12

Example 12.2

Test Statistic

P-value

10, 1.02, 1.1961

10 1 9

n x s

df

9

1.02 0

1.1961/ 10t

9

1.02 0

1.1961/ 102.697

t

9Pval 2.697P t

0.0123

Page 18: Stats chapter 12

Example 12.2

Make a decisionWe are going to “reject” (our p-value is small)

SummarizeApproximately 1% of the time, a sample of size 10 will produce a mean sweetness loss of at least 1.02.Since the p-value is smaller than a presumed = 0.05, we reject the null hypothesis.We have evidence to conclude that the mean sweetness loss greater than 0. Our new estimate for the average sweetness loss is 1.02.

Page 19: Stats chapter 12

Paired t-tests

• When a sample is produced using a matched pair design, the data used in the significance test is the difference of the two measurements

• Some typical examples of a paired t-test would be a “pre-test and post-test” as well as the previous example.

• The important thing here is to recognize the matched pair design and to work of the difference of the scores (and not the scores themselves)

Page 20: Stats chapter 12

Example 12.5

Page 21: Stats chapter 12

Example 12.5

We will work through the first few steps and leave the rest for you on your own.

Parameter• = the population difference in

average time to complete the maze • xbar = the sample (n = 21)

difference in average time to complete the maze

Page 22: Stats chapter 12

Example 12.5

Hypotheses• H0: = 0

• Ha: > 0(the scented mask decreases average time to complete maze)

• Remember: we are looking at the “difference” column only!

Page 23: Stats chapter 12

Example 12.5

Assumptions• Simple Random Sample

The data comes from a randomized matched pair design; we will have to assume that this is an SRS of the population and proceed with the test

• IndependenceWe must assume that the population is greater than 10(21) = 210 and that the scented and unscented trails are independent; we will proceed as though this condition is satisfied

Page 24: Stats chapter 12

Example 12.5

Assumptions (cont.)• Normality

Our histogram is single peaked with a slight right skew

The Normal probability plot is approximately linear

There are no outliers

Page 25: Stats chapter 12

Example 12.5

Name of the Test• We will use a “paired t-test for a

mean”Test Statistic

• You can do the rest, yeah?

20

0.9567 0

12.5479 / 21t

21, 0.9567,

12.5479, 20

n x

s df

0.349

Page 26: Stats chapter 12

Robustness

• t-procedures are robust against non-Normal population except in the presence of outliers

Guidelines for using t-procedures• n < 15: data must be approx normal,

no outliers• n >15: data can have slight skew,

no outliers• n > 30: data can have skew

Page 27: Stats chapter 12

Assignment 12.1

• Page 745 #1, 3, 6, 9, 10, 12, 16, 20

Page 28: Stats chapter 12

12.2 TESTS ABOUT A POPULATION PROPORTION

Page 29: Stats chapter 12

z-tests for proportion

• Again, we have introduced most of the material- this is just another significance test.

• Unlike tests for means, tests for proportions will always be a z-test

• We will review some of the key information:

Page 30: Stats chapter 12

Assumptions for proportions

• Simple Random Sample• Independence

N > 10n• Normality (of sampling distribution)

np > 10 and nq >10remember that this is just the number of responses

Page 31: Stats chapter 12

Test Statistic for proportions

If H0: p = p0

0

0 0

p pz

p q

n

Page 32: Stats chapter 12

Example 12.8

A random sample of 100 workers from a large chain restaurant were asked whether or not work stress had a negative impact on their personal lives. Thirty-two of them responded “No.” A large national survey reported that 25% of restaurant workers did not feel that stress exerted a negative impact.Does this large chain restaurant have the same work stress as the nation?

Page 33: Stats chapter 12

Example 12.8

• We are going to use the national survey as our known population mean.

Parameterp = the national proportion of restaurant workers who feel as though work stress has a negative impact on their personal lives.p-hat = the proportion of the sample of 100 workers who feel as though work stress has a negative impact on their personal lives.

Page 34: Stats chapter 12

Example 12.8

HypothesesH0: p = 0.25

Ha: p 0.25

AssumptionsSimple Random Sample

We are told that our sample is from a random sample. We will treat this as an SRS.

IndependenceAlthough we are not told, we will make the assumption that there are more than 10(100) = 1000 workers for this national chain

Normalitynp = 38 >10 and nq = 62 > 10Our sampling distribution is approximately Normal

Page 35: Stats chapter 12

Example 12.8

Name of TestWe will conduct a 1-proportion z-test(Note that this will be a two-tailed test)

Test Statistic

0

0 0

p pz

p q

n

0.32 0.25

0.25 0.75

100

1.617z

Page 36: Stats chapter 12

Example 12.8

Pvalue

Decision

Page 37: Stats chapter 12

Example 12.8

Pvalue

Decision

Pval 2 1.617P z

Page 38: Stats chapter 12

Example 12.8

Pvalue

Decision

Pval 2 1.617P z

Page 39: Stats chapter 12

Example 12.8

Pvalue

Decision

Pval 2 1.617P z Pval 0.1059

Page 40: Stats chapter 12

Example 12.8

Pvalue

DecisionFail to reject

Pval 2 1.617P z Pval 0.1059

Page 41: Stats chapter 12

Example 12.8

SummarizeApproximately 11% of the time, a sample of size 100 will produce a proportion at least as extreme 0.38.Since this is not less than a presumed = 0.05, we will fail to reject the H0.

We do not have enough evidence to conclude that the proportion of workers of this national chain who feel that work stress affects their personal lives is not 0.25.

Page 42: Stats chapter 12

Power

The preceding example should make you feel a bit uncomfortable: that p-value was not large. We don’t really expect the average to be exactly equal to the national average!Most likely, the restaurant’s proportion was not that different from the national average: maybe just a few percentage points greater.Our test didn’t have enough power to detect the difference between the two proportions!

Page 43: Stats chapter 12

Confidence Intervals and Significance Tests

• Let’s calculate the confidence interval for out proportion in the preceding example.

Page 44: Stats chapter 12

Confidence Intervals and Significance Tests

• Let’s calculate the confidence interval for out proportion in the preceding example.

CI *p q

p zn

Page 45: Stats chapter 12

Confidence Intervals and Significance Tests

• Let’s calculate the confidence interval for out proportion in the preceding example.

CI *p q

p zn

Notice that the Conf. Int.uses p-hat and q-hat for the Standard Error

Page 46: Stats chapter 12

Confidence Intervals and Significance Tests

• Let’s calculate the confidence interval for out proportion in the preceding example.

CI *p q

p zn

0.32 0.68CI 0.32 *

100z

Page 47: Stats chapter 12

Confidence Intervals and Significance Tests

• Let’s calculate the confidence interval for out proportion in the preceding example.

CI *p q

p zn

0.32 0.68CI 0.32 *

100z

CI 0.2433,0.3967

Page 48: Stats chapter 12

Confidence Intervals and Significance Tests

• Let’s calculate the confidence interval for out proportion in the preceding example.

CI *p q

p zn

0.32 0.68CI 0.32 *

100z

CI 0.2433,0.3967Our interval contains 0.25It is equally likely that theproportion is actually 0.25!i.e. the proportion could be0.25!

Page 49: Stats chapter 12

Calculator Usage

As you may have already noticed, the TI calculators automate many of these calculations.Of course, this does not excuse you from writing out the PHANTOMS or PANIC procedures, or even showing your calculations!

Page 50: Stats chapter 12

Calculator Usage

Page 51: Stats chapter 12

Calculator Usage

TI83/84: Since these functions are menu driven we will just list the tests and their usage

[STAT] → “TESTS”Z-Test = one or two tailed z test for meanT-Test = one or two tail t-test for means1-PropZTest = one or two tail test for proportionsZInterval = confidence interval for mean (-known)Tinterval = confidence interval for mean (-

unknown)1-PropZInt = confidence interval for proportions

Page 52: Stats chapter 12