chapter

49
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Sampling Distributio ns 8

Upload: xena-gross

Post on 15-Mar-2016

27 views

Category:

Documents


0 download

DESCRIPTION

Chapter. 8. Sampling Distributions. Section 8.1. Distribution of the Sample Mean. Statistics such as the “mean” ( ) are random variables. Their value varies from sample to sample, so they have probability distributions associated with them. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Chapter

Copyright © 2013, 2010 and 2007 Pearson Education, Inc.

Chapter

Sampling Distributions

8

Page 2: Chapter

Chap 2 2

Page 3: Chapter

Copyright © 2013, 2010 and 2007 Pearson Education, Inc.

Section 8.1

Distribution of the

Sample Mean

Page 4: Chapter

Copyright © 2013, 2010 and 2007 Pearson Education, Inc.

Statistics such as the “mean” ( ) are random variables. Their value varies from

sample to sample, so they have probability distributions associated with them.

In this chapter we focus on the shape (normal?), center (peak) and

spread(deviation) of distributions of .

8-4

x

x

Page 5: Chapter

Copyright © 2013, 2010 and 2007 Pearson Education, Inc.

The sampling distribution of the sample mean is the probability distribution of all possible values of the random variable “ ”

Take many samples of size “n” from a population whose mean is μ and standard

deviation is σ.

Then plot just the means ( ) of each sample.

8-5

x

x

Page 6: Chapter

8-6

Illustrating Sampling Distributions

Step 1: Obtain a simple random sample of size “n”

Step 2: Compute the sample mean.

Step 3: Repeat Steps 1 and 2 until all possible simple random samples of size n have been obtained (in theory). In practice, take as many n-sized samples as practicable (time & money…).

Page 7: Chapter

Copyright © 2013, 2010 and 2007 Pearson Education, Inc.

The weights of pennies minted after 1982 are approx normally distributed with mean = 2.46g (grams) and

std dev = 0.02g. (28g = 1 oz)

Approximate the sampling distribution of the sample mean by taking 200 simple random samples of size n = 5 pennies from this population (in other words, find the mean weight of each 5-penny sample and then plot

just the 200 ‘s, not the weight of all 1000 pennies.)

8-7

Sampling Distribution of the Sample Mean: Normal Population

x

Page 8: Chapter

8-8

The data on the following slide represent the sample means for the 200 simple random

samples, each of size n = 5.

For example, the first sample of n = 5 pennies had the following weight data (in g):

2.493 2.466 2.473 2.492 2.471

Note: = 2.479g for this sample

x

Page 9: Chapter

8-9

The data on the following slide represent the sample means for the 200 simple random

samples, each of size n = 5.

For example, the first sample of n = 5 pennies had the following weight data (in g):

2.493 2.466 2.473 2.492 2.471

Note: = 2.479g for this sample

x

Page 10: Chapter

8-10

The data on the following slide represent the sample means for the 200 simple random

samples, each of size n = 5.

For example, the first sample of n = 5 pennies had the following weight data (in g):

2.493 2.466 2.473 2.492 2.471

Note: = 2.479g for this sample

x

Page 11: Chapter

8-11

Sample Means for Samples of Size n = 5

Page 12: Chapter

8-12

The mean of the 200 sample means is 2.46g, the same as the mean of the

population.

The standard deviation of the sample means (“standard error”) is 0.0086g, which is smaller than the standard

deviation of the population.

The next slide shows the histogram of all 200 sample means.

Page 13: Chapter

8-13

Page 14: Chapter

Copyright © 2013, 2010 and 2007 Pearson Education, Inc.

What role does “n”, the size of each sample, play in the value of the standard deviation of the

sample means?

As the size “n” of each sample increases, the standard deviation of the distribution of the

sample mean decreases.

8-14

Page 15: Chapter

Copyright © 2013, 2010 and 2007 Pearson Education, Inc.

Approximate the distribution of the sample means by obtaining 200 simple random samples of size n = 20 pennies

(not 5)….

from the same population of pennies minted after 1982 (μ = 2.46 grams and σ

= 0.02 grams)

8-15

The Impact of Sample Size “n” on Sampling Variation (Variance, Std Dev)

Page 16: Chapter

8-16

The mean of the 200 sample means for n = 20 pennies is still 2.46g, but the standard deviation is now 0.0045g

(0.0086g for n = 5). There is less variation in the distribution of the sample means when n =20p than with n = 5p and the curve is more “normal”.

Page 17: Chapter
Page 18: Chapter

8-18

If a random sample of size “n” is drawn from a large population with mean “μ” and std dev

“σ”. Then the distribution of the ‘s will have:

mean

std dev

is the “standard error of the mean” or just “standard error”

The Mean & Standard Deviation of theSampling Distribution of

x

x

x n

.

x

x

Page 19: Chapter

Copyright © 2013, 2010 and 2007 Pearson Education, Inc.

Theorem If the parent population of the random variable X is normally

distributed, the distribution of the sample means from that population

will also be normally distributed.

8-19

x

Page 20: Chapter

Copyright © 2013, 2010 and 2007 Pearson Education, Inc.

The weights of pennies minted after 1982 (pop) are normally distributed with:

mean = 2.46g and std dev = 0.02g.

What is the probability that, in a random sample of 10 of these pennies, the mean weight of the

sample is at least 2.465 grams?

8-20

Distribution of the Sample Mean

Page 21: Chapter

Copyright © 2013, 2010 and 2007 Pearson Education, Inc.

.

.P(Z > 0.79) = 1 – 0.7852 = 0.2148

.

x

x

x 0.02

100.0063

Z 2.465 2.46

0.00630.79

normally distributed = 2.46g

normcdf (-1E99, 0.79) = 0.7852Or: normcdf (0.79,1E99) = 0.2148

Or: normcdf(2.465,1E99,2.46,0.0063) = 0.2137

Page 22: Chapter

22

Now…Now…

Lets describe the Distribution of the Lets describe the Distribution of the Sample Means Sample Means if if the parent population the parent population

is is non-non-normal, normal,

OrOr (same thing) we (same thing) we don’t know don’t know whether whether

the parent is normal or not.the parent is normal or not.

Page 23: Chapter

8-23

The following table and histogram give the probability distribution for rolling a fair die:

μ = 3.5, σ = 1.708Note that the population distribution is Uniform, NOT normal

Result Face on Die

Relative Frequency orProbability

1 0.1667

2 0.1667

3 0.1667

4 0.1667

5 0.1667

6 0.1667

Sampling from a Population that is Not Normal (or Unk)

Page 24: Chapter

Copyright © 2013, 2010 and 2007 Pearson Education, Inc.

Roll a dice 4 times (n = 4) and find the mean value Repeat this 200 times. Calculating the sample mean for each of the 200 samples. Repeat for n = 10 and 30. Estimate the sampling distribution of by obtaining 200 simple random samples of size andHistograms of the sampling distribution of the sample mean for each sample size are given on the next slide.

8-24

x

Probability Experiment

Page 25: Chapter

Copyright © 2013, 2010 and 2007 Pearson Education, Inc.

Roll a dice 4 times (n = 4) and find the mean value Repeat this 200 times. Calculating the sample mean for each of the 200 samples. Repeat for n = 10 and 30. Estimate the sampling distribution of by obtaining 200 simple random samples of size andHistograms of the sampling distribution of the sample mean for each sample size are given on the next slide.

8-25

x

Probability Experiment

Page 26: Chapter

Copyright © 2013, 2010 and 2007 Pearson Education, Inc.

1. Roll 4 dice (n = 4). Find the mean value of result

2. Repeat this 200 times. Calculate the sample mean for each of the 200 rolls. Repeat for n = 10 and n = 30.

3. Estimate the sampling distribution of for each “n”

Histograms of the distribution of the sample means for each sample size are given on the next slide.

8-26

x

Probability Experiment

x

Page 27: Chapter

8-27

Page 28: Chapter

8-28

Page 29: Chapter

8-29

Page 30: Chapter

Copyright © 2013, 2010 and 2007 Pearson Education, Inc.

The mean of the sampling distribution is equal to the mean of the parent population. The std error of the distribution of the sample means is regardless of “n”

The Central Limit Theorem: the shape of the distribution of the sample means becomes approx normal as the sample size n ≥ 30 increases, regardless of the shape of the parent population (means can use z-scores).

8-30

Key Points from Dice Experiment

n

Page 31: Chapter

8-31

Using the Central Limit Theorem

Suppose that the mean time for an auto oil change at Karlos’ Speedy Oil Change is 11.4 minutes with a standard

deviation of 3.2 min.

(a) If a random sample of n = 35 oil changes is selected, describe the sampling distribution of the sample mean.

(b) If a sample of n = 35 oil changes at Karlos’ is selected, what is the probability that the mean of these 35 oil change

times is less than 11 minutes?

Page 32: Chapter

8-32

Using the Central Limit Theorem

S”what is the probability the mean of 35 oil change times is less than 11 minutes?

that the mean time for an oil change at a “10-minute oil change joint” is 11.4 minutes with a standard deviation of 3.2 minutes.

(a) If a random sample of n = 35 oil changes is

Or….

2:Distr:2: normcdf(-1E99,11.0,11.4,0.5409) = 0.2298

IOW, if you did this experiment (n=35) 100 times, about 23 of those times (1/4) the mean oil change would take less than 11 mins.

Solution: since n > 30, is approximately normally distributed with mean = 11.4 min and std. dev. = min

P(Z < –0.74) = 0.2296

3.235

0.5409

x

11 11.4 0.740.5409

Z

Page 33: Chapter

Copyright © 2013, 2010 and 2007 Pearson Education, Inc.

Section 8.2

Distribution of the Sample

Proportion(not Mean)

Page 34: Chapter

Copyright © 2013, 2010 and 2007 Pearson Education, Inc.

If a random sample of size “n” is obtained from a population in which each outcome is binomial (Yes/No; True/False; Success/Fail) The sample proportion, (“p-hat”) is

where x is the number of “successes”.

The sample proportion is a statistic that estimates the population proportion, ρ (Gr:

rho).8-34

ˆ p

ˆ p xn

ˆ p

Page 35: Chapter

Copyright © 2013, 2010 and 2007 Pearson Education, Inc.

In a 2003 national Harris Poll, 1745 registered voters were asked whether they approved of the way President

Bush was handling the economy (wthtm). 349 responded “yes”, and the other 1396 said various other

unprintable things.

Obtain a point estimate (%) for the proportion of registered voters who approved of the way President

Bush was handling the economy.

8-35

Computing a Sample Proportion

349ˆ 0.20001745

xpn

Page 36: Chapter

Copyright © 2013, 2010 and 2007 Pearson Education, Inc.

According to a Time poll in 2008, 42% of registered voters believed that gay/lesbian couples should be allowed to (civil) marry.

Describe the sampling distribution of the sample proportion for samples of size:

n = 10, 50, 100.Note: We are using simulations to create the histograms on

the following slides.

8-36

Using Simulation to Describe the Distribution of the Sample Proportion

Page 37: Chapter

8-37

Page 38: Chapter

8-38

Page 39: Chapter

8-39

Page 40: Chapter

Copyright © 2013, 2010 and 2007 Pearson Education, Inc.

Shape: As the size of the sample “n” increases, the shape of the sampling proportion distribution

becomes approximately normal.

Center: The mean of the sampling proportion distribution equals the population proportion, ρ.

Spread: The std dev of the sampling proportion distribution decreases as the sample size, n

increases.

8-40

Key Points from Time Poll

ˆ p

Page 41: Chapter

Copyright © 2013, 2010 and 2007 Pearson Education, Inc.

For a simple random sample of size n with point estimate and population proportion ρ:

1. The shape of the sampling distribution of is approximately normal provided npq ≥ 10.

2. The mean of the sampling distribution of is

3. The standard deviation of the sampling distribution of is

Note: p + q=1 or q = 1 - p8-41

Sampling Distribution of

ˆ p

ˆ p p̂

ˆ p p̂

pqn

Page 42: Chapter

42

Normality for Qualitative Normality for Qualitative Variables (Proportions)…Variables (Proportions)… In order for us to use z-scores for In order for us to use z-scores for

probability of proportions, the probability of proportions, the distribution of must be approx distribution of must be approx normal.normal.

Your text uses npq ≥ 10 for this Your text uses npq ≥ 10 for this

requirement, but most books use requirement, but most books use both np ≥ 5 both np ≥ 5 and also and also nq ≥ 5 to nq ≥ 5 to

assume normalityassume normality

ˆ p

Page 43: Chapter

43

Normality RequirementNormality Requirement

In this civil marriage problem: p = 0 .42 In this civil marriage problem: p = 0 .42 (42% approve) , so q = 1-p = 0.58 (42% approve) , so q = 1-p = 0.58 (58% disapprove)(58% disapprove)

If n = 50, then: npq = 12.18 > 10 If n = 50, then: npq = 12.18 > 10 np = 21 >5 and nq = 29 >5np = 21 >5 and nq = 29 >5But, if n = 20, npq = 4.872 < 10But, if n = 20, npq = 4.872 < 10 np = 8.4 > 5 and nq = 11.6 > 5np = 8.4 > 5 and nq = 11.6 > 5

Page 44: Chapter

Copyright © 2013, 2010 and 2007 Pearson Education, Inc.

According to a Time poll conducted in 2008, 42% of registered voters believed that gay/lesbian couples

should be allowed to civil marry. Suppose that we obtain a random sample of 50

voters and determine which of those voters believe that gay/lesbian couples should be allowed to

marry (“Success” = x). Describe the sampling distribution of the sample proportion for registered voters who believe that

gay and lesbian couples should be allowed to marry.

8-44

Sampling Distribution of the Sample Proportion

Page 45: Chapter

Copyright © 2013, 2010 and 2007 Pearson Education, Inc.

np(1 – p) = npq = 50(0.42)(0.58) = 12.18 ≥ 10.Thus the sampling distribution of the sample proportion is therefore approximately normal with: mean = 0.42 (42% of the 50 = Success, 58% of the 50 = Fail)and standard deviation =

8-45

Proportion Marriage Survey

(0.42)(0.58) 0.0698 6.98%50

Page 46: Chapter

Copyright © 2013, 2010 and 2007 Pearson Education, Inc.

According to the Centers for Disease Control and Prevention (CDC) in 2004, 18.8% of school-aged children (aged 6-

11 yrs) were overweight.

In a random sample of 90 school-aged children, what is the probability that at least 19% are overweight?

Suppose in a random sample of 90 school-aged children you find 25 overweight children.

What might you conclude?

Note: we are dealing with % of children, not quantity of children, because this variable is proportion.

8-46

Compute Probabilities of a Sample Proportion

Page 47: Chapter

Copyright © 2013, 2010 and 2007 Pearson Education, Inc.

1. npq = 90(0.188)(0.812) ≈ 13.739 ≥ 10So, is approximately normal with mean = 0.188

and standard deviation =

8-47

Porky Pigs: a) In sample of 90 children, what is prob at least 19% are overweight?

ˆ p

(0.188)(0.812) 0.041290

0.19 0.188 0.04850.0412

( 0.0485) 0.5193

Z

P Z

Page 48: Chapter

Copyright © 2013, 2010 and 2007 Pearson Education, Inc.

1. As before, is approximately normal with mean = 0.188 and standard deviation = 0.0412

8-48

Porky Pigs: b) In a sample of 90, you find 25 overweight children. What might you conclude?

ˆ p

2.normcdf (2.179, 1E99) = 0.0147 = prob of finding 25/90 O/W

3.If the true population proportion is 0.188 (18.8% O/W), then the prob of finding 25/90 or 27.8% o/w children in a sample is “unusual” (any data point outside 2σ).

4.If we repeated this experiment 100 times, we would only expect to get this many O/W children (25/90) approx 1 or 2 times.

25ˆ 0.277890

p 0.2778 0.188 2.179

0.0412Z

Page 49: Chapter

Chap 2 49