chapter
DESCRIPTION
Chapter. 8. Sampling Distributions. Section 8.1. Distribution of the Sample Mean. Statistics such as the “mean” ( ) are random variables. Their value varies from sample to sample, so they have probability distributions associated with them. - PowerPoint PPT PresentationTRANSCRIPT
Copyright © 2013, 2010 and 2007 Pearson Education, Inc.
Chapter
Sampling Distributions
8
Chap 2 2
Copyright © 2013, 2010 and 2007 Pearson Education, Inc.
Section 8.1
Distribution of the
Sample Mean
Copyright © 2013, 2010 and 2007 Pearson Education, Inc.
Statistics such as the “mean” ( ) are random variables. Their value varies from
sample to sample, so they have probability distributions associated with them.
In this chapter we focus on the shape (normal?), center (peak) and
spread(deviation) of distributions of .
8-4
x
x
Copyright © 2013, 2010 and 2007 Pearson Education, Inc.
The sampling distribution of the sample mean is the probability distribution of all possible values of the random variable “ ”
Take many samples of size “n” from a population whose mean is μ and standard
deviation is σ.
Then plot just the means ( ) of each sample.
8-5
x
x
8-6
Illustrating Sampling Distributions
Step 1: Obtain a simple random sample of size “n”
Step 2: Compute the sample mean.
Step 3: Repeat Steps 1 and 2 until all possible simple random samples of size n have been obtained (in theory). In practice, take as many n-sized samples as practicable (time & money…).
Copyright © 2013, 2010 and 2007 Pearson Education, Inc.
The weights of pennies minted after 1982 are approx normally distributed with mean = 2.46g (grams) and
std dev = 0.02g. (28g = 1 oz)
Approximate the sampling distribution of the sample mean by taking 200 simple random samples of size n = 5 pennies from this population (in other words, find the mean weight of each 5-penny sample and then plot
just the 200 ‘s, not the weight of all 1000 pennies.)
8-7
Sampling Distribution of the Sample Mean: Normal Population
x
8-8
The data on the following slide represent the sample means for the 200 simple random
samples, each of size n = 5.
For example, the first sample of n = 5 pennies had the following weight data (in g):
2.493 2.466 2.473 2.492 2.471
Note: = 2.479g for this sample
x
8-9
The data on the following slide represent the sample means for the 200 simple random
samples, each of size n = 5.
For example, the first sample of n = 5 pennies had the following weight data (in g):
2.493 2.466 2.473 2.492 2.471
Note: = 2.479g for this sample
x
8-10
The data on the following slide represent the sample means for the 200 simple random
samples, each of size n = 5.
For example, the first sample of n = 5 pennies had the following weight data (in g):
2.493 2.466 2.473 2.492 2.471
Note: = 2.479g for this sample
x
8-11
Sample Means for Samples of Size n = 5
8-12
The mean of the 200 sample means is 2.46g, the same as the mean of the
population.
The standard deviation of the sample means (“standard error”) is 0.0086g, which is smaller than the standard
deviation of the population.
The next slide shows the histogram of all 200 sample means.
8-13
Copyright © 2013, 2010 and 2007 Pearson Education, Inc.
What role does “n”, the size of each sample, play in the value of the standard deviation of the
sample means?
As the size “n” of each sample increases, the standard deviation of the distribution of the
sample mean decreases.
8-14
Copyright © 2013, 2010 and 2007 Pearson Education, Inc.
Approximate the distribution of the sample means by obtaining 200 simple random samples of size n = 20 pennies
(not 5)….
from the same population of pennies minted after 1982 (μ = 2.46 grams and σ
= 0.02 grams)
8-15
The Impact of Sample Size “n” on Sampling Variation (Variance, Std Dev)
8-16
The mean of the 200 sample means for n = 20 pennies is still 2.46g, but the standard deviation is now 0.0045g
(0.0086g for n = 5). There is less variation in the distribution of the sample means when n =20p than with n = 5p and the curve is more “normal”.
8-18
If a random sample of size “n” is drawn from a large population with mean “μ” and std dev
“σ”. Then the distribution of the ‘s will have:
mean
std dev
is the “standard error of the mean” or just “standard error”
The Mean & Standard Deviation of theSampling Distribution of
x
x
x n
.
x
x
Copyright © 2013, 2010 and 2007 Pearson Education, Inc.
Theorem If the parent population of the random variable X is normally
distributed, the distribution of the sample means from that population
will also be normally distributed.
8-19
x
Copyright © 2013, 2010 and 2007 Pearson Education, Inc.
The weights of pennies minted after 1982 (pop) are normally distributed with:
mean = 2.46g and std dev = 0.02g.
What is the probability that, in a random sample of 10 of these pennies, the mean weight of the
sample is at least 2.465 grams?
8-20
Distribution of the Sample Mean
Copyright © 2013, 2010 and 2007 Pearson Education, Inc.
.
.P(Z > 0.79) = 1 – 0.7852 = 0.2148
.
x
x
x 0.02
100.0063
Z 2.465 2.46
0.00630.79
normally distributed = 2.46g
normcdf (-1E99, 0.79) = 0.7852Or: normcdf (0.79,1E99) = 0.2148
Or: normcdf(2.465,1E99,2.46,0.0063) = 0.2137
22
Now…Now…
Lets describe the Distribution of the Lets describe the Distribution of the Sample Means Sample Means if if the parent population the parent population
is is non-non-normal, normal,
OrOr (same thing) we (same thing) we don’t know don’t know whether whether
the parent is normal or not.the parent is normal or not.
8-23
The following table and histogram give the probability distribution for rolling a fair die:
μ = 3.5, σ = 1.708Note that the population distribution is Uniform, NOT normal
Result Face on Die
Relative Frequency orProbability
1 0.1667
2 0.1667
3 0.1667
4 0.1667
5 0.1667
6 0.1667
Sampling from a Population that is Not Normal (or Unk)
Copyright © 2013, 2010 and 2007 Pearson Education, Inc.
Roll a dice 4 times (n = 4) and find the mean value Repeat this 200 times. Calculating the sample mean for each of the 200 samples. Repeat for n = 10 and 30. Estimate the sampling distribution of by obtaining 200 simple random samples of size andHistograms of the sampling distribution of the sample mean for each sample size are given on the next slide.
8-24
x
Probability Experiment
Copyright © 2013, 2010 and 2007 Pearson Education, Inc.
Roll a dice 4 times (n = 4) and find the mean value Repeat this 200 times. Calculating the sample mean for each of the 200 samples. Repeat for n = 10 and 30. Estimate the sampling distribution of by obtaining 200 simple random samples of size andHistograms of the sampling distribution of the sample mean for each sample size are given on the next slide.
8-25
x
Probability Experiment
Copyright © 2013, 2010 and 2007 Pearson Education, Inc.
1. Roll 4 dice (n = 4). Find the mean value of result
2. Repeat this 200 times. Calculate the sample mean for each of the 200 rolls. Repeat for n = 10 and n = 30.
3. Estimate the sampling distribution of for each “n”
Histograms of the distribution of the sample means for each sample size are given on the next slide.
8-26
x
Probability Experiment
x
8-27
8-28
8-29
Copyright © 2013, 2010 and 2007 Pearson Education, Inc.
The mean of the sampling distribution is equal to the mean of the parent population. The std error of the distribution of the sample means is regardless of “n”
The Central Limit Theorem: the shape of the distribution of the sample means becomes approx normal as the sample size n ≥ 30 increases, regardless of the shape of the parent population (means can use z-scores).
8-30
Key Points from Dice Experiment
n
8-31
Using the Central Limit Theorem
Suppose that the mean time for an auto oil change at Karlos’ Speedy Oil Change is 11.4 minutes with a standard
deviation of 3.2 min.
(a) If a random sample of n = 35 oil changes is selected, describe the sampling distribution of the sample mean.
(b) If a sample of n = 35 oil changes at Karlos’ is selected, what is the probability that the mean of these 35 oil change
times is less than 11 minutes?
8-32
Using the Central Limit Theorem
S”what is the probability the mean of 35 oil change times is less than 11 minutes?
that the mean time for an oil change at a “10-minute oil change joint” is 11.4 minutes with a standard deviation of 3.2 minutes.
(a) If a random sample of n = 35 oil changes is
Or….
2:Distr:2: normcdf(-1E99,11.0,11.4,0.5409) = 0.2298
IOW, if you did this experiment (n=35) 100 times, about 23 of those times (1/4) the mean oil change would take less than 11 mins.
Solution: since n > 30, is approximately normally distributed with mean = 11.4 min and std. dev. = min
P(Z < –0.74) = 0.2296
3.235
0.5409
x
11 11.4 0.740.5409
Z
Copyright © 2013, 2010 and 2007 Pearson Education, Inc.
Section 8.2
Distribution of the Sample
Proportion(not Mean)
Copyright © 2013, 2010 and 2007 Pearson Education, Inc.
If a random sample of size “n” is obtained from a population in which each outcome is binomial (Yes/No; True/False; Success/Fail) The sample proportion, (“p-hat”) is
where x is the number of “successes”.
The sample proportion is a statistic that estimates the population proportion, ρ (Gr:
rho).8-34
ˆ p
ˆ p xn
ˆ p
Copyright © 2013, 2010 and 2007 Pearson Education, Inc.
In a 2003 national Harris Poll, 1745 registered voters were asked whether they approved of the way President
Bush was handling the economy (wthtm). 349 responded “yes”, and the other 1396 said various other
unprintable things.
Obtain a point estimate (%) for the proportion of registered voters who approved of the way President
Bush was handling the economy.
8-35
Computing a Sample Proportion
349ˆ 0.20001745
xpn
Copyright © 2013, 2010 and 2007 Pearson Education, Inc.
According to a Time poll in 2008, 42% of registered voters believed that gay/lesbian couples should be allowed to (civil) marry.
Describe the sampling distribution of the sample proportion for samples of size:
n = 10, 50, 100.Note: We are using simulations to create the histograms on
the following slides.
8-36
Using Simulation to Describe the Distribution of the Sample Proportion
8-37
8-38
8-39
Copyright © 2013, 2010 and 2007 Pearson Education, Inc.
Shape: As the size of the sample “n” increases, the shape of the sampling proportion distribution
becomes approximately normal.
Center: The mean of the sampling proportion distribution equals the population proportion, ρ.
Spread: The std dev of the sampling proportion distribution decreases as the sample size, n
increases.
8-40
Key Points from Time Poll
ˆ p
Copyright © 2013, 2010 and 2007 Pearson Education, Inc.
For a simple random sample of size n with point estimate and population proportion ρ:
1. The shape of the sampling distribution of is approximately normal provided npq ≥ 10.
2. The mean of the sampling distribution of is
3. The standard deviation of the sampling distribution of is
Note: p + q=1 or q = 1 - p8-41
Sampling Distribution of
ˆ p
ˆ p p̂
ˆ p p̂
pqn
p̂
42
Normality for Qualitative Normality for Qualitative Variables (Proportions)…Variables (Proportions)… In order for us to use z-scores for In order for us to use z-scores for
probability of proportions, the probability of proportions, the distribution of must be approx distribution of must be approx normal.normal.
Your text uses npq ≥ 10 for this Your text uses npq ≥ 10 for this
requirement, but most books use requirement, but most books use both np ≥ 5 both np ≥ 5 and also and also nq ≥ 5 to nq ≥ 5 to
assume normalityassume normality
ˆ p
43
Normality RequirementNormality Requirement
In this civil marriage problem: p = 0 .42 In this civil marriage problem: p = 0 .42 (42% approve) , so q = 1-p = 0.58 (42% approve) , so q = 1-p = 0.58 (58% disapprove)(58% disapprove)
If n = 50, then: npq = 12.18 > 10 If n = 50, then: npq = 12.18 > 10 np = 21 >5 and nq = 29 >5np = 21 >5 and nq = 29 >5But, if n = 20, npq = 4.872 < 10But, if n = 20, npq = 4.872 < 10 np = 8.4 > 5 and nq = 11.6 > 5np = 8.4 > 5 and nq = 11.6 > 5
Copyright © 2013, 2010 and 2007 Pearson Education, Inc.
According to a Time poll conducted in 2008, 42% of registered voters believed that gay/lesbian couples
should be allowed to civil marry. Suppose that we obtain a random sample of 50
voters and determine which of those voters believe that gay/lesbian couples should be allowed to
marry (“Success” = x). Describe the sampling distribution of the sample proportion for registered voters who believe that
gay and lesbian couples should be allowed to marry.
8-44
Sampling Distribution of the Sample Proportion
Copyright © 2013, 2010 and 2007 Pearson Education, Inc.
np(1 – p) = npq = 50(0.42)(0.58) = 12.18 ≥ 10.Thus the sampling distribution of the sample proportion is therefore approximately normal with: mean = 0.42 (42% of the 50 = Success, 58% of the 50 = Fail)and standard deviation =
8-45
Proportion Marriage Survey
(0.42)(0.58) 0.0698 6.98%50
Copyright © 2013, 2010 and 2007 Pearson Education, Inc.
According to the Centers for Disease Control and Prevention (CDC) in 2004, 18.8% of school-aged children (aged 6-
11 yrs) were overweight.
In a random sample of 90 school-aged children, what is the probability that at least 19% are overweight?
Suppose in a random sample of 90 school-aged children you find 25 overweight children.
What might you conclude?
Note: we are dealing with % of children, not quantity of children, because this variable is proportion.
8-46
Compute Probabilities of a Sample Proportion
Copyright © 2013, 2010 and 2007 Pearson Education, Inc.
1. npq = 90(0.188)(0.812) ≈ 13.739 ≥ 10So, is approximately normal with mean = 0.188
and standard deviation =
8-47
Porky Pigs: a) In sample of 90 children, what is prob at least 19% are overweight?
ˆ p
(0.188)(0.812) 0.041290
0.19 0.188 0.04850.0412
( 0.0485) 0.5193
Z
P Z
Copyright © 2013, 2010 and 2007 Pearson Education, Inc.
1. As before, is approximately normal with mean = 0.188 and standard deviation = 0.0412
8-48
Porky Pigs: b) In a sample of 90, you find 25 overweight children. What might you conclude?
ˆ p
2.normcdf (2.179, 1E99) = 0.0147 = prob of finding 25/90 O/W
3.If the true population proportion is 0.188 (18.8% O/W), then the prob of finding 25/90 or 27.8% o/w children in a sample is “unusual” (any data point outside 2σ).
4.If we repeated this experiment 100 times, we would only expect to get this many O/W children (25/90) approx 1 or 2 times.
25ˆ 0.277890
p 0.2778 0.188 2.179
0.0412Z
Chap 2 49