sampling we have a known population. we ask “what would happen if i drew lots and lots of random...

23

Post on 21-Dec-2015

213 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Sampling We have a known population.  We ask “what would happen if I drew lots and lots of random samples from this population?”
Page 2: Sampling We have a known population.  We ask “what would happen if I drew lots and lots of random samples from this population?”

Sampling• We have a known

population. We ask “what would happen if

I drew lots and lots of random samples from this population?”

Page 3: Sampling We have a known population.  We ask “what would happen if I drew lots and lots of random samples from this population?”

Inference• We have a known sample. We ask “what kind of

population might this sample have been drawn from?”

Page 4: Sampling We have a known population.  We ask “what would happen if I drew lots and lots of random samples from this population?”

The Central Limit Theorem• If you draw simple random samples of size n

• from a population with mean and variance

• then

the expected mean of x-bar is

the expected variance of x-bar is / n

the expected histogram of x-bar is approximately normal

Page 5: Sampling We have a known population.  We ask “what would happen if I drew lots and lots of random samples from this population?”
Page 6: Sampling We have a known population.  We ask “what would happen if I drew lots and lots of random samples from this population?”

Estimating mu from sample data

• Is this true?

• amu = sample mean

• Why not? Because the Central Limit Theorem tells us that, if we

drew lots and lots of sample, the sample means vary. Some are bigger than mu and others are smaller than mu.

Page 7: Sampling We have a known population.  We ask “what would happen if I drew lots and lots of random samples from this population?”

Estimating mu from sample data

• What about this?

• mu = somewhere in the neighborhood

• of the sample mean

• But how do we define neighborhood?

Page 8: Sampling We have a known population.  We ask “what would happen if I drew lots and lots of random samples from this population?”

Example 6.1 We have a sample of 500 high-school seniors, selected at

random from the population of all high-school seniors in California. For the 500 kids in the sample, their average score on the math section of the SAT is 461.

Known: sample mean is 461 Unknown: population mean Assumed: population sigma is 100

Page 9: Sampling We have a known population.  We ask “what would happen if I drew lots and lots of random samples from this population?”

The Central Limit Theorem• If you draw simple random samples of size 500

from a population with mean and standard deviation of 100, then

the expected mean of x-bar is

the expected st dev of x-bar is about 4.5

the expected histogram of x-bar is approximately normal

Page 10: Sampling We have a known population.  We ask “what would happen if I drew lots and lots of random samples from this population?”

Table A tells us...

• ...about 68% of sample means should fall within 4.5 points

of mu

• ...about 95% of sample means should fall within 9 points

of mu

• ...about 99.75% of sample means should fall within 13.5

points of mu

Page 11: Sampling We have a known population.  We ask “what would happen if I drew lots and lots of random samples from this population?”

mu-9 mu mu+9

Sample Means

About 95% of sample means should fall within 9 points of mu

Page 12: Sampling We have a known population.  We ask “what would happen if I drew lots and lots of random samples from this population?”
Page 13: Sampling We have a known population.  We ask “what would happen if I drew lots and lots of random samples from this population?”

mu is 452

435 440 445 450 455 460 465 470 475 480 485

Sample Means

Page 14: Sampling We have a known population.  We ask “what would happen if I drew lots and lots of random samples from this population?”

mu is 470

435 440 445 450 455 460 465 470 475 480 485

Sample Means

Page 15: Sampling We have a known population.  We ask “what would happen if I drew lots and lots of random samples from this population?”

mu is 452

435 440 445 450 455 460 465 470 475 480 485

Sample Means

mu is 470

435 440 445 450 455 460 465 470 475 480 485

Sample Means

Page 16: Sampling We have a known population.  We ask “what would happen if I drew lots and lots of random samples from this population?”

The 95% Confidence Interval If mu is any number less than 452, then our

sample mean would be surprisingly large. If mu is any number greater than 470, then our

sample mean would be surprisingly small. Therefore, the 95% confidence interval for mu

is the range from 452 to 470. If mu is inside this range, then our sample is

not unusual (according to the 95% rule).

Page 17: Sampling We have a known population.  We ask “what would happen if I drew lots and lots of random samples from this population?”
Page 18: Sampling We have a known population.  We ask “what would happen if I drew lots and lots of random samples from this population?”
Page 19: Sampling We have a known population.  We ask “what would happen if I drew lots and lots of random samples from this population?”

Other confidence intervals• If we suppose that the sample mean is within 1.645

standard deviations of mu, then we get a 90% confidence interval.

• If we suppose that the sample mean is within 2.576 standard deviations of mu, then we get a 99% confidence interval.

Page 20: Sampling We have a known population.  We ask “what would happen if I drew lots and lots of random samples from this population?”

Effect of sample size on the confidence interval

• As n gets larger, the expected variability of the sample means gets smaller.

Larger sample sizes produce narrower confidence intervals (other things equal).

Smaller sample sizes produce wider confidence intervals (other things equal).

Page 21: Sampling We have a known population.  We ask “what would happen if I drew lots and lots of random samples from this population?”
Page 22: Sampling We have a known population.  We ask “what would happen if I drew lots and lots of random samples from this population?”
Page 23: Sampling We have a known population.  We ask “what would happen if I drew lots and lots of random samples from this population?”

Some cautions The data must be a simple random sample from the

population The sample mean, and therefore the confidence interval,

may be too heavily influenced by one or more outliers If the sample size is small and population is not

approximately normal, then the CLT doesn’t promise the approximately normal distribution for the sample means