selecting sampling strategy chris olsen [email protected] 12/14/20151sampling strategies
TRANSCRIPT
The sampling question du joir: just how tall IS Iowa corn?
04/21/23 2Sampling Strategies
Professional basketball players’ view of Iowa Corn
(In our dreams…but what about reality??)
04/21/23 3Sampling Strategies
In order to measure the height of a stalk of corn we must chop it down.
Measuring all the corn stalks is not on the table; farmers being what they are, we have only one cornstalk per Iowa county that we can utilize.
The economy being what it is, we can only afford to chop down a small number of cornstalks.
Our problem: identify the counties.
04/21/23 4Sampling Strategies
The generation of a subset of a population is known as “sampling” from the population.
We want our sample to be “representative” of the population – if it is, we can make credible statements about our population by generalizing from the sample.
We maximize the probability of getting a representative sample by generating the sample randomly.
The randomization scheme allows the calculation of probability distributions (“sampling” distributions) of statistics.
04/21/23 5Sampling Strategies
A random (“probability”) sample is one such that each population member has a greater-than-zero probability of selection.
The basic random sampling strategy is the “simple” random sample.
A simple random sample of size n from a population of size N is a sample taken in such a way that each of the possible NCn samples is equally likely.
04/21/23 6Sampling Strategies
Iowa has 99 counties -- perfect for a random number table…
04/21/23 7Sampling Strategies
Bubble, bubble, toil and trouble…
04/21/23 8Sampling Strategies
(All other states use calculators)
MathPrbrandInt(1, 99) (A random county)
MathPrbrandInt(1, 99, 10) (10 random counties)
MathPrbrandInt(1, 99, 15) (15 random counties, anticipating bad luck)
MathPrbrandInt(1, 99, 15)L1 (Put in List1)
04/21/23 9Sampling Strategies
Oops?
04/21/23 10Sampling Strategies
A possible improvement on the simple random sampling strategy is to take a stratified random sample.
Stratified random sampling capitalizes on known (or possibly suspected, but be careful) pockets of homogeneity in the population.
04/21/23 11Sampling Strategies
Possible pockets: Golden Gopher Droppings?
04/21/23 12Sampling Strategies
If certain areas of the state have been contaminated by a certain other state this might affect corn height, we would want to take note of this in our sampling – in advance!
We would not want to have each element of the sample from a non-contaminated county;
we would not want to have each element of the sample from a contaminated county;
we would want each part – contaminated and not – represented in our sample.
04/21/23 13Sampling Strategies
To accomplish this representation, we could use a stratified random sample.
60 Pristine, 39 contaminated…
Pristine counties
Contaminated counties
04/21/23 14Sampling Strategies
MathPrbrandInt(1, 60, 6) (6 random pristine counties)
MathPrbrandInt(1, 39, 4) (4 random contaminated counties)
04/21/23 15Sampling Strategies
Bravo!
04/21/23 16Sampling Strategies
In some circumstances we might have reason to believe that the variability in the state is captured in each region of the state.
As an example, consider the quadrennial blitz known as the presidential primary season. All the candidates don boots and overalls and milk the standard cow.
This event generally causes all the news channels to take a poll of Iowans on their opinions about the milking technique of the candidates.
04/21/23 17Sampling Strategies
Newspersons would probably want to spend little time “down on the farm,” and simple random sampling could result in lots of drive time! So some sort of improvement on the simple random sample is desired.
If the variability and representativeness (?) in the state is captured in each region, why not just randomly pick a few regions in the state ?
Why not, that is, take a “cluster sample?” (a random sample of regions.)
04/21/23 18Sampling Strategies
randInt(1, 9, 2)L1
04/21/23 19Sampling Strategies
A special cluster sample: The transect.
04/21/23 20Sampling Strategies
04/21/23 21Sampling Strategies
04/21/23 22Sampling Strategies
Some newspersons might be unable to follow complex directions. It is possible, however, they can at least count up to some relatively small number.
In this situation, systematic random sampling might be considered.
04/21/23 23Sampling Strategies
The Systematic Sample: Getting it done
1.Decide on the sampling fraction. (Judgment)
2.Decide on a starting point. (Random!)
3. Count off by n’s… (Arithmetic)
04/21/23 24Sampling Strategies
Systematic – every 11
04/21/23 25Sampling Strategies
Systematic – every 5
04/21/23 26Sampling Strategies
Systematic – alphabetical, every 9
04/21/23 27Sampling Strategies
Questions before practice?
04/21/23 28Sampling Strategies
A Review of Sampling Strategies:
Simple Random Sample: The Basic Strategy, requires a list
Stratified Random Sample: Capitalizes on pockets of homogeneity
Cluster sample: Capitalizes on there being NO pockets of homogeneity
Systematic sample: (Alleged) Population arrives serially
04/21/23 29Sampling Strategies
Problem #1: The Cultured Crowd
04/21/23 30Sampling Strategies
Problem #2: Some populations are elusive and/or difficult to sample:
04/21/23 31Sampling Strategies
Problem #2: Pick your difficult population…
1.Homeless
2.Illegal aliens
3.Teen texters in school
04/21/23 32Sampling Strategies
Problem #3:
The Case of the Fiddler Crab…(Uca pugilato)
04/21/23 33Sampling Strategies
Path integration, eh?
04/21/23 34Sampling Strategies
The Sex Ratio of Fiddler Crabs?
Just to be clear, the sex ratio we’re talking about is
• males / females, • NOT # events / time!!!
04/21/23 35Sampling Strategies
Just the facts, ma’am…
That big claw is for courtship & fighting, but is dysfunctional for foraging. (Males fight & forage more?)
Crabs outside burrows are susceptible to predation.
Males are territorial and promiscuous.
Females forage closer to water sources than males.
Breeding females may be smaller.
04/21/23 36Sampling Strategies
The end!
04/21/23 37Sampling Strategies