1 Systematic Sampling (SYS) Up to now, we have only considered one design: SRS of size n from a population of size N New design: SYS DEFN: A 1-in-k systematic sample is a sample obtained by randomly selecting one sampling unit from the first k sampling units in the sampling frame, and every k-th sampling unit thereafter

Upload: aubrie-morgan

Post on 03-Jan-2016


TRANSCRIPT


Systematic Sampling (SYS)
- Up to now, we have only considered one design: SRS of size n from a population of size N
- New design: SYS
- DEFN: A 1-in-k systematic sample is a sample obtained by randomly selecting one sampling unit from the first k sampling units in the sampling frame, and every k-th sampling unit thereafter


Sampling procedure for SYS
- Have a frame, or list of N SUs
  - Assume SU = OU for now
- Determine sampling interval, k
  - k is the next integer after N/n
- Select first SU in the list
  - Choose a random number, R, between 1 and k
  - R-th SU is the first SU to be included in the sample
- Select every k-th SU after the R-th SU
- Sample includes unit R, unit R + k, unit R + 2k, …, unit R + (n-1)k
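The procedure above can be sketched in a few lines of Python. This is a minimal illustration, not part of the original slides: the function name `systematic_sample` and the `rng` parameter are hypothetical, units are assumed to be numbered 1..N, and "next integer after N/n" is read here as the ceiling of N/n.

```python
import math
import random

def systematic_sample(N, n, rng=random):
    """Draw a 1-in-k systematic sample from a frame numbered 1..N.

    Assumption: k is taken as ceil(N / n); selection stops at the end
    of the frame, so the realized sample size may differ from n.
    """
    k = math.ceil(N / n)               # sampling interval
    R = rng.randint(1, k)              # random start, 1 <= R <= k
    units = list(range(R, N + 1, k))   # R, R + k, R + 2k, ... up to N
    return k, R, units

k, R, units = systematic_sample(500, 75, random.Random(0))
print(k, R, units[:5], len(units))
```

Passing a seeded `random.Random` instance makes the draw reproducible; the only random act is the choice of R.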


Example
- Telephone survey of members in an organization about the organization's website use
- N = 500 members
- Have resources to do n = 75 calls
- N / n = 500/75 = 6.67, so k = 7
- Random number table entry: 52994
  - Rule: if pick 1, 2, …, 7, assign as R; otherwise discard #
- Select R = 5
- Take SU 5, then SU 5 + 7 = 12, then SU 12 + 7 = 19, 26, 33, 40, 47, …
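The example can be reproduced as a short script. This is a sketch under assumptions: the helper `first_valid_start` is hypothetical, and the random number table entry is assumed to be read one digit at a time from left to right (which only works when k is at most 9).

```python
def first_valid_start(digits, k):
    """Return the first digit in `digits` that lies in 1..k, else None.

    Assumption: the table entry is scanned digit by digit; digits
    outside 1..k (including 0) are discarded.
    """
    for ch in digits:
        d = int(ch)
        if 1 <= d <= k:
            return d
    return None

N, k = 500, 7
R = first_valid_start("52994", k)      # first digit, 5, is accepted
units = list(range(R, N + 1, k))       # 5, 12, 19, ...
print(R, units[:7], units[-1], len(units))
```

With R = 5 the last selected unit is 495, so the realized sample has 71 units rather than the planned 75.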


Example – 2
Arrange population in rows of length k = 7

  R:    1    2    3    4    5    6    7  |   i
        1    2    3    4    5    6    7  |   1
        8    9   10   11   12   13   14  |   2
       15   16   17   18   19   20   21  |   3
       22   23   24   25   26   27   28  |   4
        …                                |   …
      491  492  493  494  495  496  497  |  71
      498  499  500                      |  72


Properties of systematic sampling – 1
- Number of possible SYS samples of size n is k
- Only 1 random act: selecting R
  - After selecting the 1st SU, all other SUs to be included in the sample are predetermined
- A SYS is a cluster sample with sample size 1 (one cluster is selected)
  - Cluster = set of SUs separated by k units
- Unlike SRS, some sample sets of size n have no chance of being selected given a frame
- An SU belongs to 1 and only 1 sample


Properties of systematic sampling – 2
- Number of possible SYS samples of size n is k
- Are these samples equally likely to be selected?
  - Probability of selecting a sample: P{S} = 1/k
  - Inclusion probability for an SU: P{SU i ∈ S} = 1/k
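These two probabilities can be checked by brute force: list all k possible samples and count how many contain each unit. The sketch below assumes the N = 500, k = 7 frame from the earlier example; `all_sys_samples` is a hypothetical helper name.

```python
from fractions import Fraction

def all_sys_samples(N, k):
    """Enumerate the k possible 1-in-k systematic samples, keyed by start R."""
    return {R: list(range(R, N + 1, k)) for R in range(1, k + 1)}

N, k = 500, 7
samples = all_sys_samples(N, k)

# Count how many of the k samples contain each unit of the frame.
membership = {u: 0 for u in range(1, N + 1)}
for units in samples.values():
    for u in units:
        membership[u] += 1

# Each start R is equally likely, so P{S} = 1/k for every sample, and a
# unit's inclusion probability is (number of samples containing it) / k.
inclusion = {u: Fraction(c, k) for u, c in membership.items()}
print(len(samples), set(membership.values()), set(inclusion.values()))
```

Every unit appears in exactly one of the k samples, which is why the inclusion probability equals the sample selection probability, 1/k.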


Properties of systematic sampling – 3
- Plan for sample size of n, but actual sample size may vary
  - If N / k is an integer, then n = N / k
  - If N / k is NOT an integer, then n is either the integer part of (N / k) or the integer part of (N / k) + 1
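A quick way to see how the realized sample size depends on the random start is to tabulate it for every R. This is an illustrative sketch; `sys_sample_sizes` is a hypothetical name, and the frame is assumed to be numbered 1..N.

```python
def sys_sample_sizes(N, k):
    """Realized sample size for each possible random start R = 1..k."""
    return {R: len(range(R, N + 1, k)) for R in range(1, k + 1)}

print(sys_sample_sizes(500, 7))   # N / k = 71.43, not an integer
print(sys_sample_sizes(490, 7))   # N / k = 70, an integer
```

For N = 500 and k = 7, starts R = 1, 2, 3 give 72 units and starts R = 4, …, 7 give 71 units; for N = 490 every start gives exactly 70 units.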


Properties of systematic sampling – 4
- Because only the starting SU of a SYS sample is randomized, a direct estimate of the variance of the sampling distribution cannot be obtained
- Under SRS, the variance of the sampling distribution was a function of the population variance, S^2
  - There is no such relationship for SYS


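Estimation for SYS
- Use SRS formulas to estimate population parameters and variance of estimator
  - Estimate pop MEAN $\bar{y}_U$ with $\bar{y}$ and $\hat{V}[\bar{y}] = \left(1 - \frac{n}{N}\right)\frac{s^2}{n}$
  - Estimate pop TOTAL $t$ with $\hat{t} = N\bar{y}$ and $\hat{V}[\hat{t}] = N^2\,\hat{V}[\bar{y}]$
  - Estimate pop PROPORTION $p$ with $\hat{p}$ and $\hat{V}[\hat{p}] = \left(1 - \frac{n}{N}\right)\frac{\hat{p}(1-\hat{p})}{n-1}$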
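The SRS formulas translate directly into code. The sketch below is illustrative only: the function names `srs_mean_total` and `srs_proportion` are hypothetical, and the data in the usage line are made up to show the arithmetic, not taken from the survey example.

```python
from statistics import mean, variance

def srs_mean_total(y, N):
    """SRS-formula estimates of the population mean and total."""
    n = len(y)
    ybar = mean(y)
    s2 = variance(y)                    # sample variance, divisor n - 1
    v_ybar = (1 - n / N) * s2 / n       # finite population correction applied
    return {
        "mean": ybar,
        "var_mean": v_ybar,
        "total": N * ybar,
        "var_total": N ** 2 * v_ybar,
    }

def srs_proportion(successes, n, N):
    """SRS-formula estimate of a population proportion and its variance."""
    p_hat = successes / n
    v_p = (1 - n / N) * p_hat * (1 - p_hat) / (n - 1)
    return p_hat, v_p

# Made-up data for illustration: 5 observations from a frame of 50 units.
print(srs_mean_total([1, 2, 3, 4, 5], N=50))
print(srs_proportion(successes=30, n=71, N=500))
```

Under SYS these are approximations; how well the variance formulas perform depends on the ordering of the frame.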


Properties of systematic sampling – 5
- Properties of SRS estimators depend on frame ordering
- SRS estimators for population parameters usually have little or no bias under SYS
- Precision of SRS estimators under SYS depends on ordering of the sampling frame


Order of sampling frame
- Random order
  - SYS acts very much like SRS
  - SRS variance formula is a good approximation
- Ordered in relation to y
  - Improves representativeness of sample
  - SRS formula overestimates sampling variance (estimate is more precise than indicated by SE)
- Periodicity in y coincides with sampling interval k
  - Poor quality estimates
  - SRS formula underestimates sampling variance (overstates precision of estimate)
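The ordered and periodic cases can be illustrated on a small artificial frame. Because SYS has only k possible samples, the true sampling variance of the mean can be computed exactly by enumerating them and compared with the average value of the SRS variance formula. Everything below is an assumption made for illustration: the frame of N = 70 units with k = 7, the y values, and the function name `sys_vs_srs_variance`.

```python
from statistics import mean, variance

def sys_vs_srs_variance(y, k):
    """Return (true SYS variance of the mean, average SRS-formula estimate).

    Assumes len(y) is a multiple of k, so all k samples have equal size.
    """
    N = len(y)
    samples = [y[start::k] for start in range(k)]     # the k possible samples
    pop_mean = mean(y)
    sample_means = [mean(s) for s in samples]
    true_var = mean([(m - pop_mean) ** 2 for m in sample_means])
    srs_est = mean([(1 - len(s) / N) * variance(s) / len(s) for s in samples])
    return true_var, srs_est

ordered = list(range(1, 71))              # frame sorted by y
periodic = [i % 7 for i in range(70)]     # period of y equals k = 7

print("ordered :", sys_vs_srs_variance(ordered, 7))
print("periodic:", sys_vs_srs_variance(periodic, 7))
```

On this toy frame the true variance is 4 in both cases. The SRS formula averages 38.5 for the ordered frame (an overestimate) and 0 for the periodic frame (an underestimate, since every sample contains identical y values).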


Example – 3
- Suppose X [age of member] is correlated with Y [use of org website]
- Sort list by X before selecting sample

  k:    1    2    3    4    5    6    7  |  X      |   i
        1    2    3    4    5    6    7  |  young  |   1
        8    9   10   11   12   13   14  |         |   2
       15   16   17   18   19   20   21  |         |   3
       22   23   24   25   26   27   28  |         |   4
        …                                |  mid    |   …
      491  492  493  494  495  496  497  |         |  71
      498  499  500                      |  old    |  72


Practicalities
- Another building block (like SRS) used in combination with other designs
- SYS is more likely to be used than SRS if there is no stratification or clustering
- Useful when a full frame cannot be enumerated at beginning of study
  - Exit polls for elections
  - Entrance polls for parks
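When the frame is not available up front, as in an exit poll, the same rule can be applied to arrivals as they occur. The sketch below is an assumption about how this might be coded, not a method stated in the slides: `every_kth` is a hypothetical generator, and the stream of voters is simulated with a simple range.

```python
import random

def every_kth(stream, k, rng=random):
    """Yield every k-th item of a stream, starting at a random position R in 1..k.

    The length of the stream does not need to be known in advance.
    """
    R = rng.randint(1, k)
    for position, item in enumerate(stream, start=1):
        if position >= R and (position - R) % k == 0:
            yield item

# Simulated arrivals: voters numbered in the order they leave the polling place.
arrivals = range(1, 31)
selected = list(every_kth(arrivals, k=5, rng=random.Random(1)))
print(selected)
```

The realized sample size depends on how many units eventually arrive, so k is usually chosen from an expected turnout rather than a known N.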


Practicalities – 2
- Best if you can sort the sampling frame by an auxiliary variable X that is related to Y
  - Improve representativeness of sample (relative to SRS)
  - Improve precision of estimates
  - Essentially offers implicit form of stratification