moazzam ali sampling. how we select from an infinite number of observations we could possibly make...

21
MOAZZAM ALI Sampling

Upload: hortense-daniel

Post on 23-Dec-2015

216 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: MOAZZAM ALI Sampling. How we select from an infinite number of observations we could possibly make Why do we sample? – Size of the population – Cost of

MOAZZAM ALI

Sampling

Page 2: MOAZZAM ALI Sampling. How we select from an infinite number of observations we could possibly make Why do we sample? – Size of the population – Cost of

Sampling

• How we select from an infinite number of observations we could possibly make

• Why do we sample?– Size of the population– Cost of obtaining elements– Convenience and accessibility of elements

• Sampling is the process of obtaining information from a subset (sample) of a larger group (population)

• The results for the sample are then used to make estimates of the larger group

• From Chapter 7 in Babbie & Mouton (2001)

Page 3: MOAZZAM ALI Sampling. How we select from an infinite number of observations we could possibly make Why do we sample? – Size of the population – Cost of

Characteristics of a Good Sample

Good sampling design should:

• Relate to the objectives of the investigation • Be practical and achievable; • Be cost –  effective in terms of equipment and

labour; • Provide estimates of population parameters that

are truly representative and unbiased. Ideally, representative samples should be:

• Taken at random so that every member of the population of data has an equal chance of selection;

• Large enough to give sufficient precision; • Unbiased by the sampling procedure or

equipment.

Page 4: MOAZZAM ALI Sampling. How we select from an infinite number of observations we could possibly make Why do we sample? – Size of the population – Cost of

Sampling Terminology

Element

• The unit about which information is collected• Typically the elements are people• But look at the section on “unit of analysis” again:

any of them could be elements (schools, universities, corporations, etc.)

Page 5: MOAZZAM ALI Sampling. How we select from an infinite number of observations we could possibly make Why do we sample? – Size of the population – Cost of

Sampling Terminology

Population

• All the potential study elements, as defined• Careful specification of the population

Sample Population

• Almost impossible to guarantee that every element meeting your definition of “the population” has a chance to be selected into the sample.

• Thus the “study population” will be somewhat smaller than “the population”

Page 6: MOAZZAM ALI Sampling. How we select from an infinite number of observations we could possibly make Why do we sample? – Size of the population – Cost of

Sampling Terminology

Research design - sampling 6

Sampling Unit

• Typically the sampling units are the same as the elements and probably the units of analysis

• (We are not going to look into more complex sampling units)Sampling Frame

• The actual list of sampling units (or elements). • e.g. if you want to study “Students at the University of

Cape Town”, there is a list of such sampling units (but there are a number of definition issues to be resolved here) Sample

• A subset of a population selected to estimate the behaviour or characteristics of the population.

Page 7: MOAZZAM ALI Sampling. How we select from an infinite number of observations we could possibly make Why do we sample? – Size of the population – Cost of

Sampling Designs

7

Basically two sampling strategies available:

Probability samplingNon-probability Sampling

Page 8: MOAZZAM ALI Sampling. How we select from an infinite number of observations we could possibly make Why do we sample? – Size of the population – Cost of

Probability Sampling

Each member of the population has a certain probability to be selected into the sample

Types of Probability Sampling

Random Stratified Random Systematic Cluster

Page 9: MOAZZAM ALI Sampling. How we select from an infinite number of observations we could possibly make Why do we sample? – Size of the population – Cost of

Random Sampling

Population members are selected directly from the sampling frame

Equal probability of selection for every member (sample size/population size)

400/10,000 = .04 Use random number table or random number

generator

Page 10: MOAZZAM ALI Sampling. How we select from an infinite number of observations we could possibly make Why do we sample? – Size of the population – Cost of

Systematic Sampling

Order all units in the sampling frame based on some variable and number them from 1 to N

Choose a random starting place from 1 to N and then sample every k units after that

Page 11: MOAZZAM ALI Sampling. How we select from an infinite number of observations we could possibly make Why do we sample? – Size of the population – Cost of

Stratified Sampling

The chosen sample contains a number of distinct categories which are organized into segments, or strata– equalizing "important" variables

• year in school, geographic area, product use, etc.

Steps:– Population is divided into mutually exclusive and

exhaustive strata based on an appropriate population characteristic. (e.g. race, age, gender etc.)

– Simple random samples are then drawn from each stratum.

Page 12: MOAZZAM ALI Sampling. How we select from an infinite number of observations we could possibly make Why do we sample? – Size of the population – Cost of

Stratified Sampling

Page 13: MOAZZAM ALI Sampling. How we select from an infinite number of observations we could possibly make Why do we sample? – Size of the population – Cost of

Stratified Sampling

The sample size is usually proportional to the relative size of the strata.

Ensures that particular groups (e.g. males and females) within a population are adequately represented in the sample

Has a smaller sampling error than simple random sample since a source of variation is eliminated

Page 14: MOAZZAM ALI Sampling. How we select from an infinite number of observations we could possibly make Why do we sample? – Size of the population – Cost of

Cluster Sampling

The Population is divided into mutually exclusive and exhaustive subgroups, or clusters, usually based on geography or time period

Each cluster should be representative of the population i.e. be heterogeneous.

Means between clusters should be the same (homogeneous)

Then a sample of the clusters is selected. then some randomly chosen units in the selected

clusters are studied.

Page 15: MOAZZAM ALI Sampling. How we select from an infinite number of observations we could possibly make Why do we sample? – Size of the population – Cost of

Cluster Sampling

divide population into clusters (usually along geographic boundaries)

randomly sample clusters

measure units within sampled clusters

Page 16: MOAZZAM ALI Sampling. How we select from an infinite number of observations we could possibly make Why do we sample? – Size of the population – Cost of

Non-probability Sampling

Members selected not according to logic of probability (or mathematical rules), but by other means (e.g. convenience, or access)

Types of Non-Probability Sampling

convenience sampling judgement sampling snowball sampling quota sampling

Page 17: MOAZZAM ALI Sampling. How we select from an infinite number of observations we could possibly make Why do we sample? – Size of the population – Cost of

Convenience Sampling

Convenience SamplingA researcher's convenience forms the basis for selecting a sample. people in my classes Mall intercepts People with some specific characteristic (e.g. bald)

Page 18: MOAZZAM ALI Sampling. How we select from an infinite number of observations we could possibly make Why do we sample? – Size of the population – Cost of

Purposive Sampling

Select the sample on the basis of knowledge of the population: your own knowledge, or use expert judges to identify candidates to select

Typically used for very rare populations, such as deviant cases.

Page 19: MOAZZAM ALI Sampling. How we select from an infinite number of observations we could possibly make Why do we sample? – Size of the population – Cost of

Snowball Sampling

Research design - sampling 19

Typically used in qualitative researchWhen members of a population are difficult to

locate, for covert sub-populations, non-cooperative groups

Recruit one respondent, who identifies others, who identify others,….

Primarily used for exploratory purposes

Page 20: MOAZZAM ALI Sampling. How we select from an infinite number of observations we could possibly make Why do we sample? – Size of the population – Cost of

Quota Sampling

Research design - sampling 20

• A stratified convenience sampling strategy• Begins with a table that describes the

characteristics of the target population– e.g. the composition of postgraduate students at UCT in

terms of faculty, race, and gender• Then select on a convenience basis, postgraduate

students in the same proportions regarding faculty, race, and gender than in the population

• Of course, the quota frame (the proportions in the table) must be accurate

• And biases may be introduced when selecting elements to study

Page 21: MOAZZAM ALI Sampling. How we select from an infinite number of observations we could possibly make Why do we sample? – Size of the population – Cost of

Thank You