sample size planning, calculation, and...

33
Sample Size Planning, Calculation, and Justification Meridith Blevins, MS Vanderbilt University Department of Biostatistics [email protected] http://biostat.mc.vanderbilt.edu/MeridithBlevins Meridith Blevins, MS (Vandy Biostats) Sample Size 1 / 33

Upload: lehuong

Post on 17-May-2018

231 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Sample Size Planning, Calculation, and Justificationbiostat.mc.vanderbilt.edu/wiki/pub/Main/BlevinsMPH/SAMPLE_SIZE.pdfSample Size Planning, Calculation, and Justi cation Meridith Blevins,

Sample Size Planning,

Calculation, and Justification

Meridith Blevins, MS

Vanderbilt UniversityDepartment of Biostatistics

[email protected]://biostat.mc.vanderbilt.edu/MeridithBlevins

Meridith Blevins, MS (Vandy Biostats) Sample Size 1 / 33

Page 2: Sample Size Planning, Calculation, and Justificationbiostat.mc.vanderbilt.edu/wiki/pub/Main/BlevinsMPH/SAMPLE_SIZE.pdfSample Size Planning, Calculation, and Justi cation Meridith Blevins,

Introduction

. After you’ve decided what and whom you’re going to study and thedesign to be used, you must decide how many ‘subjects’ to sample.

Even the most rigorously executed study may fail to answer itsresearch question if the sample size is too small.

If the sample size is too large, the study will be more difficultand costly than necessary while unnecessarily exposing a numberof ‘subjects’ to possible harm.

. Goal: to estimate an appropriate number of ‘subjects’ for a givenstudy design.

ie, the number needed to find the results you’re looking for.

Meridith Blevins, MS (Vandy Biostats) Sample Size 2 / 33

Page 3: Sample Size Planning, Calculation, and Justificationbiostat.mc.vanderbilt.edu/wiki/pub/Main/BlevinsMPH/SAMPLE_SIZE.pdfSample Size Planning, Calculation, and Justi cation Meridith Blevins,

Introduction, cont’d

. Only as accurate as the data and estimates on which they arebased, which are often just informed guesses.. Often reveals that the research design is not feasible or thatdifferent predictor or outcome variables are needed.. TAKE HOME MESSAGE: Sample size should be estimated early inthe design phase of the study, when major changes are still possible.

. In addition to the statistical analysis plan, the sample size sectionis critical to an IRB proposal and any kind of grant.

42% of R01s examined in one review paper were criticized forthe sample size justifications or analysis plans.1

1Inouye & Fiellin, “An Evidence-Based Guide to Writing Grant Proposals for Clinical Research”, Annals of

Internal Medicine, 142.4 (2005): 274-282.

Meridith Blevins, MS (Vandy Biostats) Sample Size 3 / 33

Page 4: Sample Size Planning, Calculation, and Justificationbiostat.mc.vanderbilt.edu/wiki/pub/Main/BlevinsMPH/SAMPLE_SIZE.pdfSample Size Planning, Calculation, and Justi cation Meridith Blevins,

Underlying principles

. Research hypothesis:

Specific version of the research question that summarizes themain elements of the study – the sample, and the predictor andoutcome variables – in a form that establishes the basis for thestatistical hypothesis tests.2

Should be simple (ie, contain one predictor and one outcomevariable); specific (ie, leave no ambiguity about the subjects andvariables or about how the statistical hypothesis will be applied);and stated in advance.

Example: Use of tricyclic antidepressant medications, assessedwith pharmacy records, is more common in patients hospitalizedwith an admission of myocardial infarction at Longview Hospitalin the past year than in controls hospitalized for pneumonia.

2NOTE: Hypotheses are not needed for descriptive studies – more to come.

Meridith Blevins, MS (Vandy Biostats) Sample Size 4 / 33

Page 5: Sample Size Planning, Calculation, and Justificationbiostat.mc.vanderbilt.edu/wiki/pub/Main/BlevinsMPH/SAMPLE_SIZE.pdfSample Size Planning, Calculation, and Justi cation Meridith Blevins,

Underlying principles, cont’d

. Null hypothesis:

Formal basis for testing statistical significance; states that thereis no association, difference, or effect.

eg, Alcohol consumption (in mg/day) is not associated with arisk of proteinuria (>300 mg/day) in patients with diabetes.

. Alternative hypothesis:

Proposition of an association, difference, effect.

Can be one-sided (ie, specifies a direction).

eg, Alcohol consumption is associated with an increased risk ofproteinuria in patients with diabetes.

However, most often two-sided – no direction mentioned.

Expected by most reviewers; very critical of a one-sided.

Meridith Blevins, MS (Vandy Biostats) Sample Size 5 / 33

Page 6: Sample Size Planning, Calculation, and Justificationbiostat.mc.vanderbilt.edu/wiki/pub/Main/BlevinsMPH/SAMPLE_SIZE.pdfSample Size Planning, Calculation, and Justi cation Meridith Blevins,

1-sided versus 2-sided

. In general a one sided test is appropriate when a large difference inone direction would lead to the same action as no difference at all.

eg, Test the null hypothesis that the conception rate afterlaparoscopy was less than or equal to that before. The one-sidedalternative hypothesis is that the conception rate afterlaparoscopy was higher than that before.

. Expectation of a difference in a particular direction is not adequatejustification.TAKE HOME: Two-sided tests should be used unless there is a verygood reason for doing otherwise. If one-sided tests are to be used,the direction of the test must be specified in advance. One-sidedtests should never be used simply as a device to make aconventionally non-significant difference significant.

1Bland J.M. and D.G. Bland. Statistics Notes: One and two sidedtests of significance. BMJ, 1994; 309:248.

Meridith Blevins, MS (Vandy Biostats) Sample Size 6 / 33

Page 7: Sample Size Planning, Calculation, and Justificationbiostat.mc.vanderbilt.edu/wiki/pub/Main/BlevinsMPH/SAMPLE_SIZE.pdfSample Size Planning, Calculation, and Justi cation Meridith Blevins,

Underlying principles, cont’d

. General process used in hypothesis testing:

Presume the null hypothesis (eg, no association between thepredictor and outcome variables in the population).

Based on the data collected in the sample, use statistical teststo determine whether there is sufficient evidence to reject thenull hypothesis in favor of the alternative hypothesis (eg, there isan association in the population).

. Reaching a wrong conclusion:

Type I error: false-positive; rejecting the null hypothesis that isactually true in the population.

Type II error: false-negative; failing to reject the nullhypothesis that is actually not true in the population.

Neither can be avoided entirely.

Meridith Blevins, MS (Vandy Biostats) Sample Size 7 / 33

Page 8: Sample Size Planning, Calculation, and Justificationbiostat.mc.vanderbilt.edu/wiki/pub/Main/BlevinsMPH/SAMPLE_SIZE.pdfSample Size Planning, Calculation, and Justi cation Meridith Blevins,

Underlying principles, cont’d

. Effect size:

Size of the association/difference/effect you expect/wish to bepresent in the sample.

Selecting an appropriate size is most difficult aspect of samplesize planning.

REMEMBER: Sample size calculation only as accurate as thedata/estimates on which they are based.

Find data from prior studies to make an informed guess – needsto be as similar as possible to what you expect to see in yourstudy.Pilot study/studies sometimes needed first.

Good rule of thumb: choose the smallest effect size that wouldbe clinically meaningful (and you would hate to miss).

Will be okay if true effect size ends up being larger.

Meridith Blevins, MS (Vandy Biostats) Sample Size 8 / 33

Page 9: Sample Size Planning, Calculation, and Justificationbiostat.mc.vanderbilt.edu/wiki/pub/Main/BlevinsMPH/SAMPLE_SIZE.pdfSample Size Planning, Calculation, and Justi cation Meridith Blevins,

Underlying principles, cont’d

. Establish the maximum chance that you will tolerate of makingwrong conclusions:

α: probability of committing a type I error; aka ‘level ofstatistical significance’.

β: probability of making a type II error.

Power: 1− β; probability of correctly rejecting the nullhypothesis in the sample if the actual effect in the population isequal to (or greater than) the effect size.

Aim: choose a sufficient number of ‘subjects’ to keep α and β atan acceptably low level without making the study unnecessarilylarge (ie, expensive or difficult).

α and β decrease as sample size increases.

Often α = 0.05; β = 0.20, 0.10 (power = 80, 90%).

Meridith Blevins, MS (Vandy Biostats) Sample Size 9 / 33

Page 10: Sample Size Planning, Calculation, and Justificationbiostat.mc.vanderbilt.edu/wiki/pub/Main/BlevinsMPH/SAMPLE_SIZE.pdfSample Size Planning, Calculation, and Justi cation Meridith Blevins,

Additional considerations

. Variability of the effect size:

Statistical tests depend on being able to show a differencebetween the groups being compared.

The greater the variability (spread) in the outcome variableamong the subjects, the more likely it is that the values in thegroups will overlap, and the more difficult it will be todemonstrate an overall difference between them.

Use the most precise measurements/variables possible.

. Often have >1 hypothesis, but should specify a single primaryhypothesis for sample size planning.

Helps to focus the study on its main objective and provides aclear basis for the main sample size calculation.

Useful to rank other research questions/specific aims assecondary, etc.

Meridith Blevins, MS (Vandy Biostats) Sample Size 10 / 33

Page 11: Sample Size Planning, Calculation, and Justificationbiostat.mc.vanderbilt.edu/wiki/pub/Main/BlevinsMPH/SAMPLE_SIZE.pdfSample Size Planning, Calculation, and Justi cation Meridith Blevins,

Calculating sample size

. Specific method used depends on

The specific aim(s)/objective(s).

The study design, including the planned number ofmeasurements per ‘subject’.

The outcome(s) and predictor(s).

The proposed statistical analysis plan.

. Will also need to consider:

Accrual/Enrollment (response rate for questionnaires).

Drop-outs (ie, lost to follow-up) and missing data.

Budgetary constraints.

. Requires you to make assumptions.

Assume specific effect size (variability), α, power, etc.

Meridith Blevins, MS (Vandy Biostats) Sample Size 11 / 33

Page 12: Sample Size Planning, Calculation, and Justificationbiostat.mc.vanderbilt.edu/wiki/pub/Main/BlevinsMPH/SAMPLE_SIZE.pdfSample Size Planning, Calculation, and Justi cation Meridith Blevins,

Calculating sample size for analytic studies

. Often want to show a significant difference/association between 2groups.

. Most traditional ‘recipe’ in this case:

1 State the null and 1- or 2-sided alternative hypothesis.

2 Select the appropriate statistical test based on the type ofpredictor and outcome variables in the hypotheses.

3 Choose a reasonable effect size (and variability, if necessary).

4 Specify α and power.

5 Use an appropriate table, formula, or software program toestimate the sample size.

. Most common used statistical tests for comparing 2 groups:

The t-test.

The Chi-squared test.Meridith Blevins, MS (Vandy Biostats) Sample Size 12 / 33

Page 13: Sample Size Planning, Calculation, and Justificationbiostat.mc.vanderbilt.edu/wiki/pub/Main/BlevinsMPH/SAMPLE_SIZE.pdfSample Size Planning, Calculation, and Justi cation Meridith Blevins,

Calculating sample size for analytic studies, cont’d

. The t-test:

Commonly used to determine whether the mean value of acontinuous outcome variable in one group differs significantlyfrom that in another group.

Assumes that the distribution of the variables in each of the 2groups is approximately normal (bell-shaped).

Assumptions:Whether the 2 groups are paired or independent.

Example ‘paired’: Comparing the mean BMI of ‘subjects’ beforeand after a weight loss program.Example ‘independent’: Comparing the mean depression score in‘subjects’ treated with 2 different antidepressants.

The mean value of the variable in each group.

Calculation actually uses the ‘effect size’ (the difference in themean values between the 2 groups).

Meridith Blevins, MS (Vandy Biostats) Sample Size 13 / 33

Page 14: Sample Size Planning, Calculation, and Justificationbiostat.mc.vanderbilt.edu/wiki/pub/Main/BlevinsMPH/SAMPLE_SIZE.pdfSample Size Planning, Calculation, and Justi cation Meridith Blevins,

Calculating sample size for analytic studies, cont’d

. The t-test, cont’d:

Assumptions, cont’d:The standard deviation (SD) of the variable.

If 2 groups are paired: SD of the difference in the variablebetween matched pairs.If 2 groups are independent: SD of the variable itself.

Rules of thumb:

Smaller sample size needed for paired groups – SD of thedifference in a variable usually smaller than the SD of a variable.Sample size decreases as the difference in the mean valuesincreases (holding SD constant).Sample size increases as SD increases (holding the difference inthe mean values constant).

Meridith Blevins, MS (Vandy Biostats) Sample Size 14 / 33

Page 15: Sample Size Planning, Calculation, and Justificationbiostat.mc.vanderbilt.edu/wiki/pub/Main/BlevinsMPH/SAMPLE_SIZE.pdfSample Size Planning, Calculation, and Justi cation Meridith Blevins,

Calculating sample size for analytic studies, cont’d

. Example using the t-test:

Research question: Is there a difference in the efficacy of salbutamol andipratropium bromide for the treatment of asthma?

Planned study: randomized trial of the effect of these drugs on FEV1

(forced expiratory volume in 1 second) after 2 weeks of treatment.

Previous data: mean FEV1 in persons with asthma treated with ipratropiumwas 2.0 liters, with a SD of 1.0 liter.

Wish: to be able to detect a difference of ≥10% in mean FEV1 between the2 treatment groups.

Assumptions: α (two-sided) = 0.05; power = 0.80; effect size = 0.2 liters(10% X 2.0 liters); SD = 1.0 liter.

Calculation: A sample size of 394 patients per group is needed to detect adifference of ≥10% in mean FEV1 between the 2 (independent) treatmentgroups with 80% power, using a two-sample t-test and assuming a(two-sided) α of 0.05, a mean FEV1 of 2.0 liters in the ipratropium group,and a SD of 1.0 liter.

Meridith Blevins, MS (Vandy Biostats) Sample Size 15 / 33

Page 16: Sample Size Planning, Calculation, and Justificationbiostat.mc.vanderbilt.edu/wiki/pub/Main/BlevinsMPH/SAMPLE_SIZE.pdfSample Size Planning, Calculation, and Justi cation Meridith Blevins,

Calculating sample size for analytic studies, cont’d

. The Chi-square-test:

Commonly used to determine whether the proportion of‘subjects’ who have a binary outcome variable in one groupdiffers significantly from that in another group.

Assumptions:

Whether the 2 groups are matched/paired or independent.The proportion with the outcome in each group.

Rule of thumb: both the value of the proportions and thedifference between them affect sample size – sample size muchlarger for small proportions.

Can also calculate assuming a relative risk (instead of 2proportions).

Meridith Blevins, MS (Vandy Biostats) Sample Size 16 / 33

Page 17: Sample Size Planning, Calculation, and Justificationbiostat.mc.vanderbilt.edu/wiki/pub/Main/BlevinsMPH/SAMPLE_SIZE.pdfSample Size Planning, Calculation, and Justi cation Meridith Blevins,

Calculating sample size for analytic studies, cont’d

. Example using the Chi-square-test:

Research question: Is there a difference in the adherence to antiretroviralmedication between HIV patients enrolled to standard of care (SOC) andintervention groups?

Previous data: the 1-year adherence to ART is about 0.60 in HIV patientson SOC.

Wish: to determine that the 1-year adherence is ≥0.75 in HIV patients onintervention.

Assumptions: α (two-sided) = 0.05; power = 0.80; P1 (1-year adherence inSOC) = 0.60; P2 (1-year adherence in intervention) = 0.75.

Calculation: A sample size of 152 HIV patients on SOC and 152 HIVpatients on intervention is needed to determine that the 1-year adherenceto ART is ≥0.75 in intervention group with 80% power, using anChi-square test and assuming a (two-sided) α of 0.05 and a 1-yearadherence to ART of 0.60 in SOC group.

Meridith Blevins, MS (Vandy Biostats) Sample Size 17 / 33

Page 18: Sample Size Planning, Calculation, and Justificationbiostat.mc.vanderbilt.edu/wiki/pub/Main/BlevinsMPH/SAMPLE_SIZE.pdfSample Size Planning, Calculation, and Justi cation Meridith Blevins,

Calculating sample size for analytic studies, cont’d

. Alternative recipes – useful when sample size is fixed or thenumber of subjects who are available or affordable is limited:

Calculate the power for a given effect and sample size.

Calculate the effect size for a given power and sample size.Good general rule: always assume ≥80% power.

. Can also calculate the sample size required to determine whetherthe correlation coefficient between 2 continuous variables is significant(ie, significantly differs from 0) – see ‘Designing Clinical Research’.

. ‘Adjusting for covariates’: when designing a study, you may decidethat ≥1 variable will confound the association between the predictorand outcome, and plan to use regression analysis to adjust for theseconfounders.

Calculated sample size will need to be increased (‘10:1 rule’) –see a statistician.

Meridith Blevins, MS (Vandy Biostats) Sample Size 18 / 33

Page 19: Sample Size Planning, Calculation, and Justificationbiostat.mc.vanderbilt.edu/wiki/pub/Main/BlevinsMPH/SAMPLE_SIZE.pdfSample Size Planning, Calculation, and Justi cation Meridith Blevins,

Calculating sample size for descriptive studies

. Usually do not involve hypotheses; goal is to calculate descriptivestatistics (eg, means and proportions).

. Approach: calculate sample size required to estimate a confidenceinterval (CI) of a specified confidence level (eg, 95%) and total width(ie, precision).

For a continuous variable: interested in the CI around the meanvalue of that variable.

For a binary variable: interested in the CI around the estimatedproportion of ‘subjects’ with one of the values.

. Rules of thumb:

A larger sample size is needed for a ‘tighter’ (ie, smaller totalwidth; more precise) CI of any confidence level.

For a given total width, a larger sample size is needed for a CIwith a higher confidence level.

Meridith Blevins, MS (Vandy Biostats) Sample Size 19 / 33

Page 20: Sample Size Planning, Calculation, and Justificationbiostat.mc.vanderbilt.edu/wiki/pub/Main/BlevinsMPH/SAMPLE_SIZE.pdfSample Size Planning, Calculation, and Justi cation Meridith Blevins,

Sample size for descriptive studies, cont’d

. Assumptions made for calculating the sample size:

For a continuous variable: the standard deviation of the variable.

For a binary variable: the expected proportion with the variableof interest in the population.

If more than half of the population is expected to have thecharacteristic, then plan the sample size based on theproportion of expected not to have the characteristic.

For both:

(1) the desired precision (total width) of the CI.(2) the confidence level for the interval (eg, 95 or 99%).

. Example: A sample size of 166 ‘subjects’ is needed to estimate themean IQ among 3rd graders in an urban area with a 99% CI of ±3points (ie, a total with of 6 points), assuming a standard deviation of15 points.

Meridith Blevins, MS (Vandy Biostats) Sample Size 20 / 33

Page 21: Sample Size Planning, Calculation, and Justificationbiostat.mc.vanderbilt.edu/wiki/pub/Main/BlevinsMPH/SAMPLE_SIZE.pdfSample Size Planning, Calculation, and Justi cation Meridith Blevins,

Sample size for descriptive studies, cont’d

. IMPORTANT: descriptive studies of binary variables includestudies of the sensitivity and specificity of a diagnostic test.

Goal of a diagnostic test (or procedure): to correctly classify‘subjects’ as having or not having a ‘disease’ (or symptom).

Sensitivity: proportion of true positives; probability of testingpositive when you truly have the ‘disease’.

Specificity: proportion of true negatives; probability of testingnegative when you truly do not have the ‘disease’.

Example: A sample size of 246 ‘diseased subjects’ is needed toestimate a 95% CI for the sensitivity of a new diagnostic test forpancreatic cancer, assuming 80±5% of the patients withpancreatic cancer will have positive tests.

When studying the specificity of a test, must estimate therequired number of ‘subjects’ who do not have the ‘disease’.

Meridith Blevins, MS (Vandy Biostats) Sample Size 21 / 33

Page 22: Sample Size Planning, Calculation, and Justificationbiostat.mc.vanderbilt.edu/wiki/pub/Main/BlevinsMPH/SAMPLE_SIZE.pdfSample Size Planning, Calculation, and Justi cation Meridith Blevins,

Sample Size Software Programs

. PS: Power and Sample Size Calculation

Free from http://biostat.mc.vanderbilt.edu/PowerSampleSize.

Available for Windows and can run on Linux using Wine.

Handles several common analyses including t-test, Chi-squaretest (‘Dichotomous’), and Survival.

Can generate graphs (eg, power vs sample size) and keeps ‘log’.

. Simulations can also be done for any statistical technique.

Most valuable for complex analyses, such as mixed effects orGEE models – statistician (most likely) needed.

Meridith Blevins, MS (Vandy Biostats) Sample Size 22 / 33

Page 23: Sample Size Planning, Calculation, and Justificationbiostat.mc.vanderbilt.edu/wiki/pub/Main/BlevinsMPH/SAMPLE_SIZE.pdfSample Size Planning, Calculation, and Justi cation Meridith Blevins,

Additional thoughts/considerations

. Consider strategies for minimizing sample size and maximizingpower, which include using

Continuous variables,

Paired measurements,

Unequal group sizes, and

A more common (ie, prevalent) binary outcome.

. Useful to calculate (and report) a range of sample sizes byassuming different combinations of parameter values – take thelargest sample size to ‘cover all bases’.. Always justify the feasibility of the calculated sample size.

How long would it take to accrue/enroll the subjects?

Need to consider the source of subjects, the inclusion/exclusioncriteria, the prevalence of the outcome, etc.

Meridith Blevins, MS (Vandy Biostats) Sample Size 23 / 33

Page 24: Sample Size Planning, Calculation, and Justificationbiostat.mc.vanderbilt.edu/wiki/pub/Main/BlevinsMPH/SAMPLE_SIZE.pdfSample Size Planning, Calculation, and Justi cation Meridith Blevins,

What do I include in my sample size write-up?

. Key: state all the information assumed such that anyone readingyour ‘Sample Size section’ would be able to re-calculate the samplesize given. This includes

The (primary) specific aim.

The outcome and predictor variable(s).

Primary comparison of interest (if applicable).

Parameter estimates (ie, α, power, effect size, variability, etc).

Data on which you based your assumptions.

Statistical test used (if applicable).

. Including graphs can be very helpful.

. Show the reviewers that you have solid reasoning behind yourcalculations (as well as your statistical analysis plan).

Acknowledge whether study will be a pilot or feasibility study.

Meridith Blevins, MS (Vandy Biostats) Sample Size 24 / 33

Page 25: Sample Size Planning, Calculation, and Justificationbiostat.mc.vanderbilt.edu/wiki/pub/Main/BlevinsMPH/SAMPLE_SIZE.pdfSample Size Planning, Calculation, and Justi cation Meridith Blevins,

Take home message

. Sample size justification is an essential part of every researchstudy, and thus any IRB proposal or grant application.

. Sample size should be estimated early in the design phase of thestudy, when major changes are still possible.

. References & acknowledgments:

Designing Clinical Research (3rd edition) by Hulley, et al.

Terri Scott, Ayumi Shintani & Jennifer Thompson.

Meridith Blevins, MS (Vandy Biostats) Sample Size 25 / 33

Page 26: Sample Size Planning, Calculation, and Justificationbiostat.mc.vanderbilt.edu/wiki/pub/Main/BlevinsMPH/SAMPLE_SIZE.pdfSample Size Planning, Calculation, and Justi cation Meridith Blevins,

Common Values for Critical Regions

Z1−α/2 ≈ 1.96, when α = 0.05Z1−α/2 ≈ 2.58, when α = 0.01Z1−β ≈ 0.84, when β = 0.20Z1−β ≈ 1.28, when β = 0.10

Meridith Blevins, MS (Vandy Biostats) Sample Size 26 / 33

Page 27: Sample Size Planning, Calculation, and Justificationbiostat.mc.vanderbilt.edu/wiki/pub/Main/BlevinsMPH/SAMPLE_SIZE.pdfSample Size Planning, Calculation, and Justi cation Meridith Blevins,

Precision of Estimate

Estimating a Proportion:

N =(Z2

1−α/2)p(1−p)

D2

Estimating a Mean:

N =(Z2

1−α/2)(σ2/n)

D2

D, half-length of confidence interval (est ± D)

Meridith Blevins, MS (Vandy Biostats) Sample Size 27 / 33

Page 28: Sample Size Planning, Calculation, and Justificationbiostat.mc.vanderbilt.edu/wiki/pub/Main/BlevinsMPH/SAMPLE_SIZE.pdfSample Size Planning, Calculation, and Justi cation Meridith Blevins,

Detecting a Difference

Difference of (Independent) Proportions:

N =(Z1−α/2 + Z1−β)2(p1(1− p1) + p2(1− p2))

(p1 − p2)2

Difference of (Independent) Means:

N =(Z1−α/2 + Z1−β)2(σ2

1 + σ22)

(µ1 − µ2)2

Meridith Blevins, MS (Vandy Biostats) Sample Size 28 / 33

Page 29: Sample Size Planning, Calculation, and Justificationbiostat.mc.vanderbilt.edu/wiki/pub/Main/BlevinsMPH/SAMPLE_SIZE.pdfSample Size Planning, Calculation, and Justi cation Meridith Blevins,

Diagnostic Tests

Accuracy of One Test (same as estimating a proportion):

N =(Z2

1−α/2)Se(1−Se)

D2

Accuracy of Two Tests1: The null and alternative hypotheses are:

H0 : ϑ1 = ϑ2

Ha : ϑ1 6= ϑ2

where ϑ1 is the diagnostic acccuracy of test 1 and ϑ2 is thediagnostic accuracy of test 2. The presumed value of the differencein sensitivity is denoted as ∆1.

N =[Z1−α/2

√V0(ϑ1 − ϑ2) + Z1−β

√VA(ϑ1 − ϑ2)]2

(∆1)2

1Zhou X-H, Obuchowski NA, McClish DK. Statistical Methods inDiagnostic Medicine. New York, NY: Wiley; 2002.

Meridith Blevins, MS (Vandy Biostats) Sample Size 29 / 33

Page 30: Sample Size Planning, Calculation, and Justificationbiostat.mc.vanderbilt.edu/wiki/pub/Main/BlevinsMPH/SAMPLE_SIZE.pdfSample Size Planning, Calculation, and Justi cation Meridith Blevins,

Diagnostic Tests, cont’d

With a paired-study design, the variance functions under the null andalternative hypotheses are given by:

V0(Se1 − Se2) = ψ VA(Se1 − Se2) = ψ −∆12

whereψ = Se1 + Se2 − 2× Se2 × P(T1 = 1|T2 = 1)

Se1 and Se2 are the presumed values of sensitivity from thealternative hypothesis and P(T1 = 1|T2 = 1) is the probability thatthe test 1 is positive given that test 2 is positive. The value of ψranges from ∆1 (perfect correlation of test results) toSe1 × (1− Se2) + (1− Se1)× Se2 (zero correlation).

Meridith Blevins, MS (Vandy Biostats) Sample Size 30 / 33

Page 31: Sample Size Planning, Calculation, and Justificationbiostat.mc.vanderbilt.edu/wiki/pub/Main/BlevinsMPH/SAMPLE_SIZE.pdfSample Size Planning, Calculation, and Justi cation Meridith Blevins,

Cluster Randomized Trials

Our objective is to compare the population proportions forintervention and control groups of randomized clusters. Supposethere are n study subjects in each cluster, c .

c = 1+(zα/2+zβ)2[π0(1−π0)/n+π1(1−π1)/n+km2(π0

2+π12)]/(π0−π1)2

where π1 and π0 are the true proportion for intervention and controlgroups, and km is the coefficient of variation.Clusters (eg, communities) are matched on the basis of factors thatare expected to be correlated with the main study outcomes, with theaim of minimizing the degree of between-cluster variation withinmatched pairs. Then c , the number of clusters required, is given by:

c = 2+(zα/2+zβ)2[π0(1−π0)/n+π1(1−π1)/n+km2(π0

2+π12)]/(π0−π1)2

1Hayes RJ, Bennett S. Simple sample size calculation forcluster-randomized trials. Int J Epidemiol 1998;28:319-326.

Meridith Blevins, MS (Vandy Biostats) Sample Size 31 / 33

Page 32: Sample Size Planning, Calculation, and Justificationbiostat.mc.vanderbilt.edu/wiki/pub/Main/BlevinsMPH/SAMPLE_SIZE.pdfSample Size Planning, Calculation, and Justi cation Meridith Blevins,

Cohort and Case-control

Depends on the outcome, see reference.

1Schlesselman, JJ. Sample size requirements in cohort andcase-control studies of disease. Am J Epidemiol 1974;9:6:381-384.

Meridith Blevins, MS (Vandy Biostats) Sample Size 32 / 33

Page 33: Sample Size Planning, Calculation, and Justificationbiostat.mc.vanderbilt.edu/wiki/pub/Main/BlevinsMPH/SAMPLE_SIZE.pdfSample Size Planning, Calculation, and Justi cation Meridith Blevins,

The 10-20 Rule

A fitted regression model is likely to be reliable when the number ofpredictors is less than m/10 or m/20 where m is the ‘limiting samplesize’.

Table: Limiting Sample Sizes for Various Response Variables

Type of Response Variable Limiting Sample Size mContinuous n (total sample size)Binary min(n1,n2)

Ordinal (k categories) n − 1n2

∑ki=1 n3

i

Failure (survival) time number of failures

1Harrell, FE. Regression Modeling Strategies. New York, NY: Springer;2001.

Meridith Blevins, MS (Vandy Biostats) Sample Size 33 / 33