Download - Introduction to Sample Size Estimation
![Page 1: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/1.jpg)
Introduction toSample Size Estimation
Jaranit KaewkungwalDepartment of Tropical Hygiene,
Faculty of Tropical Medicine, Mahidol University
![Page 2: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/2.jpg)
Topics• 6 Aug 2015 – Sample Size for:
– Parameter Estimation (Descriptive studies)– Hypothesis Testing (Analytic studies)
• Generic formula• Observational studies (X-section / Case Control / Cohort)• Experimental studies (Clinical Trials )
SW: N4Studies (mobile phone Apps)
• 7 Aug 2015 – Sample Size for:– Regressions (Linear / Logistic / Poisson/Cox)– Repeated Measures– Power analysis– Effect size EstimationSW: GPower3 (computer) & PS (computer)
![Page 3: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/3.jpg)
Population & Sample
![Page 4: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/4.jpg)
Sample vs. Population
![Page 5: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/5.jpg)
Sample vs. Population
![Page 6: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/6.jpg)
Sample Specification· Inclusion Criteria
Specifying the characteristics that definepopulations that are relevant to the researchquestion and efficient for the study:· Demographic characteristics· Clinical characteristics· Geographic (administrative) characteristics· Temporal characteristics· Exclusion Criteria
Specifying subset of the population that will not bestudied because of:· high likelihood of being lost to follow-up· unable to provide good/complete data· ethical barriers· subject’s refusal to participate
![Page 7: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/7.jpg)
Methods of Sampling• Probability Sampling -- methods that utilizes some form of random selection
1. Simple Random Sampling2. Stratified Random Sampling3. Systematic Random Sampling4. Cluster (Area) Sampling5. Multi-stage Sampling
• Non-probability Sampling - methods that based on either accidental or purposive; usually approach the sampling problem with a specific plan in mind.
1. Accidental Sampling2. Purposive Sampling
2.1 Expert Sampling2.2 Quota Sampling2.3 Heterogeneity Sampling2.4 Snowball Sampling
ChanceEPS: Equal Probability of
Selection PPS: Proportionate to Size
RelevancyRepresentativeness
Specific Characteristics
![Page 8: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/8.jpg)
Basic ofSample Size Estimation
![Page 9: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/9.jpg)
Important questions in SS estimation
What is the key outcome of interest (primary objective(s)) which is to be evaluated statistically?
How will the key outcome be measured?
What kind of study does one have?
Are there explicit or implicit dependencies in the data which need to be accounted for?
Cured/Not Cured, BP, Survival time of patient, No. of E. coli, …( Categorical, Continuous, or Time-to-event )
Rate, Percent, Prevalence, Incidence, Proportion, Mean, Median, etc.(Proportion / Percent/ Probability or Mean)
Descriptive (Parameter estimation), Analytic (Hypothesis testing)
Finite / Infinite Population, Fixed /Limited Sample, Ratio of groupsCompleteness, Non-responses, Follow-up rate, Screening etc.
![Page 10: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/10.jpg)
•Types of Observational Study• Descriptive(Parameter Estimation)
- Cross-sectional• Analytic
(Hypothesis Testing)- Cross-sectional- Case-Control- Cohort
• Types of Experimental Study• True Experimental
- Randomize Control Trial (RCT)• Quasi Experimental
D D
E a b m
E c d n
o p N
Coh
ort
Case-Control
RC
T
• Other Types of Medical Research Study• Diagnosis• Prognostic Factor Study
Types of Study Design
![Page 11: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/11.jpg)
11
Types of Statistics
• By Level of Generalization– Descriptive Statistics– Inferential Statistics
• Parameter Estimation• Hypothesis Testing
– Comparison– Association– Multivariable data analysis
• By Level of Underlying Distribution– Parametric Statistics– Non-parametric Statistics
Sampling Techniques
Generalization/Inferential Statistics
![Page 12: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/12.jpg)
Normal Distribution
12
![Page 13: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/13.jpg)
Elements in sample size estimation
• A priori information about parameters (key outcomes) of interest
• Precision (in parameter estimation) Effect size (in hypothesis testing)
• Confidence level (in parameter estimation) Tail of the test (in hypothesis testing)
• Type I error (α) (in parameter estimation) Type I (α) & Type II (β) errors (in hypothesis testing)
![Page 14: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/14.jpg)
• Source of a priori information about parameters of interest– Literature Review
– Pilot Study
– Expert Opinion
From previous report, it was shown that cure rate of Drug A = 70%
A pilot survey from 30 bottles of drinking water in the market shows that there are E. coli in 5 bottles.
3 out of 5 experts say that about 10% of workers in the XXX factory have health problem related to toxic chemicals.
Priori Information
![Page 15: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/15.jpg)
Priori information
• Example: a priori information about parameters of interest – previous survey (baseline)
![Page 16: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/16.jpg)
• Example: a priori information about parameters of interest – previous studies
Priori Information
Results from various experiments studying the effects of zinc supplements on diarrhea in children.
http://www.stat.columbia.edu/~gelman/stuff_for_blog/chap20.pdf
![Page 17: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/17.jpg)
• Example: a priori information about parameters (primary outcome) of interestDefinition of Pimary Outcome: PIDTenderness: abdominal direct, motion of cervix and uterus, andGC+ or fever > 38°C or leucocytosis >10,000 WBC/ml or purulent material from peritoneal cavity on culdocentesis or pelvixabscess or inflammatory complex on bimanual exam
Estimating the Incidence of PID for Sample Size Calculations• Government officials estimated 40%• Ob/GYN from Med School estimated 12%• Pilot study found 4% We conservatively set initially at 6%
Priori Information
![Page 18: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/18.jpg)
Precision (for descriptive survey study)
• What is “precision”?– Magnitude around the estimated statistics
regarding the true population parameter – Not Statistical Significance
Cohen (1988) defines the statistical precision of a samplestatistic as "the closeness with which it can be expected toapproximate the relevant population value. It is necessarily anestimated value in practice, since the population value isgenerally unknown" (Cohen, 1988,). This precision is usuallyestimated using a standard error, that is, the amount of chancefluctuation (or lack of precision) we can expect in sampleestimates. We can use the standard error as an estimate of theprecision of a statistic in two ways: descriptively or inferentially.
Source: Sample size and statistical precision. By James Dean Brown (University of Hawai'i at Manoa)
![Page 19: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/19.jpg)
• What is “effect size”?– Clinical/ Public Health Importance – Not Statistical Significance
Effect Size (for analysis study)
Evie McCrum-Gardner. International Journal of Therapy and Rehabilitation, January 2010, Vol 17, No 1
![Page 20: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/20.jpg)
Effect Size (for analysis study)
Evie McCrum-Gardner. International Journal of Therapy and Rehabilitation, January 2010, Vol 17, No 1
![Page 21: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/21.jpg)
• Example of “precision” and “effect size”– Clinical/ Public Health Importance – Not Statistical Significance
Current cure rate = 70%New drug should be 10% better => 80%
Previous survey found infected rate = 15%New survey expected to find infected rate not different from previous survey at + 3% => 12-18%
Effect Size (for analysis study)
![Page 22: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/22.jpg)
22
Bacterial Vaginosis Study
• Example: Relationship Between Priori Info and Effect Size
Effect Size (for analysis study)
![Page 23: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/23.jpg)
23
Bacterial Vaginosis Study
• Example: Relationship Between Priori Info and Effect Size
Effect Size (for analysis study)
![Page 24: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/24.jpg)
• Example: Relationship Between Priori Info and Effect Size• Sample size is function of the
– α type I error allowed– β type II error allowed– actual predicted risk– expected reduction of risk
• The estimated sample size of each arm of a clinical trial, if the tolerated α type I error is 0.05 and β type II error is 0.1?
Predicted Risk
1% 2% 3% 4% 10%
10% risk reduction
197,750 97,924 64,649 48,011 18,064
50% risk reduction
6,253 3,100 2,049 1,524 578
10%- 1%9%
10%- 5%5%
Effect Size (for analysis study)
![Page 25: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/25.jpg)
25
Hypothesis Testing
• Hypothesis & Tail of the test– One-sided vs. Two-sided Test
Two-sided test: Ho: Outcome 1 = Outcome 2 Ha: Outcome 1 ≠ Outcome 2 One-sided test: Ho: Outcome 1 ≤ Outcome 2 Ha: Outcome 1 > Outcome 2 Ho: Outcome1 ≥ Outcome 2 Ha: Outcome 1 < Outcome 2
O1<O2 | O1=O2 | O1>O2 2.5% 95% 2.5%
O1<O2 | O1 >= O2 5% 95%
![Page 26: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/26.jpg)
26
Hypothesis Testing
Not Reject Ho !!µ1 = µ2
Ho: µ1 = µ2Ho: µ1 − µ2 = 0 Ha: µ1 − µ2 = 0
µ1 µ2
![Page 27: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/27.jpg)
27
µ1 µ2
Ho: µ1 − µ2 = 0 Ha: µ1 − µ2 = 0 Reject Ho !!
µ1 < µ2
Hypothesis Testing
![Page 28: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/28.jpg)
28
at α = 0.05Reject H0 !!µ1 > µ2
H0: µ1 − µ2 = 0 Ha: µ1 − µ2 = 0
α / 2 = 0.005
- 2.576
α / 2 = 0.005
2.576
at α = 0.01Not Reject H0 !!µ1 = µ2
given n = very large
p-value = 0.04
Hypothesis Testing
![Page 29: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/29.jpg)
Accept Ho
Reject Ho
Reality/TruthHo True (G1=G2) Ho False (G1<>G2)
Decision
Correct
Correct Type I Error
Type II Error
Power : 1-β
Confidence : 1-α.99, .95
.01, .05
.10, .20
.90, .80
Ho: G1 = G2
Type I & Type II ErrorsConfidence & Power
α
β
![Page 30: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/30.jpg)
Type I & Type II ErrorsConfidence & Power
![Page 31: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/31.jpg)
The Decision Matrix on TrialThe OJ Simpson Trial Analogy
Ho: OJ = Other
Ha: OJ = Other
Type I & Type II ErrorsConfidence & Power
![Page 32: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/32.jpg)
Ho: OJ = Other
Ha: OJ = Other
Ho: OJ = Other Ha: OJ = Other
Type I & Type II ErrorsConfidence & Power
![Page 33: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/33.jpg)
Summary – Generic Concept of SS Estimation
• Parameter Estimation • Hypothesis Testing
µ1 µ2π1 π2
Ho x1 = x2Ho p1 = p2
Population
Sample 1 Sample 2µ X
π p
Para
met
er Statistics
Population Sample
• 2 Types of Study = 2 Types of Formula
![Page 34: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/34.jpg)
nZx //2σα±µ =
nppZp /)1(2/ −± απ =Categorical outcome:
Continuous outcome:
µ X
π p
Para
met
er StatisticsPopulation Sample
Sample size for Parameter Estimation
• 2 Types of Outcome = 2 Types of Formula
Summary – Generic Concept of SS Estimation
![Page 35: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/35.jpg)
• 2 Types of Outcome = 2 Types of Formula
Summary – Generic Concept of SS Estimation
Sample size for Hypothesis Testing(Differences between independent groups)
Ho: π1 = π2
Ho: µ1 = µ2
•Categorical outcome:
•Continuous outcome:
µ1 µ2π1 π2
Ho x1 = x2Ho p1 = p2
Population
Sample 1 Sample 2
![Page 36: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/36.jpg)
Summary – Generic Concept of SS Estimation
π1 / µ1
π2 / µ2
π1 / µ1
π2 / µ2
π1 / µ1
π2 / µ2
π1 / µ1
π2 / µ2
![Page 37: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/37.jpg)
• A priori information about parameters (key outcomes) of interest
• Precision / Effect size
• Confidence level (in parameter estimation) = 95%Tail of the test (in hypothesis testing) = two-tailed
• Type I error (α) (in parameter estimation) = 5%Type I (α) & Type II (β) errors (in hypothesis testing)= 5% & 20%
• Prepare 2 elements for sample size estimation
Summary – Generic Concept of SS Estimation
![Page 38: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/38.jpg)
Case Scenarios
![Page 39: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/39.jpg)
Describe all elements in SS calc
![Page 40: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/40.jpg)
SS Calc afterwards
![Page 41: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/41.jpg)
Large SS without sample size calc
![Page 42: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/42.jpg)
Good explanation of priori info but …
![Page 43: Introduction to Sample Size Estimation](https://reader031.vdocuments.us/reader031/viewer/2022020700/61f372f63d40bf496e710526/html5/thumbnails/43.jpg)
43
The End Sample Size Estimation - Intro