statistics and epidemiology robert f. waters, ph.d
DESCRIPTION
Statistics and Epidemiology Robert F. Waters, Ph.D. Statistics “status” (manner of standing) In medicine Biostatistics Biometrics Epidemiology Epi (upon) demos (people) Study of health and illness in human populations Pattern recognition. Reasons to use Biostatistics. - PowerPoint PPT PresentationTRANSCRIPT
![Page 1: Statistics and Epidemiology Robert F. Waters, Ph.D](https://reader036.vdocuments.us/reader036/viewer/2022062301/568148d0550346895db5eca8/html5/thumbnails/1.jpg)
Statistics and EpidemiologyRobert F. Waters, Ph.D.
Statistics– “status” (manner of standing)
– In medicine• Biostatistics
• Biometrics
Epidemiology– Epi (upon) demos (people)
– Study of health and illness in human populations
– Pattern recognition
![Page 2: Statistics and Epidemiology Robert F. Waters, Ph.D](https://reader036.vdocuments.us/reader036/viewer/2022062301/568148d0550346895db5eca8/html5/thumbnails/2.jpg)
Reasons to use Biostatistics
Evaluation of medical research Applying study results to patient care Understanding epidemiological problems Interpreting information about drugs Evaluating study protocols Participating and directing research projects
![Page 3: Statistics and Epidemiology Robert F. Waters, Ph.D](https://reader036.vdocuments.us/reader036/viewer/2022062301/568148d0550346895db5eca8/html5/thumbnails/3.jpg)
Elementary Probability Theory
Probability of success (p)– p = Pr{E} = h/n
• h = # of ways
• n = total number of ways
Probability of failure (q)– q = Pr{not E} = n - h/n = 1 - h/n= 1-p
p + q = 1.00
![Page 4: Statistics and Epidemiology Robert F. Waters, Ph.D](https://reader036.vdocuments.us/reader036/viewer/2022062301/568148d0550346895db5eca8/html5/thumbnails/4.jpg)
Probability Example:
– 1000 tosses of fair coin get 529 heads– Another 1000 tosses gives 493 heads– Keep repeating tosses should approach p = .5
Cards– Mutually exclusive events..add Pr
• What is probability of drawing an Ace?– 4/52
• What is probability of drawing a king?– 4/52
• How about an ace and king? (With Replacement)– 4/52 + 4/52
![Page 5: Statistics and Epidemiology Robert F. Waters, Ph.D](https://reader036.vdocuments.us/reader036/viewer/2022062301/568148d0550346895db5eca8/html5/thumbnails/5.jpg)
Probability
How about dice?– Throwing two fair die
• Probability of 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
• Let’s play “craps”
![Page 6: Statistics and Epidemiology Robert F. Waters, Ph.D](https://reader036.vdocuments.us/reader036/viewer/2022062301/568148d0550346895db5eca8/html5/thumbnails/6.jpg)
Combinatorial Analysis
What is a factorial?– N! = n(n – 1)(n – 2) ….. 1
– What is 5! ?
Permutation– nPr = n(n – 1)(n – 2) …. (n – r + 1) or:
• n!/(n – r)!
– Problem: How many permutations of the letters a b c be taken 2 at a time?
• 3!/(3 – 2)! = 3 x 2 = 6
• List the permutations.
![Page 7: Statistics and Epidemiology Robert F. Waters, Ph.D](https://reader036.vdocuments.us/reader036/viewer/2022062301/568148d0550346895db5eca8/html5/thumbnails/7.jpg)
Combinatorial Analysis
Combinations (Order does matter)– nCr = n!/r!(n – r)! Or nPr/r!– Number of combinations of a b c taken two at a
time.• 3C2 = 3!/2!(3 – 2)! = ?
• List the combinations.
![Page 8: Statistics and Epidemiology Robert F. Waters, Ph.D](https://reader036.vdocuments.us/reader036/viewer/2022062301/568148d0550346895db5eca8/html5/thumbnails/8.jpg)
Data
Continuous variable– Temperature
Discrete variable– Number of children in a family
• Can’t have 2.3 children
Nominal data– Pretty, tall, etc.
Ordinal data– 0, 3, 5 (order from worst to best)
![Page 9: Statistics and Epidemiology Robert F. Waters, Ph.D](https://reader036.vdocuments.us/reader036/viewer/2022062301/568148d0550346895db5eca8/html5/thumbnails/9.jpg)
What is a Population?
Infinite? Finite We have a sample! Sometimes we need to sample from a large
population. Therefore statistics is called..
– Statistical Inference– Inductive Statistics
• Trying to characterize infinite population
![Page 10: Statistics and Epidemiology Robert F. Waters, Ph.D](https://reader036.vdocuments.us/reader036/viewer/2022062301/568148d0550346895db5eca8/html5/thumbnails/10.jpg)
Measures of Central Tendency Mean
– Arithmetic mean– Harmonic mean (RMS)– Geometric mean
Median Mode
When would the mean median and mode be the same?
What is a variate?
![Page 11: Statistics and Epidemiology Robert F. Waters, Ph.D](https://reader036.vdocuments.us/reader036/viewer/2022062301/568148d0550346895db5eca8/html5/thumbnails/11.jpg)
Measures of Dispersion
Old story of two surgical students! Variance
–
Standard Deviation
![Page 12: Statistics and Epidemiology Robert F. Waters, Ph.D](https://reader036.vdocuments.us/reader036/viewer/2022062301/568148d0550346895db5eca8/html5/thumbnails/12.jpg)
Properties of Standard Deviation
+ & - 1s from mean– 68.27%
+ & -2s from mean– 95.45%
+ & -3s from mean– 99.73%
Problem: Heights in Class
![Page 13: Statistics and Epidemiology Robert F. Waters, Ph.D](https://reader036.vdocuments.us/reader036/viewer/2022062301/568148d0550346895db5eca8/html5/thumbnails/13.jpg)
Moments, Skewness, Kurtosis Major problem in biometrics Moments about the mean
– First moment• Arithmetic mean
– Second moment• Variance
– Skewness• Degree of asymmetry
– Kurtosis• Leptokurtic (narrow)• Platykurtic (flattened)• Mesokurtic (normal)
![Page 14: Statistics and Epidemiology Robert F. Waters, Ph.D](https://reader036.vdocuments.us/reader036/viewer/2022062301/568148d0550346895db5eca8/html5/thumbnails/14.jpg)
Elementary Sampling Theory Many problems in biometrics
– Random samples– Without bias– Error evenly distributed– Level of significance (usually 0.05 in science)– Degrees of freedom
• Orthogonality (Comparisons)– Example:
• Ways to account for sources of variation• Patients with different doctors in different clinics• Patients with same doctors different clinics• Patients with same doctors same clinic
![Page 15: Statistics and Epidemiology Robert F. Waters, Ph.D](https://reader036.vdocuments.us/reader036/viewer/2022062301/568148d0550346895db5eca8/html5/thumbnails/15.jpg)
Application to Epidemiology Binomial Distribution
– p(X) = nCxpxqn-x = [N!/X!(N-X)!] pxqn-x – Problem:
• What is the probability in a family of four children there will be at least 1 boy?
– 1 boy 4C1 (1/2)1 * (1/2)3
• = 4!/1!(4 – 1)! * ½ * 1/8 = ¼– 2 boys = 3/8– 3 boys = ¼– 4 boys = 1/16
• 4C1(1/2)4 * (1/2)0
• What is probability of at least one boy?– Pr(1boy) + Pr(2boys) + Pr(3boys) + Pr(4boys)– ¼ + 3/8 + ¼ + 1/16 = 15/16
![Page 16: Statistics and Epidemiology Robert F. Waters, Ph.D](https://reader036.vdocuments.us/reader036/viewer/2022062301/568148d0550346895db5eca8/html5/thumbnails/16.jpg)
Application of Binomial
Out of 2000 families with four children, how many have at least one boy?– 1875
Out of 2000 families with four children, how many are expected to have two boys?– 750
![Page 17: Statistics and Epidemiology Robert F. Waters, Ph.D](https://reader036.vdocuments.us/reader036/viewer/2022062301/568148d0550346895db5eca8/html5/thumbnails/17.jpg)
How can we tell if something is wrong? Chi-square
– Compares observed with expected
![Page 18: Statistics and Epidemiology Robert F. Waters, Ph.D](https://reader036.vdocuments.us/reader036/viewer/2022062301/568148d0550346895db5eca8/html5/thumbnails/18.jpg)
Statistical Decision Theory
P value Statistical significance One-tailed vs. Two-tailed test Confidence intervals Standard Error
– Standard deviation
![Page 19: Statistics and Epidemiology Robert F. Waters, Ph.D](https://reader036.vdocuments.us/reader036/viewer/2022062301/568148d0550346895db5eca8/html5/thumbnails/19.jpg)
The Correlation
Two independent variables Ice cream in Georgia story
![Page 20: Statistics and Epidemiology Robert F. Waters, Ph.D](https://reader036.vdocuments.us/reader036/viewer/2022062301/568148d0550346895db5eca8/html5/thumbnails/20.jpg)
The Regression
Dependent with Independent Variable Least Squares analysis
![Page 21: Statistics and Epidemiology Robert F. Waters, Ph.D](https://reader036.vdocuments.us/reader036/viewer/2022062301/568148d0550346895db5eca8/html5/thumbnails/21.jpg)
Multiple Linear Regression Analysis (MLRA)
One dependent and multiple independent variables
Predictive? Problems
– Variables normally distributed– Equal variances– True independence between independent
variables
![Page 22: Statistics and Epidemiology Robert F. Waters, Ph.D](https://reader036.vdocuments.us/reader036/viewer/2022062301/568148d0550346895db5eca8/html5/thumbnails/22.jpg)
Hardy Weinberg Equilibrium
Alleles in populations tend towards H-W equilibrium
Answers the questions:– How can O be the most common of the blood types if it
is a recessive trait?
– If Huntington's disease is a dominant trait, shouldn't three-fourths of the population have Huntington's while one-fourth have the normal phenotype?
– Shouldn't recessive traits be gradually be swamped out so they disappear from the population?
![Page 23: Statistics and Epidemiology Robert F. Waters, Ph.D](https://reader036.vdocuments.us/reader036/viewer/2022062301/568148d0550346895db5eca8/html5/thumbnails/23.jpg)
Hardy Weinberg Cont:
Hardy Weinberg equilibrium is achieved if:
– There is a large population– There is random mating– No selection for a particular allele– No mutations– No migration or isolation
![Page 24: Statistics and Epidemiology Robert F. Waters, Ph.D](https://reader036.vdocuments.us/reader036/viewer/2022062301/568148d0550346895db5eca8/html5/thumbnails/24.jpg)
Trend Analysis
Autocorrelation Predictive with assumptions
![Page 25: Statistics and Epidemiology Robert F. Waters, Ph.D](https://reader036.vdocuments.us/reader036/viewer/2022062301/568148d0550346895db5eca8/html5/thumbnails/25.jpg)
Discussion Questions What should you expect in a paper (epidemiology) that
uses statistics? Why not just compare means of samples? Can we always assume statistical assumptions to be
correct? When should a correlation be used? How about a linear regression? MLRA? Binomial? Hardy Weinberg? Chi-Square?