ECON 307/STAT 310 Probability and Statistics or Introduction to Mathematical Statistics Fall 2017 George R. Brown School of Engineering - STATISTICS


Source: STAT 310 - Probability and Statistics, dobelman/courses/310.Overview.0.pdf

ECON 307/STAT 310 Probability and Statistics

or

Introduction to Mathematical Statistics

Fall 2017

George R. Brown School of Engineering - STATISTICS

Everyone’s Favorite Subject

Elementary Statistics

Logistics and Expectations

• Fast Paced
– Not an abbreviated exercise
– Concise as possible
– Required base knowledge for most of the stat courses to come
• What to expect
– Get and read the book(s)!
– Lectures
– Class participation
– Homework
– Exams

Presenter notes: Win to “kick your own ass”

Understanding
• Dr. Dobelman’s website:
– How to find
– http://dobelman.rice.edu
• Course Syllabus
• Canvas
• Pace yourself

Mathematical Statistics

$$\mathrm{E}(X) = \int x \, dF(x) = \begin{cases} \int x f(x)\, dx, & X \text{ is continuous} \\ \sum_i x_i \, p(x_i), & X \text{ is discrete} \end{cases}$$
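The two cases of the expectation formula can be sketched numerically. This is a minimal illustration, not part of the course material: the fair die, the exponential density with rate 2, the truncation of the integral at 40, and the helper names `expectation_discrete` and `expectation_continuous` are all assumptions chosen for the example.

```python
import math

def expectation_discrete(pmf):
    """E(X) = sum_i x_i p(x_i) for a pmf given as {x_i: p(x_i)}."""
    return sum(x * p for x, p in pmf.items())

def expectation_continuous(pdf, lower, upper, steps=100_000):
    """E(X) = integral of x f(x) dx, approximated by the midpoint rule."""
    width = (upper - lower) / steps
    total = 0.0
    for k in range(steps):
        x = lower + (k + 0.5) * width
        total += x * pdf(x) * width
    return total

# Discrete case: a fair six-sided die (illustrative choice).
die = {face: 1 / 6 for face in range(1, 7)}
mean_die = expectation_discrete(die)

# Continuous case: exponential density with rate 2, truncated at 40
# (the mass beyond 40 is negligible for this rate).
rate = 2.0
mean_exp = expectation_continuous(lambda x: rate * math.exp(-rate * x), 0.0, 40.0)

print(mean_die, mean_exp)
```

The discrete sum is exact up to floating-point rounding, while the continuous case is only a numerical approximation of the integral.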

Statistics
• What is “Statistics”
– vs. what “are” statistics
• [sample] average
• [sample] proportion
• What is N?
• Why talk about probability and statistics?

Statistics
• Branch of mathematics (BOM) which studies methods for the calculation of probabilities
• BOM - collection and interpretation of quantitative data and the use of probability theory to estimate population parameters
• OED
– The branch of political science concerned with the collection, classification, and discussion of (esp. numerical) facts bearing on the condition of a state or community.
– The branch of science or mathematics concerned with the analysis and interpretation of numerical data and appropriate ways of gathering such data.
– Also, the systematic collection and arrangement of numerical facts or data of any kind

Don’t Like Statistics
• Why don’t people like statistics?
– Some people “hate” statistics
– Too hard to understand or learn
– Doesn’t make sense/not intuitive
– Too many formulas
– Do not understand the applicability
– Easy to misuse or “lie”

ECON 307/STAT 310

• Population versus samples
• Descriptive Statistics
– Summarizing, describing, data reduction
• Probability (populations)
– How the population behaves
– Probabilities of obtaining outcomes
• Statistics and Inference (samples)
– Data used to make statements or decisions about the universe from which the data are obtained

Population v. Sample

[Figure: population (distribution F(x), mean µ) and sample (X)]

Presenter notes: Ps. 1 [1] Blessed is the man that walketh not in the counsel of the ungodly, nor standeth in the way of sinners, nor sitteth in the seat of the scornful. [2] But his delight is in the law of the LORD; and in his law doth he meditate day and night. [3] And he shall be like a tree planted by the rivers of water, that bringeth forth his fruit in his season; his leaf also shall not wither; and whatsoever he doeth shall prosper. [4] The ungodly are not so: but are like the chaff which the wind driveth away. [5] Therefore the ungodly shall not stand in the judgment, nor sinners in the congregation of the righteous. [6] For the LORD knoweth the way of the righteous: but the way of the ungodly shall perish. Ro 4. [3] For what saith the scripture? Abraham believed God, and it was counted unto him for righteousness.

Descriptive
• Stem and Leaf
• Histogram
• Univariate summary statistics

114,023 Earthquake Mag.
Mean                  1.98
Median                1.90
Mode                  1.90
Standard Deviation    0.8803
Variance              0.7749
Kurtosis              0.4995
Skewness              0.4353
Standard Error        0.0026
Range                 7.3
Minimum               -0.6
Maximum               6.7
97% Conf. Level       0.0057

2 Days, Japan, 2/10/01
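Several entries in the summary table are functions of the others, so they can be cross-checked. The sketch below rests on two assumptions that the slide does not state: that 114,023 is the number of observations, and that "97% Conf. Level" is the half-width of a normal-based interval for the mean (z times the standard error).

```python
import math
from statistics import NormalDist

n = 114_023                  # assumed to be the number of observations
sd = 0.8803                  # reported standard deviation
minimum, maximum = -0.6, 6.7

variance = sd ** 2                   # table reports 0.7749
std_error = sd / math.sqrt(n)        # table reports 0.0026
value_range = maximum - minimum      # table reports 7.3

# Assumed meaning of "97% Conf. Level": z * SE with a two-sided 97% normal quantile.
z = NormalDist().inv_cdf(1 - 0.03 / 2)
half_width = z * std_error           # table reports 0.0057

print(round(variance, 4), round(std_error, 4),
      round(value_range, 1), round(half_width, 4))
```

Under those assumptions the derived values agree with the table to the precision shown, which supports reading the run-together text "6.797%" as a maximum of 6.7 followed by a 97% level.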

• Probability
– Mathematical foundation/basis for results in statistics
• Statistics
– Branch of scientific method that deals with [numerical] properties of populations that occur in nature (or our imaginations), of natural phenomena
– Natural phenomena includes all the happenings of the external world, human or not
– Estimators of parameters

$$\sup_x |F_n(x) - F(x)| \xrightarrow{a.s.} 0$$

ˆ( ) ; ( ) ?u P X u f aα ααβ> = =

Laplacian Determinism

• Laplacian Dæmon – 19th century ideal
– Perfect knowledge of the past and the system → perfect prediction
• Poincaré complication (per Mirowski, 1990)
– Imperfect knowledge of past (minor errors) → wildly discrepant future predictions
– 60 years before Mandelbrot

Presenter notes: Simeon Denis Poisson, Ph.D., École Polytechnique, 1800, France. Advisor 1: Joseph Louis Lagrange; Advisor 2: Pierre-Simon Laplace.
Mirowski, Philip. From Mandelbrot to Chaos in Economic Theory, Southern Economic Journal, Vol. 57, No. 2 (Oct., 1990), pp. 289-307

Keep in mind
• Understand variability
• Violin story
• Fundamental theorem of probability
• Fundamental theorem of statistics

$$\sup_x |F_n(x) - F(x)| \xrightarrow{a.s.} 0$$

$$\frac{\sqrt{n}\,(\bar{X}_n - \mu_X)}{\sigma_X} \xrightarrow{d} \Phi(z)$$

$$\bar{X}_n \xrightarrow{a.s.} \mu$$
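Two of the limits above, the almost sure convergence of the sample mean and the uniform convergence of the empirical distribution function, can be watched in a small simulation. This is only a sketch under assumptions: the Uniform(0, 1) population, the seed, the sample sizes, and the helper names are illustrative choices, and a single simulated run shows typical behaviour rather than proving either theorem.

```python
import random

def sample_mean(xs):
    return sum(xs) / len(xs)

def sup_ecdf_distance(xs, cdf):
    """sup_x |F_n(x) - F(x)|, evaluated at the jump points of the empirical CDF."""
    xs = sorted(xs)
    n = len(xs)
    worst = 0.0
    for i, x in enumerate(xs):
        fx = cdf(x)
        # F_n jumps from i/n to (i+1)/n at x, so check both sides of the jump.
        worst = max(worst, abs((i + 1) / n - fx), abs(i / n - fx))
    return worst

def uniform_cdf(x):
    return min(max(x, 0.0), 1.0)

rng = random.Random(310)      # fixed seed so the run is repeatable
results = {}
for n in (100, 10_000, 100_000):
    xs = [rng.random() for _ in range(n)]
    # (|sample mean - mu|, sup distance between F_n and F) for this n
    results[n] = (abs(sample_mean(xs) - 0.5), sup_ecdf_distance(xs, uniform_cdf))

for n, (mean_err, sup_err) in results.items():
    print(n, mean_err, sup_err)
```

Both error columns should be small at the largest sample size, although neither is guaranteed to shrink monotonically from one n to the next.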

Black Magic Stats Courses/Books
• Table of Contents
– 1. Histograms, time series charts
– 2. Organizing Data
– 3. Averages and Variation
– 4. Correlation and Regression
– 5. Elementary Probability Theory
– 6. The Binomial Probability Distribution and Related Topics
– 7. Normal Curves and Sampling Distributions
– 8.1 Estimating µ When σ is Known
– 8.2 Estimating µ When σ is Unknown
– 8.3 Estimating p in the Binomial Distribution
– 9.2 Testing the Mean of µ
– 9.3 Testing a Proportion p
– 10. Inferences About Differences
– 11.1 Chi-Square: Tests of Independence
– 11.2 Chi-Square: Goodness of Fit
– 11.3 Testing a Single Variance or Standard Deviation
– Part II: Inferences Relating to Linear Regression
– 11.4 Inferences for Correlation and Regression
• Supposed to remember - Basic notation and definitions

How to Handle This?

MathStat Contents
1. Probability and Distributions
2. Multivariate Distributions
3. Some Special Distributions
4. Some Elementary Statistical Inferences
5. Consistency and Limiting Distributions
6. Maximum Likelihood Methods
7. Sufficiency
8. Optimal Tests of Hypotheses
9. Inferences about Normal Models
10. Nonparametric and Robust Statistics
11. Bayesian Statistics
Appendix A. Mathematical Comments
Appendix B. R-Functions
Appendix C. Tables of Distributions
Appendix D. List of Common Distributions
Appendix E. References
Appendix F. Answers to Selected Exercises
Index

1. Probability and Distributions.

$$(\Omega, \mathcal{A}, P) \xrightarrow{X(\omega)} (\mathbb{R}, \mathcal{B}, F)$$

$$P(X \in A) = ?$$

$$\mathrm{E}(g(X)) = \int_\Omega g(X(\omega)) \, dP(\omega) = \int g(x) \, dF(x)$$
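The equality of the two integrals for E(g(X)) can be checked exactly on a finite sample space. The following sketch assumes an illustrative setup that is not from the slides: two fair dice as the sample space, X as their total, g(x) = x², and the event A = {7, 11}.

```python
from collections import defaultdict
from fractions import Fraction
from itertools import product

# Sample space: ordered outcomes of two fair dice, each with probability 1/36.
omega = list(product(range(1, 7), repeat=2))
prob = Fraction(1, len(omega))

def X(w):
    """Random variable X(omega): the total of the two dice."""
    return w[0] + w[1]

def g(x):
    return x * x

# Integral over the sample space: sum of g(X(omega)) * P({omega}).
lhs = sum(g(X(w)) * prob for w in omega)

# Integral against the distribution of X: sum of g(x) * P(X = x).
pmf = defaultdict(Fraction)
for w in omega:
    pmf[X(w)] += prob
rhs = sum(g(x) * p for x, p in pmf.items())

# P(X in A) for the illustrative event A = {7, 11}.
p_A = sum(pmf[x] for x in (7, 11))

print(lhs, rhs, p_A)
```

Using `Fraction` keeps the arithmetic exact, so the two sides can be compared with `==` instead of a floating-point tolerance.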

1. Probability and Distributions.

• Set theory
• The probability set function
• Conditional probability and Independence
• Random variables
– Discrete random variables
– Discrete random variables - Transformations
– Continuous random variables
– Continuous random variables - Transformations
• Expectation of a random variable
• Some special expectations
• Important Inequalities

2. Multivariate Distributions.

$$(\Omega, \mathcal{A}, P) \xrightarrow{(X(\omega),\, Y(\omega))} (\mathbb{R}^2, \mathcal{B}^2, F_{XY})$$

$$f_{X,Y} = ?$$

$$\Sigma = \mathrm{E}\big[(X - \mathrm{E}(X))(Y - \mathrm{E}(Y))\big] = \iint_{\mathbb{R}^2} (x - \mathrm{E}(X))(y - \mathrm{E}(Y)) \, dF_{XY}$$

$$(U, V) = g(X, Y): \quad u = g_1(x, y), \quad v = g_2(x, y)$$

$$f_{U,V} = ?$$

2. Multivariate Distributions

• Distribution of two random variables
• Distribution of two random variables - Expectation
• Transformations - Bivariate random variables
• Conditional distributions and expectations
• The Correlation Coefficient
• Independent random variables
• Extension to several random variables
• Covariance matrices
• Transformations for several random variables
• Linear combinations of random variables

3. Some Special Distributions

$$P(X = x) = f_X(x) = p_x$$

$$P(X \le x) = F_X(x) = \int_{-\infty}^{x} dF = \int_{-\infty}^{x} f(x) \, dx$$

3. Some Special Distributions

• Binomial and related distributions
• The Poisson distribution
• The gamma, chi squared and beta distributions
• The Normal distribution
• The multivariate normal distribution
• The t-distribution
• The F-distribution
• Student's theorem
• Mixture distributions

4. Elementary Statistical Inferences

$$\hat{\theta} = W(X) \equiv W(X_1, \ldots, X_n)$$

$$P\big(L(X) \le \theta \le U(X)\big) = 1 - \alpha = 95\%$$

$$H_0: \mu = 100 \qquad H_1: \mu \ne 100$$

$$X_1, \ldots, X_n \ \text{iid}, \quad X_i \sim F_X(x) \ \text{with} \ f_X(x \mid \theta)$$
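The interval statement and the test of H0: µ = 100 can be illustrated together. This is a sketch under assumptions not taken from the slides: simulated normal data with true mean 100 and standard deviation 15, a sample of 400, and a large-sample normal approximation (a z-interval and z-test rather than the t-based versions). The function names are hypothetical.

```python
import math
import random
from statistics import NormalDist, mean, stdev

def z_interval(xs, alpha=0.05):
    """Large-sample interval (L(X), U(X)) for the mean with nominal coverage 1 - alpha."""
    n = len(xs)
    centre = mean(xs)
    half = NormalDist().inv_cdf(1 - alpha / 2) * stdev(xs) / math.sqrt(n)
    return centre - half, centre + half

def z_test_p_value(xs, mu0):
    """Two-sided p-value for H0: mu = mu0 against H1: mu != mu0."""
    n = len(xs)
    z = (mean(xs) - mu0) / (stdev(xs) / math.sqrt(n))
    return 2 * (1 - NormalDist().cdf(abs(z)))

rng = random.Random(2017)                        # fixed seed for repeatability
xs = [rng.gauss(100, 15) for _ in range(400)]    # simulated data, true mean 100

lower, upper = z_interval(xs)
p_value = z_test_p_value(xs, mu0=100)
print(lower, upper, p_value)
```

The two procedures are dual: the 5% two-sided test rejects H0 exactly when 100 falls outside the 95% interval.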

4. Elementary Statistical Inferences

• Sampling and statistics
• Histogram estimates of pdmf's
• Confidence intervals
• Confidence intervals for differences in means
• Confidence intervals for differences in proportions
• Confidence intervals for parameters of discrete distributions
• Order statistics
• Quantiles and confidence intervals for quantiles
• Introduction to hypothesis testing
• Additional comments about statistical tests
• Chi-Squared tests
• Monte Carlo
• Bootstrap procedures
• Percentile bootstrap confidence intervals
• Bootstrap testing procedures
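As one concrete instance of the bootstrap topics listed above, a percentile bootstrap confidence interval can be sketched in a few lines. The simulated exponential sample, the choice of the median as the statistic, the number of resamples, and the function name `percentile_bootstrap_ci` are illustrative assumptions, and the percentile indexing shown is one simple convention among several.

```python
import random
from statistics import median

def percentile_bootstrap_ci(xs, statistic, n_boot=2000, alpha=0.05, seed=0):
    """Percentile interval: resample with replacement, recompute the statistic,
    and read off the alpha/2 and 1 - alpha/2 empirical quantiles."""
    rng = random.Random(seed)
    n = len(xs)
    reps = sorted(statistic(rng.choices(xs, k=n)) for _ in range(n_boot))
    lo_idx = round((alpha / 2) * n_boot)
    hi_idx = round((1 - alpha / 2) * n_boot) - 1
    return reps[lo_idx], reps[hi_idx]

rng = random.Random(310)
data = [rng.expovariate(1.0) for _ in range(200)]   # simulated, skewed sample

ci_low, ci_high = percentile_bootstrap_ci(data, median)
print(ci_low, median(data), ci_high)
```

The same function works for any statistic that maps a list to a number, which is the main appeal of the method when no closed-form sampling distribution is at hand.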

5. Consistency/Limiting Distributions

$$\left. \begin{array}{l} X_n \xrightarrow{a.s.} X \\ X_n \xrightarrow{L^p} X \\ X_n \xrightarrow{c} X \end{array} \right\} \Rightarrow X_n \xrightarrow{p} X \Rightarrow X_n \xrightarrow{d} X$$

$$\frac{\sqrt{n}\,(\bar{X}_n - \mu)}{\sigma_X} \xrightarrow{d} \phi$$

5. Consistency/Limiting Distributions

• Convergence in probability
• Convergence in distribution
• Convergence in distribution - Bounded in probability
• Convergence in distribution - Delta method
• Convergence in distribution - MGF technique
• Central limit theorem
• Multivariate extensions

6. Maximum Likelihood Methods

$$\hat{\theta} = W(X_1, \ldots, X_n)$$

$$L(\theta \mid x) = f_X(x \mid \theta)$$

2E( ( ))Var( )( )n

gWIθθ

6. Maximum Likelihood Methods

• Maximum Likelihood estimation
• Cramér-Rao lower bound and efficiency
• Maximum likelihood tests
• Multi parameter case: Estimates
• Multi parameter case: Testing
• The EM algorithm

7. Sufficiency

[Figure: Fish Caught per Hour versus Index (x-axis 0 to 100, y-axis 0 to 6); Sum of Xi = 213]

$$\lambda = ?$$
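One way to read the question on this slide is as an estimation problem in which only the total matters. The sketch below assumes a Poisson model for the hourly counts and assumes n = 100 observations (suggested by the axis, but not stated on the slide); under those assumptions the log-likelihood depends on the data only through the sum, and the maximizer is the sum divided by n.

```python
import math

n = 100        # assumed number of hourly observations (the axis runs to 100)
total = 213    # "Sum of Xi = 213" from the slide

def poisson_log_likelihood(lam, n, total):
    """Poisson log-likelihood up to the additive constant -sum(log(x_i!)),
    which does not involve lambda. The data enter only through n and total."""
    return total * math.log(lam) - n * lam

# Closed form: setting the derivative total/lam - n to zero gives lam = total / n.
lam_hat = total / n

# Numerical cross-check over a grid of candidate values 0.001, 0.002, ..., 10.
grid = [k / 1000 for k in range(1, 10_001)]
lam_grid = max(grid, key=lambda lam: poisson_log_likelihood(lam, n, total))

print(lam_hat, lam_grid)
```

Any two samples of the same size with the same total give the same likelihood curve here, which is the sense in which the sum is sufficient for the rate in this assumed model.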

7. Sufficiency
• Measures of quality of estimators
• A sufficient statistic for a parameter
• Properties of a sufficient statistic
• Completeness and uniqueness
• The Exponential class of distributions

$$f_X(x \mid \theta) = e^{w(\theta)' T(x) + H(x) + C(\theta)}$$

7. Sufficiency
• Functions of a parameter
• The case of several parameters
• Minimal sufficiency and ancillary statistics
• Sufficiency, completeness and independence

8. Hypothesis Testing

• Hypothesis

$$H_0: \theta = \theta_0 \ \text{vs.} \ H_1: \theta = \theta_1 \qquad H_0: \theta = \theta_0 \ \text{vs.} \ H_1: \theta \ne \theta_0$$

$$H_0: f = f_0 \ \text{vs.} \ H_1: f \ne f_0$$

• Test statistic T=T(X1,X2,…,Xn)
• Critical region (to reject H0)

$$R = \{x : \text{Reject } H_0\}$$

• Significance level (α)
• Power of the test (β)

8. Optimal Tests of Hypotheses

$$\beta(\theta) = P(\text{reject } H_0) \quad \forall \theta$$

1 0( ) ( ) vs.β θ β θ θ θ′ ′≥ ∀ ∈Θ ∈Θ

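$$\Lambda(x) = \frac{\sup_{\theta \in \Theta_0} L(\theta \mid x)}{\sup_{\theta \in \Theta} L(\theta \mid x)}$$

$$R = \{x : \text{Reject } H_0 \text{ if } \lambda \le c\}$$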

8. Optimal Tests of Hypotheses

• Most powerful tests
• Uniformly most powerful tests
• Likelihood ratio tests
• Sequential probability ratio test
• Minimax procedure
• Classification procedure

9. Inferences about Normal Models

$$f_X(x) = \frac{1}{(2\pi)^{p/2} |\Sigma|^{1/2}} \, e^{-\frac{1}{2}(X - \mu)^T \Sigma^{-1} (X - \mu)}$$

$$X^T \Sigma^{-1} X$$

$$(X - \mu)^T \Sigma^{-1} (X - \mu)$$

$$Y = X\beta + \varepsilon$$

$$\hat{\beta} = (X^T X)^{-1} X^T Y$$
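The least-squares formula for the regression coefficients can be sketched without any external library. In this illustration the data are made up (Y = 1 + 2x with no noise, so the answer is known in advance), the helper names are hypothetical, and the normal equations (XᵀX)β = XᵀY are solved by elimination instead of forming the inverse explicitly, which gives the same estimate when XᵀX is invertible.

```python
def transpose(a):
    return [list(col) for col in zip(*a)]

def matmul(a, b):
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def solve(a, b):
    """Solve a @ x = b by Gauss-Jordan elimination with partial pivoting.
    a is a square matrix, b is a single column given as a list of 1-element rows."""
    n = len(a)
    aug = [row[:] + rhs[:] for row, rhs in zip(a, b)]
    for c in range(n):
        pivot = max(range(c, n), key=lambda r: abs(aug[r][c]))
        aug[c], aug[pivot] = aug[pivot], aug[c]
        for r in range(n):
            if r != c:
                factor = aug[r][c] / aug[c][c]
                aug[r] = [v - factor * w for v, w in zip(aug[r], aug[c])]
    return [[aug[i][n] / aug[i][i]] for i in range(n)]

# Design matrix with an intercept column; Y = 1 + 2 x exactly (illustrative data).
X = [[1.0, x] for x in (0.0, 1.0, 2.0, 3.0, 4.0)]
Y = [[1.0 + 2.0 * row[1]] for row in X]

Xt = transpose(X)
beta_hat = solve(matmul(Xt, X), matmul(Xt, Y))   # solves (X^T X) beta = X^T Y

print(beta_hat)
```

With noiseless data the estimate recovers the intercept and slope used to generate Y, up to floating-point rounding.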

9. Inferences about Normal Models

• Quadratic forms
• One-way ANOVA
• Noncentral chi square and F-distributions
• Multiple comparisons
• The analysis of variance
• A regression problem
• A test of independence
• The distribution of certain quadratic forms
• The independence of certain quadratic forms

10. Nonparametric and Robust Statistics

• Location models
• Sample median and the sign test
– ARE
– estimating equations based on the sign test
– CI for median
• Signed-rank Wilcoxon
– ARE
– estimating equations based on Signed-rank Wilcoxon
– CI for median
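The sign test from the list above is simple enough to sketch directly. This is an illustration under assumptions: the data values are made up, the function name is hypothetical, observations equal to the hypothesized median are dropped, and the two-sided p-value is taken as twice the smaller binomial tail capped at 1, which is one common convention.

```python
from math import comb

def sign_test_p_value(xs, theta0):
    """Two-sided exact sign test of H0: median = theta0."""
    signs = [x > theta0 for x in xs if x != theta0]   # drop ties with theta0
    n = len(signs)
    s = sum(signs)                 # number of observations above theta0
    k = min(s, n - s)
    # Under H0, S ~ Binomial(n, 1/2); double the smaller tail, capped at 1.
    tail = sum(comb(n, j) for j in range(k + 1)) / 2 ** n
    return min(1.0, 2 * tail)

data = [1.2, 3.4, 2.2, 5.1, 4.8, 3.9, 6.0, 2.9, 4.4, 5.6]   # made-up values

p_at_2 = sign_test_p_value(data, theta0=2.0)   # 9 of 10 values lie above 2.0
p_at_4 = sign_test_p_value(data, theta0=4.0)   # 5 of 10 values lie above 4.0

print(p_at_2, p_at_4)
```

Only the signs of the differences enter the calculation, so the test needs no distributional assumption beyond a common median.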

10. Nonparametric and Robust Statistics

• Mann-Whitney-Wilcoxon procedure
– ARE
– estimating equations based on Mann-Whitney-Wilcoxon
– CI for shift parameter ∆
• General rank scores
– Efficacy
– estimating equations based on general scores
– optimization: best estimates
• Adaptive procedures
• Simple linear model
• Measures of association
– Kendall's tau τ
– Spearman's rho ρ
• Robust concepts
– location model
– linear model

11. Bayesian Statistics

• Subjective probability
• Bayesian procedures:
– Prior and posterior distributions
– Bayesian point estimates
– Bayesian interval estimation
– Bayesian testing procedures
– Bayesian sequential procedures
• More Bayesian terminology and ideas
• Gibbs sampler
• Modern empirical Bayes
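The step from prior to posterior to point estimate can be sketched with the conjugate Beta-binomial pair. Everything specific here is an assumption made for illustration: the uniform Beta(1, 1) prior, the data (7 successes and 3 failures), and the function names.

```python
from fractions import Fraction

def beta_binomial_update(a, b, successes, failures):
    """Posterior Beta(a + successes, b + failures) for a Beta(a, b) prior
    on the success probability and binomial data."""
    return a + successes, b + failures

def beta_mean(a, b):
    """Mean of a Beta(a, b) distribution."""
    return Fraction(a, a + b)

prior = (1, 1)                                  # uniform prior (illustrative)
posterior = beta_binomial_update(*prior, successes=7, failures=3)
post_mean = beta_mean(*posterior)               # Bayes estimate under squared-error loss

print(posterior, post_mean)
```

The posterior mean, 2/3 here, sits between the prior mean 1/2 and the sample proportion 7/10, and moves toward the sample proportion as more data arrive.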

Appendices

• Mathematical comments
– Regularity conditions
– Sequences
• R-Functions
• Tables of Distributions
• List of Common Distributions
• References
• Answers to Selected Exercises