exchangeability and de finetti's theoremsteffen/teaching/grad/definetti.pdf · theorems of...

22
Outline Exchangeable random variables Theorems of deFinetti, Hewitt and Savage Statistical implications Finite exhangeability References Exchangeability and de Finetti’s Theorem Steffen Lauritzen University of Oxford April 26, 2007 Steffen LauritzenUniversity of Oxford Exchangeability and de Finetti’s Theorem

Upload: others

Post on 20-Oct-2019

5 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Exchangeability and de Finetti's Theoremsteffen/teaching/grad/definetti.pdf · Theorems of deFinetti, Hewitt and Savage Statistical implications Finite exhangeability References de

OutlineExchangeable random variables

Theorems of deFinetti, Hewitt and SavageStatistical implications

Finite exhangeabilityReferences

Exchangeability and de Finetti’s Theorem

Steffen LauritzenUniversity of Oxford

April 26, 2007

Steffen LauritzenUniversity of Oxford Exchangeability and de Finetti’s Theorem

Page 2: Exchangeability and de Finetti's Theoremsteffen/teaching/grad/definetti.pdf · Theorems of deFinetti, Hewitt and Savage Statistical implications Finite exhangeability References de

OutlineExchangeable random variables

Theorems of deFinetti, Hewitt and SavageStatistical implications

Finite exhangeabilityReferences

Overview of Lectures

1. Exchangeability and de Finetti’s Theorem

2. Sufficiency, Partial Exchangeability, and Exponential Families

3. Exchangeable Arrays and Random Networks

Basic references for the series of lectures include Aldous (1985)and Lauritzen (1988). Other references will be given as we goalong. For this particular lecture, the article by Kingman (1978) isstrongly recommended.

Steffen LauritzenUniversity of Oxford Exchangeability and de Finetti’s Theorem

Page 3: Exchangeability and de Finetti's Theoremsteffen/teaching/grad/definetti.pdf · Theorems of deFinetti, Hewitt and Savage Statistical implications Finite exhangeability References de

OutlineExchangeable random variables

Theorems of deFinetti, Hewitt and SavageStatistical implications

Finite exhangeabilityReferences

Exchangeable random variables

Theorems of deFinetti, Hewitt and Savage

Statistical implications

Finite exhangeability

Steffen LauritzenUniversity of Oxford Exchangeability and de Finetti’s Theorem

Page 4: Exchangeability and de Finetti's Theoremsteffen/teaching/grad/definetti.pdf · Theorems of deFinetti, Hewitt and Savage Statistical implications Finite exhangeability References de

OutlineExchangeable random variables

Theorems of deFinetti, Hewitt and SavageStatistical implications

Finite exhangeabilityReferences

Definition

An infinite sequence X1, . . . ,Xn, . . . of random variables is said tobe exchangeable if for all n = 2, 3, . . . ,

X1, . . . ,XnD= Xπ(1), . . . ,Xπ(n) for all π ∈ S(n),

where S(n) is the group of permutations of {1, . . . , n}.For example, for a binary sequence, we may have:

p(1, 1, 0, 0, 0, 1, 1, 0) = p(1, 0, 1, 0, 1, 0, 0, 1).

If X1, . . . ,Xn, . . . are independent and identically distributed, theyare exchangeable, but not conversely.

Steffen LauritzenUniversity of Oxford Exchangeability and de Finetti’s Theorem

Page 5: Exchangeability and de Finetti's Theoremsteffen/teaching/grad/definetti.pdf · Theorems of deFinetti, Hewitt and Savage Statistical implications Finite exhangeability References de

OutlineExchangeable random variables

Theorems of deFinetti, Hewitt and SavageStatistical implications

Finite exhangeabilityReferences

Polya’s Urn

Consider an urn with b black balls and w white balls. Draw a ballat random and note its colour. Replace the ball together with aballs of the same colour. Repeat the procedure ad infinitum.Let Xi = 1 if the i-th draw yields a black ball and Xi = 0otherwise.

The sequence X1, . . . ,Xn, . . . is exchangeable:

p(1, 1, 0, 1) =b

b + w

b + a

b + w + a

w

b + w + 2a

b + 2a

b + w + 3a

=b

b + w

w

b + w + a

b + a

b + w + 2a

b + 2a

b + w + 3a= p(1, 0, 1, 1).

X1, . . . ,Xn, . . . are not independent (nor even a Markov process).

Steffen LauritzenUniversity of Oxford Exchangeability and de Finetti’s Theorem

Page 6: Exchangeability and de Finetti's Theoremsteffen/teaching/grad/definetti.pdf · Theorems of deFinetti, Hewitt and Savage Statistical implications Finite exhangeability References de

OutlineExchangeable random variables

Theorems of deFinetti, Hewitt and SavageStatistical implications

Finite exhangeabilityReferences

A Gaussian example

Consider a sequence of Gaussian random variables with X1, . . . ,Xn

multivariate Gaussian Nn(0,Σ) with mean zero and covariancematrix determined as

E(X 2i ) = 1, E(XiXj) = ρ for i 6= j .

For the matrix to be positive definite so the distribution is welldefined, we must have

ρ ≥ − 1

n − 1

and since this should hold for all n, we further conclude ρ ≥ 0.Since a Gaussian distribution is determined by its mean andcovariance, this defines an exchangeable sequence.

Steffen LauritzenUniversity of Oxford Exchangeability and de Finetti’s Theorem

Page 7: Exchangeability and de Finetti's Theoremsteffen/teaching/grad/definetti.pdf · Theorems of deFinetti, Hewitt and Savage Statistical implications Finite exhangeability References de

OutlineExchangeable random variables

Theorems of deFinetti, Hewitt and SavageStatistical implications

Finite exhangeabilityReferences

de Finetti’s Theorem

de Finetti (1931) shows that all exchangeable binary sequences aremixtures of Bernoulli sequences:A binary sequence X1, . . . ,Xn, . . . is exchangeable if and only ifthere exists a distribution function F on [0, 1] such that for all n

p(x1, . . . , xn) =

∫ 1

0θtn(1− θ)n−tn dF (θ),

where p(x1, . . . , xn) = P(X1 = x1, . . . ,Xn = xn) and tn =∑n

i=1 xi .

Steffen LauritzenUniversity of Oxford Exchangeability and de Finetti’s Theorem

Page 8: Exchangeability and de Finetti's Theoremsteffen/teaching/grad/definetti.pdf · Theorems of deFinetti, Hewitt and Savage Statistical implications Finite exhangeability References de

OutlineExchangeable random variables

Theorems of deFinetti, Hewitt and SavageStatistical implications

Finite exhangeabilityReferences

More about de Finetti’s Theorem

It further holds that F is the distribution function of the limitingfrequency:

Y = X∞ = limn→∞

∑i

Xi/n, P(Y ≤ y) = F (y)

and the Bernoulli distribution is obtained by conditioning withY = θ:

P(X1 = x1, . . . ,Xn = xn |Y = θ) = θtn(1− θ)n−tn .

Steffen LauritzenUniversity of Oxford Exchangeability and de Finetti’s Theorem

Page 9: Exchangeability and de Finetti's Theoremsteffen/teaching/grad/definetti.pdf · Theorems of deFinetti, Hewitt and Savage Statistical implications Finite exhangeability References de

OutlineExchangeable random variables

Theorems of deFinetti, Hewitt and SavageStatistical implications

Finite exhangeabilityReferences

Polya’s Urn

Urn has initially b black balls and w white balls. Draw a ball atrandom and note its colour. Replace the ball together with a ballsof the same colour. Repeat the procedure ad infinitum.Let Xi = 1 if the i-th draw yields a black ball and Xi = 0 otherwise.The average Xn converges to a random value Y which has a Betadistribution

limn→∞

Xn = Y ∼ B(b/a,w/a).

Conditionally on Y = θ, X1 . . . , Xn, . . . are independent andBernoulli with parameter θ.This has been known as folklore for a long time. First publishedproof seems to be in Blackwell and Kendall (1964), see alsoFreedman (1965).

Steffen LauritzenUniversity of Oxford Exchangeability and de Finetti’s Theorem

Page 10: Exchangeability and de Finetti's Theoremsteffen/teaching/grad/definetti.pdf · Theorems of deFinetti, Hewitt and Savage Statistical implications Finite exhangeability References de

OutlineExchangeable random variables

Theorems of deFinetti, Hewitt and SavageStatistical implications

Finite exhangeabilityReferences

Hewitt–Savage

Hewitt and Savage (1955) have generalized de Finetti’s result:Let X1, . . . ,Xn, . . . be an exchangeable sequence of randomvariables with values in X . Then there exists a probability measureµ on the set of probability measures P(X ) on X , such that

P(X1 ∈ A1, . . . ,Xn ∈ An) =

∫Q(A1) · · ·Q(An) µ(dQ),

It further holds that µ is the distribution function of the empiricalmeasure:

M(A) = limn→∞

Mn(A) = limn→∞

1

n

n∑i=1

χA(Xi ), M ∼ µ.

and Q⊗∞ is the distribution obtained by conditioning with M:

P(X1 ∈ A1, . . . ,Xn ∈ An |M = Q) = Q(A1) · · ·Q(An).Steffen LauritzenUniversity of Oxford Exchangeability and de Finetti’s Theorem

Page 11: Exchangeability and de Finetti's Theoremsteffen/teaching/grad/definetti.pdf · Theorems of deFinetti, Hewitt and Savage Statistical implications Finite exhangeability References de

OutlineExchangeable random variables

Theorems of deFinetti, Hewitt and SavageStatistical implications

Finite exhangeabilityReferences

Gaussian example

X1, . . . ,Xn multivariate Gaussian Nn(0,Σ) with mean zero andcovariance

E(X 2i ) = 1, E(XiXj) = ρ for i 6= j , ρ > 0,

the empirical distribution function converges to a Gaussiandistribution

F∞(x) = limn→∞

1

n

n∑i=1

χ]−∞,x](Xi ) = Φ{(x − Y )/√

1− ρ},

where Φ is the Gaussian distribution function and

Y = limn→∞

Xn ∼ N (0, ρ).

Conditionally on Y = θ, X1 . . . , Xn, . . . are independent anddistributed as N (θ, 1− ρ).

Steffen LauritzenUniversity of Oxford Exchangeability and de Finetti’s Theorem

Page 12: Exchangeability and de Finetti's Theoremsteffen/teaching/grad/definetti.pdf · Theorems of deFinetti, Hewitt and Savage Statistical implications Finite exhangeability References de

OutlineExchangeable random variables

Theorems of deFinetti, Hewitt and SavageStatistical implications

Finite exhangeabilityReferences

Simplest Bayesian model:Subjective exchangeable probability distribution P representingYour expectations for behaviour of X1, . . . ,Xn, . . ..

Simplest frequentist model:There is an unknown distribution Q, so that X1, . . . ,Xn, . . . areindependent and identically distributed with distribution Q, Q(A)being defined as the limiting proportion of X ’s in A.

Steffen LauritzenUniversity of Oxford Exchangeability and de Finetti’s Theorem

Page 13: Exchangeability and de Finetti's Theoremsteffen/teaching/grad/definetti.pdf · Theorems of deFinetti, Hewitt and Savage Statistical implications Finite exhangeability References de

OutlineExchangeable random variables

Theorems of deFinetti, Hewitt and SavageStatistical implications

Finite exhangeabilityReferences

de Finetti–Hewitt–Savage Theorem provides bridge between thetwo model types:In P, the distribution Q exists as a random object, also determinedby the limiting frequency. The distribution, µ, of Q is the Bayesianprior distribution:

P(X1 ∈ A1, . . . ,Xn ∈ An) =

∫Q(A1) · · ·Q(An) µ(dQ),

The empirical measure Mn (Xn in the binary case) is sufficient forthe unknown ‘parameter’ Q.

Steffen LauritzenUniversity of Oxford Exchangeability and de Finetti’s Theorem

Page 14: Exchangeability and de Finetti's Theoremsteffen/teaching/grad/definetti.pdf · Theorems of deFinetti, Hewitt and Savage Statistical implications Finite exhangeability References de

OutlineExchangeable random variables

Theorems of deFinetti, Hewitt and SavageStatistical implications

Finite exhangeabilityReferences

Summary of de Finetti–Hewitt-Savage

For an exchangeable sequence X1, . . . ,Xn, . . . with jointdistribution P we have for most, (but not all) measure spaces X :

I An integral representation: P =∫

Q⊗∞ µ(dQ);

I The empirical distribution Mn converges to a randomdistribution M;

I µ is uniquely determined by P as the distribution of thelimiting empirical measure M, so we can write P = Pµ.

Steffen LauritzenUniversity of Oxford Exchangeability and de Finetti’s Theorem

Page 15: Exchangeability and de Finetti's Theoremsteffen/teaching/grad/definetti.pdf · Theorems of deFinetti, Hewitt and Savage Statistical implications Finite exhangeability References de

OutlineExchangeable random variables

Theorems of deFinetti, Hewitt and SavageStatistical implications

Finite exhangeabilityReferences

Summary of de Finetti–Hewitt-Savage

For an exchangeable sequence X1, . . . ,Xn, . . . with jointdistribution P we have for most, (but not all) measure spaces X :

I An integral representation: P =∫

Q⊗∞ µ(dQ);

I The empirical distribution Mn converges to a randomdistribution M;

I µ is uniquely determined by P as the distribution of thelimiting empirical measure M, so we can write P = Pµ.

Steffen LauritzenUniversity of Oxford Exchangeability and de Finetti’s Theorem

Page 16: Exchangeability and de Finetti's Theoremsteffen/teaching/grad/definetti.pdf · Theorems of deFinetti, Hewitt and Savage Statistical implications Finite exhangeability References de

OutlineExchangeable random variables

Theorems of deFinetti, Hewitt and SavageStatistical implications

Finite exhangeabilityReferences

Summary of de Finetti–Hewitt-Savage

For an exchangeable sequence X1, . . . ,Xn, . . . with jointdistribution P we have for most, (but not all) measure spaces X :

I An integral representation: P =∫

Q⊗∞ µ(dQ);

I The empirical distribution Mn converges to a randomdistribution M;

I µ is uniquely determined by P as the distribution of thelimiting empirical measure M, so we can write P = Pµ.

Steffen LauritzenUniversity of Oxford Exchangeability and de Finetti’s Theorem

Page 17: Exchangeability and de Finetti's Theoremsteffen/teaching/grad/definetti.pdf · Theorems of deFinetti, Hewitt and Savage Statistical implications Finite exhangeability References de

OutlineExchangeable random variables

Theorems of deFinetti, Hewitt and SavageStatistical implications

Finite exhangeabilityReferences

X1, . . . ,Xn is n-exchangeable if

X1, . . . ,XnD= Xπ(1), . . . ,Xπ(n) for all π ∈ S(n).

Every n-subset of an exchangeable sequence is n-exchangeable forevery n, but not conversely.Two binary variables X1,X2 are 2-exchangeable (pairwiseexchangeable) if and only if

p(0, 1) = p(1, 0).

A k-exchangeable X1, . . . ,Xk is n-extendable if it has the samedistribution as the k first of an n-exchangeable variables X1, . . . ,Xn

Steffen LauritzenUniversity of Oxford Exchangeability and de Finetti’s Theorem

Page 18: Exchangeability and de Finetti's Theoremsteffen/teaching/grad/definetti.pdf · Theorems of deFinetti, Hewitt and Savage Statistical implications Finite exhangeability References de

OutlineExchangeable random variables

Theorems of deFinetti, Hewitt and SavageStatistical implications

Finite exhangeabilityReferences

Finite versions of de Finetti’s Theorem

Diaconis and Freedman (1980) show for X1, . . . ,Xk k-exchangeableand n-extendable, where Pk denotes distribution of X1, . . . ,Xk .The distribution Pk can be approximated with the k-marginal Pµk

of an exchangeable distribution Pµ, with bound satisfying

||Pk − Pµk || ≤2ck

n,

if |X | = c < ∞, and in the general case

||Pk − Pµk || ≤k(k − 1)

n.

Here ||P −Q|| = 2 supA |P(A)−Q(A)| is the total variation norm .

Steffen LauritzenUniversity of Oxford Exchangeability and de Finetti’s Theorem

Page 19: Exchangeability and de Finetti's Theoremsteffen/teaching/grad/definetti.pdf · Theorems of deFinetti, Hewitt and Savage Statistical implications Finite exhangeability References de

OutlineExchangeable random variables

Theorems of deFinetti, Hewitt and SavageStatistical implications

Finite exhangeabilityReferences

Proof of classical theorem

Most proofs of the de Finetti–Hewitt–Savage Theorem are basedon martingale arguments, considering quantities such as

Znk = E{φ1(X1)φ2(X2) · · ·φk(Xk) |Xn,Xn+1, . . .}

which is a reverse martingale w.r.t. the decreasing family ofσ-algebras Sn = σ{Xn,Xn+1, . . .}.See for example (Kingman, 1978) for a simple and illuminatingproof.

Steffen LauritzenUniversity of Oxford Exchangeability and de Finetti’s Theorem

Page 20: Exchangeability and de Finetti's Theoremsteffen/teaching/grad/definetti.pdf · Theorems of deFinetti, Hewitt and Savage Statistical implications Finite exhangeability References de

OutlineExchangeable random variables

Theorems of deFinetti, Hewitt and SavageStatistical implications

Finite exhangeabilityReferences

Proof of finite theorem

The proof of the finite version is essentially based on acombinatorial exercise, establishing sharp bounds for theapproximations of sampling with and without replacement.The finite version yields the integral representation of anexchangeable probability distribution as a corollary.

Steffen LauritzenUniversity of Oxford Exchangeability and de Finetti’s Theorem

Page 21: Exchangeability and de Finetti's Theoremsteffen/teaching/grad/definetti.pdf · Theorems of deFinetti, Hewitt and Savage Statistical implications Finite exhangeability References de

OutlineExchangeable random variables

Theorems of deFinetti, Hewitt and SavageStatistical implications

Finite exhangeabilityReferences

Aldous, D. (1985). Exchangeability and related topics. InHennequin, P., editor, Ecole d’Ete de Probabilites de Saint–FlourXIII — 1983, pages 1–198. Springer-Verlag, Heidelberg. LectureNotes in Mathematics 1117.

Blackwell, D. and Kendall, D. (1964). The Martin boundary forPolya’s urn scheme, and an application to stochastic populationgrowth. Journal of Applied Probability, 1:284–296.

de Finetti, B. (1931). Funzione caratteristica di un fenomenoaleatorio. Atti della R. Academia Nazionale dei Lincei, Serie 6.Memorie, Classe di Scienze Fisiche, Mathematice e Naturale,4:251–299.

Diaconis, P. and Freedman, D. (1980). Finite exchangeablesequences. Annals of Probability, 8:745–764.

Freedman, D. A. (1965). Bernard Friedman’s urn. Annals ofMathematical Statistics, 36:956–970.

Steffen LauritzenUniversity of Oxford Exchangeability and de Finetti’s Theorem

Page 22: Exchangeability and de Finetti's Theoremsteffen/teaching/grad/definetti.pdf · Theorems of deFinetti, Hewitt and Savage Statistical implications Finite exhangeability References de

OutlineExchangeable random variables

Theorems of deFinetti, Hewitt and SavageStatistical implications

Finite exhangeabilityReferences

Hewitt, E. and Savage, L. J. (1955). Symmetric measures onCartesian products. Transactions of the American MathematicalSociety, 80:470–501.

Kingman, J. F. C. (1978). Uses of exchangeability. Annals ofProbability, 6:183–197.

Lauritzen, S. L. (1988). Extremal Families and Systems ofSufficient Statistics. Springer-Verlag, Heidelberg. Lecture Notesin Statistics 49.

Steffen LauritzenUniversity of Oxford Exchangeability and de Finetti’s Theorem