the statistical properties of large scale structure alexander szalay department of physics and...

29
The Statistical Properties of Large Scale Structure Alexander Szalay Department of Physics and Astronomy The Johns Hopkins University

Upload: magnus-rodgers

Post on 25-Dec-2015

215 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: The Statistical Properties of Large Scale Structure Alexander Szalay Department of Physics and Astronomy The Johns Hopkins University

The Statistical Properties of Large Scale

Structure

Alexander SzalayDepartment of Physics and Astronomy

The Johns Hopkins University

Page 2: The Statistical Properties of Large Scale Structure Alexander Szalay Department of Physics and Astronomy The Johns Hopkins University

Outline

• Lecture 1: – Basic cosmological background

– Growth of fluctuations

– Parameters and observables

• Lecture 2: – Statistical concepts and definitions

– Practical approaches

– Statistical estimators

• Lecture 3: – Applications to the SDSS

– Angular correlations with photometric redshifts

– Real-space power spectrum

Page 3: The Statistical Properties of Large Scale Structure Alexander Szalay Department of Physics and Astronomy The Johns Hopkins University

Lecture #2

• Statistical concepts and definitions– Correlation function

– Power spectrum

– Smoothing kernels

– Window functions

• Statistical estimators

Page 4: The Statistical Properties of Large Scale Structure Alexander Szalay Department of Physics and Astronomy The Johns Hopkins University

Basic Statistical Tools

• Correlation functions

– N-point and Nth-order

– Defined in real space

– Easy to compute, direct physical meaning

– Easy to generalize to higher order

• Power spectrum

– Fourier space equivalent of correlation functions

– Directly related to linear theory

– Origins in the Big Bang

– Connects the CMB physics to redshift surveys

• Most common are the 2nd order functions:

– Variance 82

– Two-point correlation function (r)

– Power spectrum P(k)

32

21 321 21

Page 5: The Statistical Properties of Large Scale Structure Alexander Szalay Department of Physics and Astronomy The Johns Hopkins University

The Galaxy Correlation Function

• First measured by Totsuji and Kihara, then Peebles etal

• Mostly angular correlations in the beginning

• Later more and more redshift space

• Power law is a good approximation

• Correlation length r0=5.4 h-1 Mpc

• Exponent is around =1.8

• Corresponding angular correlations

0

)(r

rr

1

0

)(rw

Page 6: The Statistical Properties of Large Scale Structure Alexander Szalay Department of Physics and Astronomy The Johns Hopkins University

The Overdensity

• We can observe galaxy counts n(x), and compare to the expected counts

• Overdensity

• Fourier transform

• Inverse

• Wave number

1)(

)( n

n xx

)(

2

1)( 3

3 kkx kx

ied

)()( 3 xxk kx ied

2

k

n

)(xn

Page 7: The Statistical Properties of Large Scale Structure Alexander Szalay Department of Physics and Astronomy The Johns Hopkins University

The Power Spectrum

• Consider the ensemble average

• Change the origin by R

• Translational invariance

• Power spectrum

• Rotational invariance

)()( 2*

1 kk

)()()(~

)(~

2*

1)(

2*

121 kkkk Rkk ie

2

121)3(3

2*

1 )()()2()()( kkkkk

)()( kPP k

2)()( kk P

Page 8: The Statistical Properties of Large Scale Structure Alexander Szalay Department of Physics and Astronomy The Johns Hopkins University

Correlation Function

• Defined through the ensemble average

• Can be expressed through the Fourier transform

• Using the translational invariance in Fourier space

• The correlation function only depends on the distance

)()(),( 2121 xxxx

)()()2(

1),( 2

*1

)(2

31

3621

2211 kkkkxx xkxk

iedd

)()(),( 21)(3

2121 xxkxx xxk kPed i

)()()2(

1),( 121

)3()(2

31

3321

2211 kPedd i kkkkxx xkxk

Page 9: The Statistical Properties of Large Scale Structure Alexander Szalay Department of Physics and Astronomy The Johns Hopkins University

P(k) vs (r)

• The Rayleigh expansion of a plane wave gives

• Using the rotational invariance of P(k)

• The power per logarithmic interval

• The power spectrum and correlation function form a Fourier transform pair

)()(4

1)()( 0

2221 kPkrjkdkr

xx

l

llli Pkrjlie )ˆˆ()()12( rkkr

)()(ln4

1)( 3

02kPkkrjkdr

Page 10: The Statistical Properties of Large Scale Structure Alexander Szalay Department of Physics and Astronomy The Johns Hopkins University

Filtering the Density

• Effect of a smoothing kernel K(x), where

• Convolution theorem

• Filtered power spectrum

• Filtered correlation function

1)(3 xx Kd

)'()'(')( 3 xxxxx KdKs

)()()( kkk Ks

222)()()()()( kkkkk KPKPs

2302

)()()(ln4

1)( kKkPkkrjkdrs

Page 11: The Statistical Properties of Large Scale Structure Alexander Szalay Department of Physics and Astronomy The Johns Hopkins University

Variance

• At r=0 separation we get the variance:

• Usual kernel is a ‘top-hat’ with an R=8h-1 Mpc radius

• The usual normalization of the power spectrum is using this window

232

2 )()(ln4

1kKkPkkd RR

kr

krjkK

Rr

RrrK RR

)()(

if,0

if,1)( 1

2

83

2

28 )()(ln

4

1kKkPkkd

Page 12: The Statistical Properties of Large Scale Structure Alexander Szalay Department of Physics and Astronomy The Johns Hopkins University

Selection Window

• We always have an anisotropic selection window, both over the sky and along the redshift direction

• The observed overdensity is

• Using the convolution theorem

outside

insideW

if,0

if,0)(r

)()()( xxx Ww

)'()'(')( 3 kkkkk Wdw

Page 13: The Statistical Properties of Large Scale Structure Alexander Szalay Department of Physics and Astronomy The Johns Hopkins University

The Effect of Window Shape

• The lines in the spectrum are at least as broad as the window – the PSF of measuring the power spectrum!

• The shape of the window in Fourier space is the conjugate to the shape in real space

• The larger the survey volume, the sharper the k-space window => survey design

)()()()()()( ''''*'''''*'''''3''3'* kkkkkkkkkk WWdd wwww

)()()()2()()( '''*''''''33'* kkkkkkkk WWPdww

Page 14: The Statistical Properties of Large Scale Structure Alexander Szalay Department of Physics and Astronomy The Johns Hopkins University

1-Point Probability Distribution

• Overdensity is a superposition from Fourier space

• Each depends on a large number of modes

• Variance (usually filtered at some scale R)

• Central limit theorem: Gaussian distribution

)(

2

1)( 3

3 kkx kx

ied

2

2

2)(

2

2

e

P

22

Page 15: The Statistical Properties of Large Scale Structure Alexander Szalay Department of Physics and Astronomy The Johns Hopkins University

N-point Probability Distribution

• Many ‘soft’ pixels, smoothed with a kernel

• Raw dataset x

• Parameter vector

• Joint probability distribution is a multivariate Gaussian

• M is the mean, C is the correlation matrix

• C depends on the parameters

)()(

2

1exp)2(),( 12/12/ mxCmxCΘx Tnf

xm jijiij mmxxC

Page 16: The Statistical Properties of Large Scale Structure Alexander Szalay Department of Physics and Astronomy The Johns Hopkins University

Fisher Information Matrix

• Measures the sensitivity of the probability distribution to the parameters

• Kramer-Rao theorem:one cannot measure a parameter more accurately than

jiij

f

ln2

F

iiF

1

Page 17: The Statistical Properties of Large Scale Structure Alexander Szalay Department of Physics and Astronomy The Johns Hopkins University

Higher Order Correlations

• One can define higher order correlation functions

• Irreducible correlations represented by connected graphs

• For Gaussian fields only 2-point, all other =0

• Peebles conjecture: only tree graphs are present

• Hierarchical expansion

• Important on small scales

321

4321

terms) 1-N(... nijkijNN Q

Page 18: The Statistical Properties of Large Scale Structure Alexander Szalay Department of Physics and Astronomy The Johns Hopkins University

Correlation Estimators

• Expectation value

• Rewritten with the density as

• Often also written as

• Probability of finding objects in excess of random

• The two estimators above are NOT EQUIVALENT

)()(),( 2121 xxxx

21

221112

1121

2112

RR

DD

Page 19: The Statistical Properties of Large Scale Structure Alexander Szalay Department of Physics and Astronomy The Johns Hopkins University

Edge Effects

• The objects close to the edge are different

• The estimator has an excessvariance (Ripley)

• If one is using the firstestimator, these cancelin first order (Landy and Szalay 1996)

21

2211

21

221112

))((

RR

RDRD

RR

RRDRDD

212

Page 20: The Statistical Properties of Large Scale Structure Alexander Szalay Department of Physics and Astronomy The Johns Hopkins University

Discrete Counts

• We can measure discrete galaxy counts

• The expected density is a known <n>, fractional

• The overdensity is

• If we define cells with counts Ni

)()( rrr Dn

n

nn )(r

i

iii N

NN )(r

Page 21: The Statistical Properties of Large Scale Structure Alexander Szalay Department of Physics and Astronomy The Johns Hopkins University

Power Spectrum

• Naïve estimator for a discrete density field is

• FKP (Feldman, Kaiser and Peacock) estimatorThe Fourier space equivalent to LS

n

i neN

f krk1

)(ˆ

Ne

Ne

NfkP

nn

i

nn

i nnnn111

)(ˆ)(ˆ'

)(2

',

)(2

''

rrkrrkk

)()()(ˆ krk kr wefn

in

n

)()(1

)()(

kPrn

rn

r

Page 22: The Statistical Properties of Large Scale Structure Alexander Szalay Department of Physics and Astronomy The Johns Hopkins University

Wish List

• Almost lossless for the parameters of interest

• Easy to compute, and test hypotheses(uncorrelated errors)

• Be computationally feasible

• Be able to include systematic effects

Page 23: The Statistical Properties of Large Scale Structure Alexander Szalay Department of Physics and Astronomy The Johns Hopkins University

The Karhunen-Loeve Transform

• Subdivide survey volume into thousands of cells

• Compute correlation matrix of galaxy counts among cells from fiducial P(k)+ noise model

• Diagonalize matrix– Eigenvalues

– Eigenmodes

• Expand data over KL basis

• Iterate over parameter values:– Compute new correlation matrix

– Invert, then compute log likelihood

Vogeley and Szalay (1996)

Page 24: The Statistical Properties of Large Scale Structure Alexander Szalay Department of Physics and Astronomy The Johns Hopkins University

Eigenmodes

Page 25: The Statistical Properties of Large Scale Structure Alexander Szalay Department of Physics and Astronomy The Johns Hopkins University

Eigenmodes

• Optimal weighting of cells to extract signal represented by the mode

• Eigenvalues measure S/N

• Eigenmodes orthogonal

• In k-space their shape is close to window function

• Orthogonality = repulsion

• Dense packing of k-space => filling a ‘Fermi sphere’

Page 26: The Statistical Properties of Large Scale Structure Alexander Szalay Department of Physics and Astronomy The Johns Hopkins University

Truncated expansion

• Use less than all the modes: truncation

• Best representation in the rms sense

• Optimal subspace filtering, throw away modes which contain only noise

NMbfM

iii

1

N

Miiff

1

2)ˆ(

Page 27: The Statistical Properties of Large Scale Structure Alexander Szalay Department of Physics and Astronomy The Johns Hopkins University

Correlation Matrix

• The mean correlation between cells

• Uses a fiducial power spectrum

• Iterate during the analysis

)()(),( 2121)(

23

13 rWrWrrrdrd ji

sij

Page 28: The Statistical Properties of Large Scale Structure Alexander Szalay Department of Physics and Astronomy The Johns Hopkins University

Whitening Transform

• Remove expected count ni

• ‘Whitened’ counts

• Can be extended to other types of noise=> systematic effects

• Diagonalization: overdensity eigenmodes

• Truncation optimizes the overdensity

i

iii n

ngd

ji

ij

i

ijijij nnn

R

Page 29: The Statistical Properties of Large Scale Structure Alexander Szalay Department of Physics and Astronomy The Johns Hopkins University

Truncation

• Truncate at 30 Mpc/h – avoid most non-linear effects

– keep decent number of modes