2-7 feb 2003samsi multiscale workshop1 nonstationary spatial covariance modeling via spatial...
Post on 21-Dec-2015
216 views
TRANSCRIPT
2-7 Feb 2003 SAMSI Multiscale Workshop 1
Nonstationary spatial covariance modeling via spatial deformation
& extensions to covariates and models for dynamic atmospheric processes
Paul D. Sampson
(w/ Doris Damian, Peter Guttorp, Tilmann Gneiting)
University of Washington and
National Research Center for Statistics and the Environment
2-7 Feb 2003 SAMSI Multiscale Workshop 2
A common framework for spatio-temporal modeling:
Response(x,t) = Trend(x,t) + Residual(x,t)
Some (obvious) observations on issues of scale: the practical definition and estimation of Trend and Residual depend on:
Understanding of the spatio-temporal processes of interest
Scientific questions to be addressed by analysis and modeling
Details of available spatio-temporal sampling data—inference at ``small’’ spatial and/or temporal scales generally requires sampling and modeling at corresponding scales.
2-7 Feb 2003 SAMSI Multiscale Workshop 3
My experience:
1. Hourly ozone monitoring data at 100 sites in the San Joaquin Valley, CA, for assessment of photochemical model predictions:
Accounting for sub-grid scale spatial variability in point monitoring observations to assess grid average photochemical predictions
Decomposing spatial difference fields—empirical grid cell estimates vs. model predictions—into components at different spatial scales
Comparing empirical spatial covariance structure with model spatial covariance structure at different spatial scales.
2. 10-day aggregate precipitation in Languedoc-Roussillon, France
3. Daily ozone monitoring summaries (max 8-hr ave) from 100’s of monitoring sites across the country for spatial prediction at target locations.
2-7 Feb 2003 SAMSI Multiscale Workshop 4
Languedoc-Roussillon Precipitation Sites
2-7 Feb 2003 SAMSI Multiscale Workshop 5
SEASONAL (Apr-Oct) ozone sampling sites with 75% complete data each year, 1992-94, N = 257
(target counties highlighted)
2-7 Feb 2003 SAMSI Multiscale Workshop 6
ANNUAL ozone sampling sites with 75% complete data each year, 1992-94, N = 407
(target counties highlighted)
2-7 Feb 2003 SAMSI Multiscale Workshop 7
Objectives for approaches to nonstationary spatial covariance modeling.
Characterizing spatially varying, locally (stationary) anisotropic structure.
Scientific understanding/representation of covariance structure—not just a method of providing covariances for kriging.
Capable of:
reflecting effects of known explanatory environmental processes such as transport/wind, topography, point sources
modeling effects of known explanatory environmental processes
2-7 Feb 2003 SAMSI Multiscale Workshop 8
Objectives (cont.)
Application to purely spatial problems and/or problems with data sampled irregularly in space and time
Application in context of dynamic models for space-time structure
Application to “large” problems/data sets
Diagnostics for local and large-scale correlation structure:
o is the spatial structure “right”
o is the nature/degree of nonstationarity (smoothness) right?
Evaluation of uncertainty in estimation (interpolation) of spatial covariance structure
Incorporation in an approach to spatial estimation accounting for uncertainty in estimation of (parameters of) spatial covariance structure
2-7 Feb 2003 SAMSI Multiscale Workshop 9
Selected Methods and References:1. Basis function methods (Nychka, Wikle, …)
2. Kernel/smoothing methods (Fuentes, …)
3. Process convolution models (Higdon, …)
4. Parametric models
5. Spatial deformation models
• Sampson & Guttorp, 1989, 1992, 1994 • Meiring, Monestiez, Sampson & Guttorp, 1997. “Developments...
Geostatistics Wollongong '96.
• Mardia & Goodall, 1993. in Multivariate Environmental Statistics.
• Smith, 1996. Estimating nonstationary spatial correlations.
UNC preprint.
2-7 Feb 2003 SAMSI Multiscale Workshop 10
4. Spatial deformation models (cont.)
• Perrin & Meiring, 1999. Identifiability J Appl Prob. • Perrin & Senoussi, 1998. Reducing nonstationary random fields to
stationarity or isotropy, Stat & Prob Letters. • Perrin & Monestiez, 1998. Parametric radial basis deformations.
GeoENV-II. • Iovleff & Perrin, 2000. Simulated annealing. (JSM 2000)
• Schmidt & O'Hagan, 2000. Bayesian inference. (JSM 2000)
• Damian, Sampson & Guttorp, 2000. Bayesian estimation of semiparametric nonstationary covariance structures. Environmetrics.
• Damian, Sampson & Guttorp, 2003. Variance modeling for nonstationary spatial processes with temporal replications. JGR (submitted)
• Related recent developments in the atmospheric science literature:
2-7 Feb 2003 SAMSI Multiscale Workshop 11
Spatial Deformation ModelDamian et al., 2000 (Environmetrics), 2003 (JGR)
1 2, , ,tZ x t x t x H x x t
( , ) spatio-temporal trendparametric in time; mv spatial process
x t
( ) temporal variance at ,log-normal spatial process
x x
2( , )
(0, ), ( )msmt error and short-scale variation
independent of t
x tN H x
( )( ( ), ( )) 1.
ndmean 0, var 1, 2 -order cont. spatial processCov
t
t t x y
H xH x H y
2-7 Feb 2003 SAMSI Multiscale Workshop 12
( )( ( ), ( )) 1.
ndmean 0, var 1, 2 -order cont. spatial processCov
t
t t x y
H xH x H y
( ), ( ) ( ) ( )
:
( )
smooth, bijective(Geographic Deformed plane)
isotropic correlation functionin a known parametric family(exponential, power exp, Matern)
Cor t t
f G D
H x H y f x
d
f y
i.e. The correlation structure of the spatial process is an (isotropic) function of Euclidean distances between site locations after a bijective transformation of the geographic coordinate system.
Model (cont.)
2-7 Feb 2003 SAMSI Multiscale Workshop 13
A deformation of the unit square: (b) true, (c) estimated
2-7 Feb 2003 SAMSI Multiscale Workshop 14
Biorthogonal grids for a deformation of the unit square,(a) true, (b) estimated
2-7 Feb 2003 SAMSI Multiscale Workshop 15
The spatial deformation f encodes the nonstationarity: spatially varying local anisotropy.
We model this in terms of observation sites as a pair of thin-plate splines:
Model (cont.)
1 2, , , Nx x x
Tf x c x x A W
c xA
T xW
1
N
x x
x
x x
2log 0
0 0
h h hh
h
Linear part: global/large scale anisotropy 2 1 2 2, c A
Non-linear part, decomposable into components of varying spatial scale: 2 1, ( ) N Nx W
2 2:{ , , }, , , , :{ , , }f c A W Model parameters:
2-7 Feb 2003 SAMSI Multiscale Workshop 16
Likelihood: given observation vectors Z1,…,ZN of length T:
with covariance matrix having elements
21
2 1 1
1, , |( , , , , ; , , )
( 1
, ,
)2 exp tr
| ,
2 2
N
T
Nf Z Z
T TZ
Z Z
Z
Z
Σ
Σ Σ
Σ
S Σ
S L
2
,1 ,i j i j
ij
j
i ji j N
i j
Integrating out a flat prior on the (constant) mean
1 2 1
1
| ,( 1)
exp2
| , d trT T
Z
S Σ SΣ ΣS Σ
2-7 Feb 2003 SAMSI Multiscale Workshop 17
Posterior:
2
112
2 2
2 2 2
1
( )
, , , , ,
1exp (log )
, : , , , , , , ,
, , ,
(l g
,
)2
,
o
,
Log-normal variance field
Full posterior is:
:
ix
c
c
f
c c
A W
Σ 1
A W S
A WS A W
Σ 1
Σ
1 1 2 2
, :
1exp ( ) , ( )
2
diffuse normal prior on 2 free params (4 constr)
"bending energy" prior on space orthogonal to linear
ij i j
c
I x x
W V V
A
W W SW W SW S
2-7 Feb 2003 SAMSI Multiscale Workshop 18
Computation
• Metropolis-Hastings algorithm for sampling from the highly multidimensional posterior.
• Given estimates of D-plane locations, f(xi), the transformation is extrapolated to the whole domain using thin-plate splines. (Visualization and diagnostics.)
• Predictive distributions for
(a) temporal variance at unobserved sites,
(b) the spatial covariance for pairs of observed and/or unobserved sites,
(c) the observation process at unobserved sites.
2-7 Feb 2003 SAMSI Multiscale Workshop 19
Application to Languedoc-Roussillon Precipitation Data
• 108 altitude-adjusted, 10-day aggregate precip records at 39 sites (Nov-Dec, 1975-1992).
• Data log-transformed and site-specific means removed (for this analysis).
• Estimated deformation is non-linear: correlation stronger in the NE region, weaker in the SW.
2-7 Feb 2003 SAMSI Multiscale Workshop 20
Languedoc-Roussillon Precipitation Sites
2-7 Feb 2003 SAMSI Multiscale Workshop 21
Estimated deformation of Languedoc-Roussillon region
(a)
9
19
22
25
33
41
4553
(b)
9
19
22
25
33
41
45
53
2-7 Feb 2003 SAMSI Multiscale Workshop 22
Correlation vs Distance in G-plane and D-plane
2-7 Feb 2003 SAMSI Multiscale Workshop 23
Equi-correlation (0.9) contours D-plane (a) and G-plane (b)
2-7 Feb 2003 SAMSI Multiscale Workshop 24
Estimated (•) and predicted (◦) variances, , vs. observed temporal variances, with one predictive std dev bars.
0( )x
2-7 Feb 2003 SAMSI Multiscale Workshop 25
Assessment of (10-day aggregate) precipitation predictions at validation sites
2-7 Feb 2003 SAMSI Multiscale Workshop 26
Assessment of (10-day aggregate) precipitation predictions at validation sites (pred interval and obs)
2-7 Feb 2003 SAMSI Multiscale Workshop 27
SEASONAL (Apr-Oct) ozone sampling sites with 75% complete data each year, 1992-94, N = 257
(target counties highlighted)
2-7 Feb 2003 SAMSI Multiscale Workshop 28
Spatio-temporal trend modeling:
• Standardize (center and scale) the data for each of N=750 sites over 3 years and assemble N x (3*364) data matrix (“tricks” to deal with missing data).
• Compute SVD and consider first 3-5 right singular vectors (EOFs); smooth these w.r.t. time.
• Use these temporal trend components as basis functions for estimation of site-specific trends by regression.
( , )x t
2-7 Feb 2003 SAMSI Multiscale Workshop 29
dates92to94
Tota
l.svd
$sv
d$
v[,
j]
-0.0
50
.05
01/01/1992 01/01/1993 01/01/1994
dates92to94
Tota
l.svd
$sv
d$
v[,
j]
-0.1
00
.00
.10
01/01/1992 01/01/1993 01/01/1994
dates92to94
Tota
l.svd
$sv
d$
v[,
j]
-0.1
00
.0
01/01/1992 01/01/1993 01/01/1994
dates92to94
Tota
l.svd
$sv
d$
v[,
j]
-0.1
00
.00
.10
01/01/1992 01/01/1993 01/01/1994
dates92to94
Tota
l.svd
$sv
d$
v[,
j]
-0.1
00
.00
.10
01/01/1992 01/01/1993 01/01/1994
dates92to94
Tota
l.svd
$sv
d$
v[,
j]
-0.1
00
.00
.10
01/01/1992 01/01/1993 01/01/1994
Annual Trend Components fitted to all sites
2-7 Feb 2003 SAMSI Multiscale Workshop 30
Region 6: Southern Calif (annual)
2-7 Feb 2003 SAMSI Multiscale Workshop 31
Region 6 : S Calif Starplot of temporal trend coefficients
60730001
60731001
60379002
60710012
6111000761113001
2-7 Feb 2003 SAMSI Multiscale Workshop 32
1992-1994
sqrt
(ma
x 8
hr
O3
)
0.0
0.1
0.2
0.3
0.4
01/01/1992 07/01/1993
60730001
1992-1994
sqrt
(ma
x 8
hr
O3
)
0.0
0.1
0.2
0.3
0.4
01/01/1992 07/01/1993
60731001
1992-1994sq
rt(m
ax
8h
r O
3)
0.0
0.1
0.2
0.3
0.4
01/01/1992 07/01/1993
60379002
1992-1994
sqrt
(ma
x 8
hr
O3
)
0.0
0.1
0.2
0.3
0.4
01/01/1992 07/01/1993
60710012
1992-1994
sqrt
(ma
x 8
hr
O3
)
0.0
0.1
0.2
0.3
0.4
01/01/1993 07/01/1994
61110007
1992-1994
sqrt
(ma
x 8
hr
O3
)
0.0
0.1
0.2
0.3
0.4
01/01/1992 07/01/1993
61113001
2-7 Feb 2003 SAMSI Multiscale Workshop 33
Region 6 : S Calif Circle radii proportional to (detrended) site variances
2-7 Feb 2003 SAMSI Multiscale Workshop 34
N= 55 , S Calif monitoring sites and their representation in a deformed coordinate system reflecting spatial covariance
Tue Feb 04 18:28:03 PST 2003
2-7 Feb 2003 SAMSI Multiscale Workshop 35
N=55, S Calif: 4 samples from the posterior distribution of deformations reflecting spatial covarianceTue Feb 04 18:30:29 PST 2003
2-7 Feb 2003 SAMSI Multiscale Workshop 36
Region 6 S Calif
Geographic Distance (km)
Co
vari
an
ce
0 100 200 300 400 500
0.0
0.0
00
50
.00
10
0.0
01
50
.00
20
Region 6 S Calif
D-plane Distance
Co
vari
an
ce
0 100 200 300 400
0.0
0.0
00
50
.00
10
0.0
01
50
.00
20
2-7 Feb 2003 SAMSI Multiscale Workshop 37
Region 6 S Calif
Geographic Distance (km)
Co
rre
latio
n
0 100 200 300 400 500
0.0
0.2
0.4
0.6
0.8
1.0
Region 6 S Calif
D-plane Distance
Co
rre
latio
n
0 100 200 300 400
0.0
0.2
0.4
0.6
0.8
1.0
2-7 Feb 2003 SAMSI Multiscale Workshop 38
Region 1: Northeast (seasonal)
2-7 Feb 2003 SAMSI Multiscale Workshop 39
Region 1 : Northeast Starplot of temporal trend coefficients
510410004516500004
90110008
250010002360530006
360430005
2-7 Feb 2003 SAMSI Multiscale Workshop 40
1992-1994
sqrt
(ma
x 8
hr
O3
)
0.0
0.1
0.2
0.3
0.4
01/01/1993 07/01/1994
510410004
1992-1994
sqrt
(ma
x 8
hr
O3
)
0.0
0.1
0.2
0.3
0.4
01/01/1993 07/01/1994
516500004
1992-1994
sqrt
(ma
x 8
hr
O3
)
0.0
0.1
0.2
0.3
0.4
01/01/1993 07/01/1994
90110008
1992-1994
sqrt
(ma
x 8
hr
O3
)
0.0
0.1
0.2
0.3
0.4
01/01/1993 07/01/1994
250010002
1992-1994
sqrt
(ma
x 8
hr
O3
)
0.0
0.1
0.2
0.3
0.4
01/01/1992 07/01/1993
360530006
1992-1994
sqrt
(ma
x 8
hr
O3
)
0.0
0.1
0.2
0.3
0.4
01/01/1993 07/01/1994
360430005
2-7 Feb 2003 SAMSI Multiscale Workshop 41
Region 1 : Northeast Circle radii proportional to (detrended) site variances
2-7 Feb 2003 SAMSI Multiscale Workshop 42
62 Northeast monitoring sites and their representation in a deformed coordinate system reflecting spatial covariance
Thu Jan 30 20:50:30 PST 2003
2-7 Feb 2003 SAMSI Multiscale Workshop 43
N=62, Northeast: 4 samples from the posterior distribution of deformations reflecting spatial covarianceTue Feb 04 18:34:34 PST 2003
2-7 Feb 2003 SAMSI Multiscale Workshop 44
Region 1 Northeast
Geographic Distance (km)
Co
vari
an
ce
0 200 400 600 800 1000 1200
0.0
0.0
00
50
.00
10
0.0
01
5
Region 1 Northeast
D-plane Distance
Co
vari
an
ce
0 200 400 600 800 1000 1200
0.0
0.0
00
50
.00
10
0.0
01
5
2-7 Feb 2003 SAMSI Multiscale Workshop 45
Region 1 Northeast
Geographic Distance (km)
Co
rre
latio
n
0 200 400 600 800 1000 1200
0.0
0.2
0.4
0.6
0.8
1.0
Region 1 Northeast
D-plane Distance
Co
rre
latio
n
0 200 400 600 800 1000 1200
0.0
0.2
0.4
0.6
0.8
1.0
2-7 Feb 2003 SAMSI Multiscale Workshop 46
Work in Progress
• Guidelines for MCMC estimation software, including better calibration of priors for spatial deformation and specification of MCMC proposal distributions.
• Incorporation of the spatio-temporal trend model in the software for spatial prediction.
• Incorporation of (simple) temporal autocorrelation structure.
• More flexible deformation modeling for nonstationary structure at multiple scales: “principal warp” decomposition of thin-plate splines and mappings R2→ Rk.
• Explicit modeling/incorporation of covariate effects, spatial (e.g. topography) and spatio-temporal (e.g. wind).
2-7 Feb 2003 SAMSI Multiscale Workshop 47
Some recent atmospheric science literature and proposals for spatio-temporal covariance models
• Desroziers, 1999, A coordinate change for data assimilation in spherical geometry of frontal structures. Monthly Weather Review.
The main impact of this transformation in the framework of data assimilation is that it enables the use of anisotropic forecast correlations that are flow dependent.
• Riishojgaard, 1998. A direct way of specifying flow-dependent background correlations for meteorological analysis systems. Tellus.
• Weaver and Courtier, 2001. Correlation modelling on the sphere using a generalized diffusion equation. Quar. J. Royal Met Soc.
Generalization to account for anisotropic correlations are also possible by stretching and/or rotating the computational coordinates via a ‘diffusion’ tensor.
• Wu et al., 2002. 3-D variational analysis with spatially inhomogeneous covariances. Monthly Weather Review.
2-7 Feb 2003 SAMSI Multiscale Workshop 48
Incorporating covariates
• Carroll and Cressie, 1997. geomorphic site attributes in correlation model for snow water equivalent in river basins.
1 2 1 2( , ) exp( ) c d e fc s s B s s CX DX EX FX Where X’s represent differences between the two sites in elevation, slope, tree cover, aspect,
Alternative: deform R2 into subspace of R6.
• Riishojgaard, 1998, “flow-dependent” correlation structures for meteorological analysis systems. For z(s), a realization of a random field in Rd,
1 2 1 2 1 1 2, ( ) ( )dc s s s s z s z s
an embedding and deformation of the geographic coordinate space
Rd into Rd+1, with a separable, stationary correlation model fitted in new coordinate space.
2-7 Feb 2003 SAMSI Multiscale Workshop 49
Covariance models for dynamic error structures in the context of data assimilation
• Cox and Isham, 1988: with v a velocity vector in R2, a physical model for rainfall leads to space-time covariance function
1 2 1 2 2 1 2 1( , ; , ) ( ) ( )c s s t t E G s s t t V V
where G(r) denotes area of intersection of two disks of unit radius
with centers a distance r apart.
There are variants in the meteorological and hydrological literature: depending on tangent line in a barotropic model; using geostrophic or semigeostropic coordinates; or working in a Lagrangian reference frame for convective rainstorms. These yield interesting, anisotropic and nonstationary correlation models (cf. Desroziers, 1997). They suggest interesting space-time extensions of current deformation approach and statistical model fitting questions.