some basic statistical modeling issues in...
TRANSCRIPT
Some Basic Statistical ModelingIssues in Molecular and Ocean
DynamicsPeter R. Kramer
Department of Mathematical Sciences
Rensselaer Polytechnic Institute, Troy, NY
and
Institute for Mathematics and Its Applications, Minneapolis, MN
Supported by NSF CAREER Grant DMS-0449717
and CMG OCE-0620956
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 1/29
Overview• StochasticDrift-Diffusion Parameterization of
WaterDynamics near Solute• with Adnan KhanandShekhar Garde
• Statistical mesoscalemodeling for oceanic flows• with Banu BaydilandShafer Smith(Courant,
CAOS)
y1
y 2
Poisson Vortex Streamlines, λ~ = 1
0 1 2 3 4 5 6 7 8 9 100
1
2
3
4
5
6
7
8
9
10
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 2/29
Biological Disclaimer
(www.molecularium.com, S. Gardeet al)
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 3/29
Stochastic Parameterization of Water
Dynamics near Solute
Simplifiedstatisticaldescription of water dynamics aspossible basis for implicit solvent method toaccelerate molecular dynamicssimulations forproteins, etc. (withAdnan Khan(Lahore) andShekhar Garde(Biochemical Engineering))
As a first step, we explore stochastic parameterizationof water nearC60 buckyballmolecule.
• isotropic, chemically simple
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 4/29
Molecular Dynamics Snapshot of Buckyball
Surrounded by Water
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 5/29
Statistical dynamics encoded in biophysical
literature in terms of a diffusion coefficient:
(Makarov et al, 1998; Lounnas et al, 1994,. . .)
DB(r) ≡
⟨
|X(t+ 2τ) − X(t)|2
6τ
∣
∣
∣
∣
∣
X(t) = r
⟩
−
⟨
|X(t+ τ) − X(t)|2
6τ
∣
∣
∣
∣
∣
X(t) = r
⟩
But this seems to mix together inhomogeneities inmeanandrandommotion.
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 6/29
Drift-Diffusion FrameworkWe explore capacity of models of the form
dX = U (X(t)) dt+ Σ(X(t)) dW (t),
for water molecule center-of-masspositionX(t).• drift vector coefficientU (r)
• diffusion tensor coefficientD(r) = 12Σ(r)Σ†(r)
For isometricsolute (buckyball):• U (r) = U‖(|r|)r̂,
• D(r) = D‖(|r|)r̂ ⊗ r̂ +D⊥(|r|)(I − r̂ ⊗ r̂),
for positionr = |r|r̂ relative to center of symmetry.
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 7/29
Physically Inspired (DD-I) Model
In analogy toBrownian dynamicssimulations, take
U (r) = −γ−1∇φ(r),
D(r) = D0I.
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 8/29
Physically Inspired (DD-I) Model
In analogy toBrownian dynamicssimulations, take
U (r) = −γ−1∇φ(r),
D(r) = D0I.
• Potential of mean forceobtained from measuringconcentrationc(r) andBoltzmann distributionc(r) ∝ exp(−φ(r)/kBT ).
• Diffusivity unchanged frombulk value.• Friction coefficientfrom Einstein relationγ = kBT/D0.
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 8/29
Physically Inspired (DD-I) Model
In analogy toBrownian dynamicssimulations, take
U (r) = −γ−1∇φ(r),
D(r) = D0I.
0.4 0.6 0.8 1 1.2 1.4 1.6 1.8−6
−4
−2
0
2
4
6
8
10
12
14Potential of mean force
r (nm)
φ (k
J/ m
ol)
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 8/29
Physically Inspired (DD-I) Model
In analogy toBrownian dynamicssimulations, take
U (r) = −γ−1∇φ(r),
D(r) = D0I.
0.4 0.6 0.8 1 1.2 1.4 1.6 1.8−100
0
100
200
300
400
500
600
700
r (nm)
FM
D (
kJ/ (
mol
nm
))Mean Force (filtered)
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 8/29
Systematic, Data-Driven Parameterization
(DD-II) Model
Parametrize drift and diffusion functions frommathematical definitions:
U‖(|r|) =
limτ↓0
⟨
X(t+ τ) − X(t)
τ· r̂
∣
∣
∣
∣
X(t) = r
⟩
,
D‖(|r|) =
limτ↓0
⟨
∣
∣(X(t+ τ) − X(t)) · r̂ − U‖(r)τ∣
∣
2
2τ
∣
∣
∣
∣
∣
X(t) = r〉 ,
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 9/29
Systematic, Data-Driven Parameterization
(DD-II) Model
Parametrize drift and diffusion functions frommathematical definitions:
D⊥(|r|) =
limτ↓0
⟨
1
4τ|(X(t+ τ) − X(t)) · (I − r̂ ⊗ r̂)|2
∣
∣
∣
∣
X(t) = r〉 .
Obtain statistical data fromMD simulations.
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 9/29
Time Difference τ must be chosen carefully
Takingτ = ∆t (time stepof MD simulation) may notbe appropriate
• Limit τ ↓ 0 implicitly refers to timeslargeenoughfor drift-diffusion approximation to bevalid.
Must chooseTv ≪ τ ≪ Tx, where:• Tv is time scale ofmomentum.• Tx is time scale ofposition.
See alsoPavliotisandStuart(2007) about need toundersample.How chooseτ in practice?
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 10/29
Explore simple Ornstein-Uhlenbeck (OU)
model
dX = V dt,
mdV = −γV dt− αX dt+√
2kBTγ dW (t)
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 11/29
Explore simple Ornstein-Uhlenbeck (OU)
model
dX = V dt,
mdV = −γV dt− αX dt+√
2kBTγ dW (t)
Forces:friction, potential, andthermal.
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 11/29
Explore simple Ornstein-Uhlenbeck (OU)
model
dX = V dt,
mdV = −γV dt− αX dt+√
2kBTγ dW (t)
Nondimensionalize:
dX = V dt,
dV = −aV dt− aX dt+ a dW (t)
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 11/29
Explore simple Ornstein-Uhlenbeck (OU)
model
dX = V dt,
mdV = −γV dt− αX dt+√
2kBTγ dW (t)
Nondimensionalize:
dX = V dt,
dV = −aV dt− aX dt+ a dW (t)
wherea = γ2/(mα) is ratioof position tomomentumtime scale.
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 11/29
Explore simple Ornstein-Uhlenbeck (OU)
model
dX = V dt,
mdV = −γV dt− αX dt+√
2kBTγ dW (t)
Nondimensionalize:
dX = V dt,
dV = −aV dt− aX dt+ a dW (t)
Exact drift-diffusion coarse-graining whena≫ 1:
dX = −X dt+ dW (t)
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 11/29
Explore simple Ornstein-Uhlenbeck (OU)
model
dX = V dt,
mdV = −γV dt− αX dt+√
2kBTγ dW (t)
Nondimensionalize:
dX = V dt,
dV = −aV dt− aX dt+ a dW (t)
What if we try to obtain this from analysis oftrajectorieswith finite but largea?
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 11/29
Drift and diffusion coefficients of exact OU
solution sampled with finite time differenceτ .
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9
−0.9
−0.8
−0.7
−0.6
−0.5
−0.4
−0.3
τ
U|| (
x)⋅ x
/|x|2
Longitudinal Drift Coefficient for OU, a=132
Blue : ExactRed : Asymptotic
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 12/29
Drift and diffusion coefficients of exact OU
solution sampled with finite time differenceτ .
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 10
0.05
0.1
0.15
0.2
0.25
0.3
0.35
0.4
0.45
τ
D||
Longitudinal Diffusivity for OU, a=132
Blue : ExactRed : Asymptotic
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 12/29
Inferences from OU model• Good choice ofτ may be the one which
maximizes drift magnitudeanddiffusivity.• Beginning estimateobtained from OU model
with samea value.
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 13/29
OU Model Insights→ MD Data Parameteriza-
tion in DD-II Model
To obtaintime scales, approximate main well inpotential of mean force byquadratic.
0.6 0.65 0.7 0.75 0.8 0.85 0.9 0.95 1−4
−2
0
2
4
6
8
Potential of mean force
r (nm)
φ (k
J/m
ol)
Red : Buckyball Potential
Green : Quadratic Fit to a characteristic part of the potential
This givesa = 132.
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 14/29
OU Model Insights→ MD Data Parameteriza-
tion in DD-II Model
Examinedrift anddiffusivity computed from variouschoices ofτ .
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 14/29
OU Model Insights→ MD Data Parameteriza-
tion in DD-II Model
Examinedrift anddiffusivity computed from variouschoices ofτ .
0.6 0.65 0.7 0.75 0.8 0.85 0.9 0.95 1 1.05
−0.1
−0.05
0
0.05
0.1
r (nm)
U|| (
nm/p
s)
Longitudinal Drift
Blue : τ=20fsGreen : τ=40 fsRed : τ=60 fsMagenta : τ=80 fsCyan : τ=100 fs
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 14/29
OU Model Insights→ MD Data Parameteriza-
tion in DD-II Model
Examinedrift anddiffusivity computed from variouschoices ofτ .
0.6 0.7 0.8 0.9 1 1.1 1.2 1.30.5
1
1.5
2
2.5
3
3.5
4
4.5
5
5.5
x 10−3
r (nm)
D|| (
nm2 /p
s)
Longitudinal Diffusivity
Blue :τ=60 fsGreen :τ=80 fsMagenta :τ=120 fsCyan :τ=160 fsBlack :τ=200 fsYellow :τ=240 fsRed :τ=280 fs
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 14/29
OU Model Insights→ MD Data Parameteriza-
tion in DD-II Model
Examinedrift anddiffusivity computed from variouschoices ofτ . Both desiderata about drift anddiffusivity behaviornot simultaneously satisfiable.
• Correct bulk diffusivity behavior more important
We chooseτ = 0.2 ps= 200 fs.
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 14/29
Parameterization Used in DD-II Model
0.6 0.8 1 1.2 1.4 1.6
−0.04
−0.02
0
0.02
0.04
0.06
0.08
r (nm)
U|| (
nm/p
s)
Longitudinal Drift
Blue : DD−II outputRed : DD−II input from MD data
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 15/29
Parameterization Used in DD-II Model
0.6 0.7 0.8 0.9 1 1.1 1.2 1.3 1.4 1.50
1
2
3
4
5
6
7x 10
−3
r (nm)
D|| (
nm2 /p
s)
Longitudinal Diffusivity
Blue : DD−II outputRed : DD−II input from MD Data
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 15/29
Parameterization Used in DD-II Model
0.6 0.8 1 1.2 1.4 1.6 1.8
2
3
4
5
6
7
8x 10
−3
r (nm)
D⊥ (
nm2 /p
s)
Lateral Diffusivity
Blue : DD−II outputRed : DD−II input from MD data
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 15/29
Compare Predictions of
Biophysical Diffusivity Formula
DB(r) ≡
⟨
|X(t+ 2τ) − X(t)|2
6τ
∣
∣
∣
∣
∣
X(t) = r
⟩
−
⟨
|X(t+ τ) − X(t)|2
6τ
∣
∣
∣
∣
∣
X(t) = r
⟩
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 16/29
Compare Predictions of
Biophysical Diffusivity Formula
0.6 0.8 1 1.2 1.4 1.6 1.8 2 2.2
0
0.5
1
1.5
r (nm)
DB/D
0
DB (r) for DD−I Model
DDD−I
DMD
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 16/29
Compare Predictions of
Biophysical Diffusivity Formula
0.6 0.8 1 1.2 1.4 1.6 1.8
0
0.2
0.4
0.6
0.8
1
1.2
1.4
r (nm)
DB/D
0
DB (r) for DD−II Model
DMD
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 16/29
Future WorkNext steps
• anisotropies
• chemical heterogeneity
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 17/29
Unresolved Mesoscale Turbulence in Ocean
Circulation Models
Computational ocean modelsfor climate predictionhaveresolution∼ 100 km:
• does not adequately resolvemesoscale turbulentstructureson length scales. 100 km
• even smaller scales. 100 m : Kolmogorovturbulence
We focus on representing turbulent transport byunresolved mesoscale turbulence.
Wind-driven double-gyre2000 km basin-scalecomputational simulation, 5 km resolution, by ShaferSmith.
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 18/29
Mathematical Framework for Transport
∂T (x, t)
∂t+ u(x, t) · ∇T (x, t) = κ∆T (x, t),
T (x, t = 0) = Tin(x)
• Passive scalarfield T (x, t)
• Velocity field u = V + v: large-scalemean flow+ small-scalefluctuations
• “Molecular” diffusioncoefficientκ
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 19/29
Parameterization Problem• Obtain an equation forcoarse-grained〈T 〉:
∂〈T 〉
∂t+ V · ∇〈T 〉 = κ∆〈T 〉 − ∇ · F
〈T 〉(x, t = 0) = Tin(x),
• Turbulent FluxF = 〈v T 〉
• Problem of Parametrization:
F = F(〈T 〉,V )
whereF only involves a few parameters.
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 20/29
Parameterization of Small Scales
One parameterization used in AOS andengineering (Gent and McWilliams1990,Griffies1998,Middleton and Loder1989):
F = −K∗(x, t) · ∇〈T 〉,
with generallynonsymmetricK∗ = S∗ + A∗.• SymmetricpartS∗: variably
enhanced diffusion• AntisymmetricpartA∗:
effective driftU ∗ = −∇ · A∗ relative to meanflow
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 21/29
Parameterization ApproachesWithin the class of “effective diffusivity” or “ eddy
diffusivity” schemes, the way in whichK∗ is modeled differs.
Some examples:
• Mixing-lengththeory:K∗ ∼ ℓv.
• K − ε theory: parameterize based on localenergyand
energy dissipationrate
• Gent-McWilliams: related toslopeof surfaces of constant
potential density
These areempirical, with varying degrees of success.
Ocean circulation models often employ even simpler practice of
choosingK∗ asconstant multiple of identity, tuneda posteriori.
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 22/29
Homogenization-Based Parameterization
Under assumption ofscale separation(not so crazyfor ocean),homogenization theoryprovides rigoroussupport and formula for effective diffusivity(Avellaneda, Majda, McLaughlin, Papanicolaou,Varadhan, Fannjiang, Pavliotis & K):
K∗(x, t) = κ (I − 〈v ⊗ χ〉))
where
∂χ
∂τ+ (V + v) · ∇yχ − κ∆yχ = −v,
is cell problem, solved onsub-grid scalewith frozenmean flowV . 〈·〉 denotes subgrid-scale average.
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 23/29
Computational Homogenization
• Heterogenous multiscale methods (HMM)natural, but perhaps too complex a modificationto existing codes.
• More negotiable:parameterization formulaforK∗ based onstatistical subgrid-scale modelwithsmall numberof nondimensional parameters
• Our approach:build upfrom simple models,combinenumerical solutionswith new andexistingasymptoticresults (Avellaneda, Majda,Vergassola, Bonn, McLaughlin, Kesten,Papanicolaou, etc.)
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 24/29
Example: Dynamical Poisson Vortex Model
(Cinlar , Caglar,. . . )
Stream function:
ψ(y, t) =∑
n
ψvor(y − Y(n)c
(t− ξ(n)), t− ξ(n))
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 25/29
Example: Dynamical Poisson Vortex Model
(Cinlar , Caglar,. . . )
Stream function:
ψ(y, t) =∑
n
ψvor(y − Y(n)c
(t− ξ(n)), t− ξ(n))
composed of simple vortex blobs, e.g.,
v(y, t) =
[
∂ψvor/∂y2
−∂ψvor/∂y1
]
,
ψvor(y, t = 0) = 12v̄vorℓvor
[
1
9
∣
∣
∣
∣
y
ℓvor
∣
∣
∣
∣
3
−1
6
∣
∣
∣
∣
y
ℓvor]
∣
∣
∣
∣
2]
for |y| ≤ ℓvor
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 25/29
Example: Dynamical Poisson Vortex Model
(Cinlar , Caglar,. . . )
Stream function:
ψ(y, t) =∑
n
ψvor(y − Y(n)c
(t− ξ(n)), t− ξ(n))
with centersY(n)c (t) obeying certain simple
dynamical law• Brownian motion• intervortical advection
andψvor(y, t) → 0 for |t| → ∞,
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 25/29
Example: Dynamical Poisson Vortex Model
(Cinlar , Caglar,. . . )
Stream function:
ψ(y, t) =∑
n
ψvor(y − Y(n)c
(t− ξ(n)), t− ξ(n))
vortices nucleated according tospace-time Poissonprocess(η(n), ξ(n)) of prescribed intensityλ, with
Y(n)c (0) = η(n)
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 25/29
Sample Snapshot of Poisson Vortex Field
y1
y 2
Poisson Vortex Streamlines, λ~ = 1
0 1 2 3 4 5 6 7 8 9 100
1
2
3
4
5
6
7
8
9
10
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 26/29
Uncertainty Modeling IssuesHow choosethestatistical model parametersfordynamicalparameterization?
• for observedsubgrid-scale data, can tryparametricstatisticalfitting
• but want statistical subgrid-scale model torepresent unobserved small scales, with onlycoarse-scale data available
• so howpredict small-scale statistical parametersfrom large-scale observations?
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 27/29
References regarding Statistical Modeling in
Molecular Dynamics
• T. Schlick,Molecular Modeling and Simulation:An Interdisciplinary Guide, 2002.
• V. A. Makarov et al,Biophys J., 75 (1998)150–158.• biophysical modelingframework for water
near surface• G. Pavliotis and A. M. Stuart,J. Stat. Phys. 124
(2007) 741–781.• mathematicalframework for computing
coarse-grained coefficients statisticallyfrommicroscale data
Some Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 28/29
References regarding Statistical Modeling in
Ocean Turbulence• Mathematics of Subgrid-Scale Phenomena in
Atmospheric and Oceanic Flows, Institute forPure and Applied Mathematics,www.ipam.ucla.edu/programs/atm2002
• A. J. Majda and P. R. Kramer,Phys. Rep. 314
(1999) 237–574.
• review of somestatistical flow models
• M. Çaglar,et al, J. Atmos. Ocean. Tech. 23
(2006) 1745–1758.
• parameterically fitting observational datato
statistical vortex modelSome Basic Statistical Modeling Issues in Molecular and Ocean Dynamics – p. 29/29