![Page 1: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis](https://reader033.vdocuments.us/reader033/viewer/2022042222/5ec8588ad42b5d5f21665921/html5/thumbnails/1.jpg)
DNA-protein electrostatic recognition: analysis of Protein Data Bank structures of DNA-protein complexes
A. G. Cherstvy1, A. B. Kolomeisky2, and A. A. Kornyshev3
1 Institut für Festkörperforschung, Theorie-II, Forschungszentrum Jülich, D-52425 Jülich, Germany
2 Department of Chemistry, Rice University, Houston, Texas 77005 , USA
3 Department of Chemistry, Faculty of Natural Sciences, Imperial College London, SW7 2AZ, London, UK
![Page 2: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis](https://reader033.vdocuments.us/reader033/viewer/2022042222/5ec8588ad42b5d5f21665921/html5/thumbnails/2.jpg)
Introduction to protein-DNA interaction and recognition
• DNA-protein recognition is vital for many biological processes (e.g., gene expression and regulation)
• Extreme diversity of proteins: humans ∼500 000 proteins, ∼ 25 000 genes.
• Protein classes: gene regulatory (transcription factors), repair proteins, structural proteins (histoneshistoneshistoneshistones), processing proteins (RNA Poly), etc.
• Main interactions: hydrogen bonding (HB), electrostatic electrostatic electrostatic electrostatic (DNA/proteins), hydrophobic, van der Waals forces.
• Protein recognition motifs: helix-turn-helix, zinc finger, leucine zipper.
• Complex and rather probabilistic code of DNA-protein recognition.
• Protein-DNA binding affinity: DNA sequenceDNA sequenceDNA sequenceDNA sequence, pH, [salt], T, helper proteins, DNA 3D conformation, etc.
• Physical mechanisms behind electrostatic DNA-protein interactions.
![Page 3: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis](https://reader033.vdocuments.us/reader033/viewer/2022042222/5ec8588ad42b5d5f21665921/html5/thumbnails/3.jpg)
Electrostatic DNA-protein interactions: lac repressor
Enormous dependence of lac repressor association binding constant K on [salt]
Upon sliding, condensed cations are removed in front and they bind back on DNA behind the protein.
specific: 7-8 non-specific: 11
R B. Winter et al., Biochem., 20 6961 (1981) M.T. Record et al., Biochem., 16 4791 (1977)
Winter & von Hippel: Electrostatic DNA-protein interactions are largely sequence non-specific !?
![Page 4: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis](https://reader033.vdocuments.us/reader033/viewer/2022042222/5ec8588ad42b5d5f21665921/html5/thumbnails/4.jpg)
Hydrogen bonds with DNA bases: DNA-protein recognition code
• HB donors and acceptors determine the unique code of DNA-protein HB interactions; HB strength is 1-5 kBT
• HB formation preferences in DNA-protein complexes: Arg NH1/NH2 and Lys NZ with O6 and N7 of Guanine, Asn and Gln with Adenine, Glu and Asp with Cytosine.
N. M. Luscombe et al., Nucl. Acids Res., 29 2860 (2001)
![Page 5: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis](https://reader033.vdocuments.us/reader033/viewer/2022042222/5ec8588ad42b5d5f21665921/html5/thumbnails/5.jpg)
Electrostatic potential of lac repressor
• Protein residues Lysine (pKa=10), Arginine (12), Histidine (6.5) are in close proximity to DNA phosphates
• DNA-induced charge patterns on proteins that are recognized by DNA?
Non-specific: 1osl.pdb Specific 1l1m.pdb
PositiveNegative
MDL Chime 6 Protein Explorer 2.80
![Page 6: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis](https://reader033.vdocuments.us/reader033/viewer/2022042222/5ec8588ad42b5d5f21665921/html5/thumbnails/6.jpg)
Positive protein charges “love” DNA: sequence specificity of interactions?
Oxygens OD1/OD2 of Asp and OE1/OE2 of Glu.
Nitrogens NZ on Lys, NH1/NH2 on Arg, and
ND1 on His NCP stability ([salt])
![Page 7: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis](https://reader033.vdocuments.us/reader033/viewer/2022042222/5ec8588ad42b5d5f21665921/html5/thumbnails/7.jpg)
W. Kabsch et al., Nucl. Acids Res., 10 1097 (1982)W. K. Olson et al., PNAS, 95 11163 (1998)
B-DNA charge and structure non-ideality-1 e0 per each 1.7 Å along DNA axis
Major groove
Minor groove
H=34Å
2a=20Å
DNA corrugated structure is recognized by proteins
![Page 8: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis](https://reader033.vdocuments.us/reader033/viewer/2022042222/5ec8588ad42b5d5f21665921/html5/thumbnails/8.jpg)
R
Protein
DNA
s1 s2
sph
Model
• Extract atomic coordinates from PDB files of protein-DNA complexes (Math 6)
• Identify closest protein N+ charges, R~lB~7Å
• s1,2 on the samesamesamesame DNA strand; DNA direction
• Histogram of s1-s2 distribution
• If uniform distr. ⇒ no DNA sequence specificity
• Two-peaks distr. ⇒ protein N+ follow DNA P-
• As 3D DNA structure is sequence specific, individual P- are tracked by Lys and Arg
• Complementary DNA-protein interaction lattices
• Sequence-specific electrostatic interactionssph~7Å
![Page 9: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis](https://reader033.vdocuments.us/reader033/viewer/2022042222/5ec8588ad42b5d5f21665921/html5/thumbnails/9.jpg)
Arginine: N CA C O CB CG CD NE CZ NH1 NH210th and 11th atoms are N
Lysine: N CA C O CB CG CD CE NZ9th atom is N
Histidine: N CA C O CB CG ND1 CD2 CE1 NE27th atom is N, charged or neutral
Protein positive residues and DNA negative charges
DNA: PO4- groups
![Page 10: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis](https://reader033.vdocuments.us/reader033/viewer/2022042222/5ec8588ad42b5d5f21665921/html5/thumbnails/10.jpg)
Nucleosomes: DNA-wrapping proteins of eukaryotes
K. Luger et al., Nature, 389 251 (1997)
![Page 11: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis](https://reader033.vdocuments.us/reader033/viewer/2022042222/5ec8588ad42b5d5f21665921/html5/thumbnails/11.jpg)
-4 -2 0 2 4 60
5
10
15
20
s1-s2, Þ
N+
,1ao
i
-4 -2 0 2 4 602468
101214
s1-s2, Þ
N+
,1f6
6
-4 -2 0 2 4 60
5
10
15
s1-s2, Þ
N+
,1kx
3
-4 -2 0 2 40
2
4
6
8
10
12
s1-s2, Þ
N+
,2cv
5
-4 -2 0 2 4 60
2
4
6
8
10
s1-s2, Þ
N+
,2nq
b-4 -2 0 2 4 6
02468
101214
s1-s2, Þ
N+
,1eq
z
-4 -2 0 2 4 60
2
4
6
8
10
12
s1-s2, Þ
N+
,1p3
4
-4 -2 0 2 4 60
2
4
6
8
10
s1-s2, Þ
N+
,1m
18
-4 -2 0 2 40
10
20
30
40
50
s1-s2, Þ
N+
,Sum
-4 -2 0 2 4 602468
101214
s1-s2, Þ
N+
,2nz
d
-4 -2 0 2 4 60
2
4
6
8
10
12
s1-s2, Þ
N+
,2f8
n
-4 -2 0 2 4 602468
101214
s1-s2, Þ
N+
,1u3
5
-4 -2 0 2 4 60
5
10
15
20
s1-s2, Þ
N+
,1kx
5
-4 -2 0 2 4 60
2
4
6
8
10
12
s1-s2, Þ
N+
,2fj
7
-4 -2 0 2 4 602468
101214
s1-s2, Þ
N+
,2py
o 147 bp: 1kx5, 2fj7, 2pyo
145 bp: 1nzd, 2f8n, 1u35
Canonical 146 bp NCP: 1aoi, 1f66, 1kx3, 2cv5, 2nqb, 1eqz. 1p34, 1m18
Sum of all complexes: frog, human, fruit fly, chicken NCPs
Results for s1-s2 in NCPs: 75-100 N+ close, 160-230 in totaldepleted
![Page 12: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis](https://reader033.vdocuments.us/reader033/viewer/2022042222/5ec8588ad42b5d5f21665921/html5/thumbnails/12.jpg)
-6 -4 -2 0 2 4 60
5
10
15
20
25
s1-s2, Þ
N+
,Sum
Prokaryotic DNA-bending proteins also reveal two peaks
Complexes analyzed:2np2, 1ihf, 1p51,1p71,
1p78, 1ouz, 1owf, 1owg
U-turn like severe bending of DNA
![Page 13: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis](https://reader033.vdocuments.us/reader033/viewer/2022042222/5ec8588ad42b5d5f21665921/html5/thumbnails/13.jpg)
Main DNA-binding motifs of proteins
Helix-turn-Helix,λλλλ repressorrepressorrepressorrepressor, 1lmb.pdb:
2 α-helices in major groove, HB with DNA bases,ES with phosphates
Zinc finger, Zif268Zif268Zif268Zif268, 1aay.pdb: 3 α-helices in major groove,
each finger recognizes 3 bps,HB+ES
Leucine zipper, GCN4GCN4GCN4GCN4, 1ysa.pdb: 2 consecutive major grooves are recognized by 2 long bound α-
helices, HB+ES
PositiveNegative
![Page 14: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis](https://reader033.vdocuments.us/reader033/viewer/2022042222/5ec8588ad42b5d5f21665921/html5/thumbnails/14.jpg)
-4 -2 0 2 40
10
20
30
40
s1-s2, Þ
N+
,Sum
-4 -2 0 2 4 60
5
10
15
20
25
s1-s2, Þ
N+
,Sum
-4 -2 0 2 405
101520253035
s1-s2, Þ
N+
,Sum
Basic DNA-binding protein motifs: uniform distributions and no sequence-specificity
zinc fingers leucine zippers lac-, lambda-, 434-, cro-, arc-repressor-like complexes
1aay, 1a1l, 1p47, 1jk1, 1jk2, 1a1f, 1a1g, 1a1j, 1a1k, 1a1h, 1a1i, 1zaa, 1g2f, 1g2d, 1f2i, 1llm, 1mey, 1ubd, 1tf3, 2jp9,
2gli, 3dfx
1ysa, 2c9l, 2c9n, 2h7h, 1d66, 1fos,
1gu5, 1hjb,
Repressors (1osl, 1l1m, 2bjc, 1cjg; 1lmb, 3bdn, 6cro, 1lli, 1rio; 1par,
1bdt, 1bdv, 2bnz, 2cax; 1au7, 2or1, 1per, 3cro, 1rpe, 2p5l, 1gt0, 1hf0, 1ic8, 1o4x, 2r1j) and CAP proteins (1cgp, 1zrc, 1zrd, 1zre, 1zrf, 1o3q,
1o3r, 1o3s, 1j59, 1run, 2cgp),
![Page 15: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis](https://reader033.vdocuments.us/reader033/viewer/2022042222/5ec8588ad42b5d5f21665921/html5/thumbnails/15.jpg)
Conclusions and outlook
• For largelargelargelarge DNA-protein complexes, NCP and HU, tracking of individual DNA phosphates. • DNA-induced sph~7 Å charge periodicity along DNA-protein interfaces.• Up to 100 charge-charge contacts, large 10-30 kBT energy profit due to complementarityof DNA-protein charge lattices. • Recognition of native and strongly bound DNA sequences.• Nucleosome positioning on DNA, together with sequence-specific DNA bending code.• 146 vs. 145/147 bp DNA NCPs. Different DNA affinities to histones? No data.
• For smallsmallsmallsmall complexes, with 5-10 ES contacts, no statistical preference and weak/no sequence specificity of ES interactions. • Electrostatics is weak and other interactions contribute to recognition (HB).
• Interplay of HB+ES : future research.
![Page 16: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis](https://reader033.vdocuments.us/reader033/viewer/2022042222/5ec8588ad42b5d5f21665921/html5/thumbnails/16.jpg)
Thank you
Grant CH 707/2-1
• A. G. Cherstvy, A. B. Kolomeisky, and A. A. Kornyshev, J. Phys. Chem. B, 112 4741 (2008).
• A. G. Cherstvy, J. Phys. Chem. B, 113 4242 (2009).
![Page 17: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis](https://reader033.vdocuments.us/reader033/viewer/2022042222/5ec8588ad42b5d5f21665921/html5/thumbnails/17.jpg)
E. Segal et al., Nature 442 772 (2006)
NCP positioning code
![Page 18: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis](https://reader033.vdocuments.us/reader033/viewer/2022042222/5ec8588ad42b5d5f21665921/html5/thumbnails/18.jpg)
hydrogen bonding hydrophobic electrostatic
Lac repressor contacts with DNA
C. G. Kalodimos et al., Science, 305 386 (2004)
Electrostatic contacts are believed to be sequence-nonspecific, while hydrogen bonding is highly sequence specific
![Page 19: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis](https://reader033.vdocuments.us/reader033/viewer/2022042222/5ec8588ad42b5d5f21665921/html5/thumbnails/19.jpg)
Spiraling RNA Polymerase: protein binding requires DNA
groove tracking
K. Sakata-Sogawa et al., PNAS, 101 14731 (2004)
Sliding vs. Spiralingelectrostatic barriers vs. hydrodynamic friction
• Theory of Schurr for lac spiraling: 100 times stronger hydrodynamic drag and smaller D1: D1=5×10-9 cm2/s
• Old experiments (Blomberg): D1=3×10-9 cm2/s
M. J. Schurr, Biophys. Chem., 9 41 (1979)
A.G. Cherstvy and R.G. Winkler, J. Chem. Phys., 120 9394 (2004)
![Page 20: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis](https://reader033.vdocuments.us/reader033/viewer/2022042222/5ec8588ad42b5d5f21665921/html5/thumbnails/20.jpg)
h
R
h
Protein charge array
DNA charge array
z
K
δm
∆n
Model of DNA-protein recognition: charge complementarity
• Random charge displacements mimic bp specific nonideality of DNA/protein structure
• Long-range correlations zn=nh+∆n
• Recognition region -- similar charge variations ∆n = δm -- stronger DNA-protein attraction
• Potential well near the homology region
PositiveNegative
![Page 21: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis](https://reader033.vdocuments.us/reader033/viewer/2022042222/5ec8588ad42b5d5f21665921/html5/thumbnails/21.jpg)
Artificial charge periodicity in protein DNA-binding domains
• Periodicity of ≈ 7 Å and ≈34 Å is expected from PDB data analysis.
• Next step: backbone elasticity + DNA helicity + PDB files analysis + computer simulations of protein diffusion
34 Å
7 Å
PO4−
![Page 22: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis](https://reader033.vdocuments.us/reader033/viewer/2022042222/5ec8588ad42b5d5f21665921/html5/thumbnails/22.jpg)
np
r
λ konkoff
protein
DNA
target
x
nads
Macroscopic qualitative model of protein diffusion in DNA coil
• Every cycle: 3D diffusion in solution + 1D sliding along DNA
• [Protein] in solution cp=np/V and on DNA cads=nads/V
• Volume of DNA coil ∼Lr2
PositiveNegative
![Page 23: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis](https://reader033.vdocuments.us/reader033/viewer/2022042222/5ec8588ad42b5d5f21665921/html5/thumbnails/23.jpg)
Mechanisms of protein diffusion on DNA
J. Marko et al., Nucl. Acids Res., 32 3040 (2004)
Sliding/1D diffusion Hopping Inter-segmental transfer (loop-facilitated process)
Actual diffusion is a combination of these basic steps
![Page 24: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis](https://reader033.vdocuments.us/reader033/viewer/2022042222/5ec8588ad42b5d5f21665921/html5/thumbnails/24.jpg)
Time of target search: 3D + 1D
0 0
exp[ ( )]exp[ ( ')] '
( )
x z
c
G zdz G z dz
D z
λ βτ β
+
= −∫ ∫
expon p p eff
eff
off ads ads B
k n c Ey y
k n c k T
= = =
3
1
, 0( )
,
D z xD z
D x z x λ
< <=
< < +
0, 0( )
, .eff
z xG z
E x z x λ
< <= − < < +
3 1
2 2
12 2c
eff
x
D y
x
D D
λτ
λ= + +
1/
c
ads
L
n
α
τ τλ
van Kampen: Mean First Passage Time for 1 cycle
Diffusion coefficient profile
Non-equilibrium protein adsorption constant on DNA;equilibrium: y=kon/koff
Free energy profile: no DNA bp specificity
Total search time along DNA of length L: α=1: random protein attachment every stepα>1: super-diffusion
3D + 1D + correlation term (missing previously)protein unbinding before travelling length λ on DNA
![Page 25: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis](https://reader033.vdocuments.us/reader033/viewer/2022042222/5ec8588ad42b5d5f21665921/html5/thumbnails/25.jpg)
2 2pn Lx Lr V= =
3 12 2
21 1 2,on p off ads
free ads
D Dk c k c
xτ τ λ= = = =
ads
r yd
nλ =
11
3
1 1 22
p
p ads ads adsp
nLr r L
D n n r n d nn yd
αλτ
λ λ
− = + +
2
3 3
12 2S
p p
Lr
D ac D anτ = =
11
3/ 2
21 p
S ads d pa s adsn y
n ya L
r n yd n d r n ydd
αττ
− = + +
1
3
1D
dD
= <<
Total search time vs. Smoluchovski time
L scales out
Rates of protein binding and unbinding
Optimal sliding length λ
Smoluchovski 3D diffusion rate to a drain of radius a
Final ratio of search times
12
S p
a d
r dy n
ττ
+= At equilibrium, yeff=1, d<<1,
correlation term
/ px r n= Length of 3D path
![Page 26: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis](https://reader033.vdocuments.us/reader033/viewer/2022042222/5ec8588ad42b5d5f21665921/html5/thumbnails/26.jpg)
Minimal search time at intermediate y and np values
a=1 nm, r=30 nm, α=1, na=1000, np=1, d=0.001
Weak attraction to DNA
Strong attraction: long λineffective 1D search only
Unbinding drift is strong, λ is short
Always exist proteins close to the target in solution
• As nads grows, τ decreases -- parallel search of DNA by many proteins
• Dotted curves: without correlation term -- wrong results
d=0.001
• Diffusion times faster than Smoluchovski
![Page 27: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis](https://reader033.vdocuments.us/reader033/viewer/2022042222/5ec8588ad42b5d5f21665921/html5/thumbnails/27.jpg)
Part 2: Electrostatic key-lock mechanism of protein-DNA recognition
![Page 28: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis](https://reader033.vdocuments.us/reader033/viewer/2022042222/5ec8588ad42b5d5f21665921/html5/thumbnails/28.jpg)
Electrostatic DNA-protein interaction and recognition energy
( ) ( ) ( )0
22 20
0m n
Niqh m n iqiqz
el
m N nc
eW dq K q R e e e
δκε π
∞∞ − −∆
−∞=− =−∞
= − + ∑ ∑∫
g=2π/h, <∆n2>=∆2, <δm
2>=δ2, Ω2=δ2+∆2
( ) ( ) [ ]
( ) ( ) ( )
2 2 2
2 2
22 2 2 / 20
0 0 01
22 2 / 20
0 * 00
22 cos
2cos 1
n g
el long rangenc
q
c
eW K R K n g R e ngz
b
edq K q R q z z
M
M e
κ κε
κπε
∞− Ω
−=
∞ − Ω
= − + + −
+ − −
∑
∫
( )
( )5/ 22
2 2
2
2 22
long range B
B c
W z l
k T R
RM
z
zεε
− Ω ∆
+ ∆
−∆ ∆≈ −
z=z0
zz=z*
M=2N+1 charges
General electrostatic interaction energy
Averaged
Simple form for Ω<<1, κ=0
0 *z z z∆ = − ( )20 /B Bl e k Tε=
Recognition energy
![Page 29: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis](https://reader033.vdocuments.us/reader033/viewer/2022042222/5ec8588ad42b5d5f21665921/html5/thumbnails/29.jpg)
Electrostatic recognition energy ∆W
• Well is accompanied by the barriers• Well depth is several kBT
• Narrow wells: no “funnels” for protein diffusion • Screening makes wells shallower• Well depth d grows linearly with M• d scales as 1/R3 at κ=0 and as e-κR with salt
o o o2 211, 10A, 2, 3.4A, , 1AcM R hε δ= = = = = ∆ Ω =
o
1/ 7 Aκ =
0κ =
max 3/ 2z R=
0 *z z z∆ = −
![Page 30: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis](https://reader033.vdocuments.us/reader033/viewer/2022042222/5ec8588ad42b5d5f21665921/html5/thumbnails/30.jpg)
Protein residence time in the well
• Wells of ∼kBT in depth slow down protein diffusion
• Enough time to provoke protein conformational changes (µs- ms) and to induce stronger protein binding to DNA
• ES DNA-protein recognition is the first step of protein docking
• Stronger Hydrogen Bonding interactions can be formed afterwards
Kramers <escape time>
van Kampen M-F-P-Time
τc
![Page 31: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis](https://reader033.vdocuments.us/reader033/viewer/2022042222/5ec8588ad42b5d5f21665921/html5/thumbnails/31.jpg)
Thank you
cond-mat: 0708.0021
![Page 32: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis](https://reader033.vdocuments.us/reader033/viewer/2022042222/5ec8588ad42b5d5f21665921/html5/thumbnails/32.jpg)
Funny energy barriers: Coulomb case
δ
R
∆z
( )
( ) ( )( )
( )( )
( )
2 2
2 20 0
2 2 22
2 2200
3/ 2 5/ 22
2
2 2 2
2
( )
0
2
2
el el el
c c
TaylorExpansion
el
c c
R R z
e eW W W
R z R z
ee zW
R z
R z R zδ
δ δ
δε ε δ
δδ
ε ε
= + ∆ +
∆ = − = −+ ∆ + ∆ +
∆∆ ≈
+ ∆
− ∆+
+ ∆
At R2>2∆z2 fluctuations of charges always reduce their attraction energy
R
∆z
δ
⊥
⊥
||||||||
-4 -2 0 2 4
-0.20
-0.15
-0.10
-0.05
0.00
x and z
z only
x only
W (
arb
. u
nits)
z (arb. units)
Computer Simulations
||
⊥ + ||
![Page 33: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis](https://reader033.vdocuments.us/reader033/viewer/2022042222/5ec8588ad42b5d5f21665921/html5/thumbnails/33.jpg)
Electrostatic DNA-protein interactions: lac repressor
Enormous dependence of lac repressor association binding constant K on [salt]
Upon sliding, condensed cations are removed in front and they bind back on DNA behind the protein.
specific: 7-8 non-specific: 11
R B. Winter et al., Biochem., 20 6961 (1981) M.T. Record et al., Biochem., 16 4791 (1977)
Electrostatic DNA-protein interactions are largely sequence non-specific? [complex]
M[DNA][protein]
K = ⋅
![Page 34: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis](https://reader033.vdocuments.us/reader033/viewer/2022042222/5ec8588ad42b5d5f21665921/html5/thumbnails/34.jpg)
1 5 10 50 100 5001000
Ll1
5
10
50
100
500
1000
t t S
Simple computer test
Compl
ete D
NA cov
erage
First p
assa
ge ti
me to t
he ta
rget
• single protein hopping randomly to left/right• random target location• random protein attachment point • average over 5 runs• L=20000, λ = 50, 100, 200
![Page 35: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis](https://reader033.vdocuments.us/reader033/viewer/2022042222/5ec8588ad42b5d5f21665921/html5/thumbnails/35.jpg)
Interaction-induced folding and conformational adaptation
C. G. Kalodimos et al., Science, 305 386 (2004)
![Page 36: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis](https://reader033.vdocuments.us/reader033/viewer/2022042222/5ec8588ad42b5d5f21665921/html5/thumbnails/36.jpg)
Lac repressor: D1 << D3
• Brownian Protein Motion with large D1 variations • Extract D1 from Mean Square Displacements of proteins• Experiment (Austin): D1: D1=2×10-10 cm2/s • Experiment D3: D3=4×10-7 cm2/s
1MSD( 1) 2n D t= ≈ ∆
R. Austin et al., Phys. Rev. Lett., 97 048302 (2006)
![Page 37: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis](https://reader033.vdocuments.us/reader033/viewer/2022042222/5ec8588ad42b5d5f21665921/html5/thumbnails/37.jpg)
DNA loops formed by lac repressor
![Page 38: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis](https://reader033.vdocuments.us/reader033/viewer/2022042222/5ec8588ad42b5d5f21665921/html5/thumbnails/38.jpg)
Electrostatic potential of RNA Polymerase II
R. D. Kornberg et al., Science, 292 1863 (2001)
![Page 39: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis](https://reader033.vdocuments.us/reader033/viewer/2022042222/5ec8588ad42b5d5f21665921/html5/thumbnails/39.jpg)
Looping uncharged elastic rods: buckling instability
Elasticity theory: 2D and 3D elastica, Euler and Kirchhoff -- local balance of forces and moments
Excess twist energy Etw turns into loop bending energy Eb
Looping of submarine cables [J. Coyne, IEEE J. Ocean. Ing., 15 72 (1990)]
FF
2ΛA
C=kBTltw -- twist modulus, ltw=750 ÅB= kBTlp -- bend modulus, lp=500 Å
Λ=√B/F, A2=1-C2τ2/(4BF)K2(s)=4FA2/cosh[As]2 -- curve curvature
Eb=4FAΛ -- loop bending energy
F0>C2τ2/(4B) -- force to keep cable straight
∆L=4AΛ -- cable slack upon looping
Twist rate τ
Tw =Lτ
Every loop removes about 2π of the excess twist Tw: τ = τ 0 - 2π/L
![Page 40: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis](https://reader033.vdocuments.us/reader033/viewer/2022042222/5ec8588ad42b5d5f21665921/html5/thumbnails/40.jpg)
Looping charged rods: limitations of OSF theory
Optimal loop shape in 3D is a complicated problem: non-locality, self-contacts.
Numerical summation of the Debye-Hückel potentials along the loop contour
∆Eel=Eellooped – Eel
straight
Eel of loops with Debye-Hückel interactions: OSF electrostatic rod stiffening works only
for large loops R>>1/κ with no close contacts
lp→ lp,el = lp + lB/(4κ2h2)
08 Bl nκ π=( )2
r
el
eE r e
r
κ
ε−=
1/κ≈10Å in physiological solutionScreened interactions
of charges
h is charge-charge separation
Applicability of OSF to tight DNA loops