protein structure determinationand our software tools

45
Protein structure determination and our software tools Mark Berjanskii Edmonton Februrary 2015

Upload: mark-berjanskii

Post on 07-Jan-2017

453 views

Category:

Science


0 download

TRANSCRIPT

Page 1: Protein structure determinationand our software tools

Protein structure determination

and our software tools Mark Berjanskii

Edmonton Februrary 2015

Page 2: Protein structure determinationand our software tools

Outline

1) X-ray crystallography

2) Cryo-electron microscopy (Cryo-EM)

3) NMR spectroscopy

4) Mass spectrometry

5) MS23D

Page 3: Protein structure determinationand our software tools

Why do we need to know protein

structure?

Page 4: Protein structure determinationand our software tools

Why do we need to know protein

structures?

1) Prediction of protein function from 3D structure (e.g. fold,

motifs, active site prediction)

2) Mechanism of protein function (e.g. enzyme catalysis,

structural effect of known mutations).

3) Rational drug design

4) Design of novel proteins with novel function. Sequence-to-

function and sequence-to-structure predictions

1) Ubiquitin

- degradation by the proteasome,

2) Ubiquitin-like modifiers

- function regulation by post-translation

modification

Page 5: Protein structure determinationand our software tools

X-ray crystallography

Page 6: Protein structure determinationand our software tools

X-ray crystallography

Quality metrics: 1) Experimental data: -Number of reflections -Signal to noise ratio 2) Model-to-experiment agreement: - R factor -R free factor

3) Coordinate uncertainty: - B-factor 4) Stereo-chemical normality: - backbone torsion angles (Ramachandran plot) - bond length, angles - side-chain torsion angles

Page 7: Protein structure determinationand our software tools

X-ray Resolution

Minimum spacing (d) of crystal lattice planes that still provide measurable diffraction of X-rays. Minimum distance between structural features that can be distinguished in the electron-density maps.

High resolution Low resolution High resolution

Low resolution

200,000 reflections

500 reflections Many reflections Few reflections

Page 8: Protein structure determinationand our software tools

Resolution and protein quality

Page 9: Protein structure determinationand our software tools

X-ray resolution by proxy

ResProx would be able to detect

most of the withdrawn X-RAY

structures from the Murthy lab

ResProx vs X-ray resoltion

Page 10: Protein structure determinationand our software tools

Number of protein structures

per year

Page 11: Protein structure determinationand our software tools

Cryo-electron microscopy

(Cryo-EM)

Page 12: Protein structure determinationand our software tools

Cryo-electron microscopy

Image formation in the electron

microscope.

(a) Electrons, emitted by a source that is

housed under a high

vacuum, are accelerated down the

microscope column . After passing through

the specimen, scattered electrons are

focused by the electromagnetic lenses of

the microscope

(b) Schematic illustrating the principle of

data collection for electron tomography. As

the specimen is tilted relative to the

electron beam, a series of images is

taken of the same field of view.

(c) Rendering of selected projection views

generated during cryo-electron tomography

Page 13: Protein structure determinationand our software tools

3D image from Cryo-EM

Page 14: Protein structure determinationand our software tools

Examples of Cryo-EM images

(a,b) Illustration of spiral architecture of

the nucleoid in Bdellovibrio

bacteriovorus showing

(a) a 210 Å thick tomographic slice

through the 3D volume of a cell

(b) a 3D surface rendering of the same

cell, with the spiral nucleoid

highlighted

(c) Higher magnification view of a

tomographic slice through the cell,

showing well-separated nucleoid

spirals and ribosomes (dark dots)

distributed at the edge of the

nucleoid.

(d) Expanded views of 210 Å thick

tomographic slices, showing top-

views of polar chemoreceptor arrays.

Page 15: Protein structure determinationand our software tools

Cryo-EM revolution in structural

biology

Page 16: Protein structure determinationand our software tools

Cryo-EM can now achieve a resolution

necessary for de novo structure

determination

Page 17: Protein structure determinationand our software tools

Cryo-EM structures <5Å

Page 18: Protein structure determinationand our software tools

Examples of “high-resolution”

de novo structures from Cryo-EM

A) transient receptor

potential cation

channel subfamily V

member 1 (TRPV1) ion

channel

B) F420-reducing [NiFe]

hydrogenase

C) large subunit of the

yeast mitochondrial

ribosome

D) γ-secretase.

Page 19: Protein structure determinationand our software tools

NMR spectroscopy

Page 20: Protein structure determinationand our software tools

Protein NMR spectroscopy Experiment Spectra processing Spectra assignment

NOE assignment

Distance restraints

Model generation

Page 21: Protein structure determinationand our software tools

Resolution of NMR structures

Macromolecular NMR spectroscopy for the non-spectroscopist. Kwan AH, Mobli M, Gooley PR, King GF, Mackay JP. FEBS J. 2011 Mar;278(5):687-703

Page 22: Protein structure determinationand our software tools

Protein NMR structures from

Wishart group 2B0F

Human Rhinovirus

3C Protease

1DE1

Oxidized bacteriophage

T4 glutaredoxin.

1DE2

Reduced bacteriophage

T4 glutaredoxin.

1NHO

Thioredoxin-like protein

(Mt0807)

1Z9V

MTH0776

Page 23: Protein structure determinationand our software tools

Secondary structure from NMR chemical

shifts PANAV

s

CSI

Page 24: Protein structure determinationand our software tools

Torsion angles from NMR

chemical shifts

Page 25: Protein structure determinationand our software tools

Accessible surface area from

NMR chemical shifts

Page 26: Protein structure determinationand our software tools

Prediction of NMR chemical

shifts from structure

Page 27: Protein structure determinationand our software tools

Protein model building from NMR data

Page 28: Protein structure determinationand our software tools

Validation of NMR protein models

Page 29: Protein structure determinationand our software tools

Mass-spectrometry

Page 30: Protein structure determinationand our software tools

Distance restraints from MS cross-

linking experiments

Distance restraints

Model generation

Page 31: Protein structure determinationand our software tools

Residue accessibility by MS

limited proteolysis

Page 32: Protein structure determinationand our software tools

Secondary structure localization

by MS HD exchange

Page 33: Protein structure determinationand our software tools

Problem Many structural solutions may be compatible

with few restraints and solvent exposure info

1 restraint

2 restraints

3 restraints

Trypsin-inhibitor complex (1TAB)

Lys222E-Lys16I, Lys224E-Lys16I, and

Lys60E-Lys31I (21Å links)

Probing native protein structures by chemical cross-

linking, mass spectrometry and bioinformatics. Leitner A,

Walzthoeni T, Kahraman A, Herzog F, Rinner O, Beck M,

Aebersold R. Mol Cell Proteomics. 2010 Mar 31.

Page 34: Protein structure determinationand our software tools

How many distance restraints

do we need?

For 2,3,4:

Young, M. M., Tang, N., Hempel, J. C., Oshiro, C. M., Taylor, E. W., Kuntz, I. D., Gibson, B.

W., and Dollinger, G. (2000) High throughput protein fold identification by using

experimental constraints derived from intramolecular cross-links and mass

spectrometry. Proc. Natl. Acad. Sci. U. S. A. 97, 5802-5806

1) Atomic resolution - 10- 20 restraints per residue (NMR)

2) Residue resolution – 3 restraint per residue

3) Fold – 1/10 restraint per residue ( = protein length/10 )

4) Complex of rigid bodies – 3 restraint per complex

5) Experimentally biased comparative / ab initio model – 1 restraint

Distance restraint requirements for different levels of

structure determination

Page 35: Protein structure determinationand our software tools

One contact per 12 residues is

enough to model protein topology

Page 36: Protein structure determinationand our software tools

MS-based structure determination

requires knowledge-based information

Cross-

links

Residue exposure

Advanced

Force-field:

solvation term

full electrostatic

knowledge-based

potentials

Fragment

information

Homology

information

Page 37: Protein structure determinationand our software tools

Disulfide restraints can bias conformational

search for BPTI towards the native state No restraints Three disulfide restraints

Native No restraints

RMSD= 8.2A

3 disulfide restraints

RMSD= 2.14A

BPTI from disulfide distance restraints

Page 38: Protein structure determinationand our software tools

PrP 90–232 modeled using the interlysine

cross-link distance constraints.

Mol Cell Proteomics. 2012 Jul;11(7): Use of proteinase K nonspecific digestion for selective and

comprehensive identification of interpeptide cross-links: application to prion proteins.

Petrotchenko EV1, Serpa JJ, Hardie DB, Berjanskii M, Suriyamongkol BP, Wishart DS, Borchers CH.

Page 39: Protein structure determinationand our software tools

N-terminus of PrP 68-228 has propensity to interact with the end of helix B, which is the first PrP region to unfold at low pH

HB

HC HA

HC HA HB

HA

HC

pH 5.2 pH 3.2

CA of N-terminal residue Gly68 is shown with blue spheres.

Page 40: Protein structure determinationand our software tools

MS-GAMDy

Page 41: Protein structure determinationand our software tools

Rigid-body docking in Cartesian space by XPLOR

Monomer A Monomer B

Monomer B backbone optimization

Monomer A backbone optimization

Docking

Dimer backbone optimization

Distances for monomer A

Starting model for monomer A

Distances for monomer B

Starting model for monomer B

Inter distances for dimer

Page 42: Protein structure determinationand our software tools

XPLOR rigid-body docking with initial alignment by distance restraints

1 min, 64 structures with no restraint violations from 64

Page 43: Protein structure determinationand our software tools

Beta-strand pairing - 5 beta-strands Unknowns: 1) parallel or anti-parallel 2) interacting residues 3) internal or external

Shift Partial coverage

assign ( ( resid 1 OR resid 2 OR resid 3 OR resid 4 OR resid 7 OR resid 6 OR resid 5 ) and name N ) ( ( resid 12 OR resid 13 OR resid 14 OR resid 17 OR resid 16 OR resid 15 ) and name O ) 2.8 0.8 0.2 assign ( ( resid 1 OR resid 2 OR resid 3 OR resid 4 OR resid 7 OR resid 6 OR resid 5 ) and name O ) ( ( resid 12 OR resid 13 OR resid 14 OR resid 17 OR resid 16 OR resid 15 ) and name N ) 2.8 0.8 0.2 assign ( ( resid 1 OR resid 2 OR resid 3 OR resid 4 OR resid 7 OR resid 6 OR resid 5 ) and name N ) ( ( resid 66 OR resid 67 OR resid 68 OR resid 71 OR resid 70 OR resid 69 ) and name O ) 2.8 0.8 0.2 assign ( ( resid 1 OR resid 2 OR resid 3 OR resid 4 OR resid 7 OR resid 6 OR resid 5 ) and name O ) ( ( resid 66 OR resid 67 OR resid 68 OR resid 71 OR resid 70 OR resid 69 ) and name N ) 2.8 0.8 0.2 assign ( ( resid 41 OR resid 42 OR resid 43 OR resid 45 OR resid 44 ) and name N ) ( ( resid 66 OR resid 67 OR resid 68 OR resid 71 OR resid 70 OR resid 69 ) and name O ) 2.8 0.8 0.2 assign ( ( resid 41 OR resid 42 OR resid 43 OR resid 45 OR resid 44 ) and name O ) ( ( resid 66 OR resid 67 OR resid 68 OR resid 71 OR resid 70 OR resid 69 ) and name N ) 2.8 0.8 0.2 assign ( ( resid 41 OR resid 42 OR resid 43 OR resid 45 OR resid 44 ) and name N ) ( ( resid 48 OR resid 49 ) and name O ) 2.8 0.8 0.2 assign ( ( resid 41 OR resid 42 OR resid 43 OR resid 45 OR resid 44 ) and name O ) ( ( resid 48 OR resid 49 ) and name N ) 2.8 0.8 0.2

24 possible arrangements of beta-strands into a beta-sheets via XPLOR ambiguous restraints Example:

RMSD = 0.6A Convergence ~ 20%

Page 44: Protein structure determinationand our software tools

Fragment-based modelling

MQIFVKTLTGKTITLEVEPSDTIENVKAKIQDKEGIPPDQQR MQIFVKTLTGKTITLEVEPSDTIENVKAKIQDKEGIPPDQQR MQIFVKTLTGKTITLEVEPSDTIENVKAKIQDKEGIPPDQQR MQIFVKTLTGKTITLEVEPSDTIENVKAKIQDKEGIPPDQQR MQIFVKTLTGKTITLEVEPSDTIENVKAKIQDKEGIPPDQQR MQIFVKTLTGKTITLEVEPSDTIENVKAKIQDKEGIPPDQQR

Torsion angle restraints

Distance restraints

CS23D SFAssembler

Homodeller

Starting model pool

Ubiquitin from 2FAZA 4.86A

Page 45: Protein structure determinationand our software tools

Template-derived distance restraints

82 N-O restraints

1UBQ GAMDy model0.6A

Ubiquitin from 2FAZA 4.86A

82 N-O restraints

GAMDy model 4.8A

GAMDy model 2A