computational biophysics - bioinfosummer 2012 (jason roberts)
DESCRIPTION
Case Study: Atomistic Molecular Dynamics Simulations of PoliovirusTRANSCRIPT
Case Study: Atomistic Molecular Dynamics Simulation of Poliovirus
Jason A. Roberts, Senior Medical Scientist,National Enterovirus Reference Laboratory, WHO Regional Poliomyelitis Reference Laboratory, VIDRLApplied Sciences, RMIT University, Australia.
�Animation –� Creates the illusion of movement
� Simulation –� Tool for testing a hypothesis
� Study of a molecule(s) structure and function using computational methods such as:
� Semi-empirical quantum mechanics
� Atomistic Molecular Dynamics
� Coarse-Grained Molecular Dynamics
� Hybrid methods
� Ultimately these are just models!
0.01
0.10
1.00
10.00
100.00
1,000.00
10,000.00
100,000.00
1992 1994 1996 1998 2000 2002 2004 2006 2008 2010 2012
TF
OL
PS
Year
Supercomputer Peak Speed (RMax)
vs Time (years)
Rmax (TFLOPS)
First 1µs simulation of enzyme folding (BPTI)10,000 atoms (4 months)
Desktop + GPU
Cray Titan
17.59 PFLOPS
iPhone
3.3 million atom poliovirus or
rhinovirus 1µs simulation
(4 months) Eg. Avoca VLSCI 2012
BlueGene/Q using 25% of RMax
4U Server + 8GPU
� BlueGene/Q (VLSCI@25%) = 8.9 days� 65,536 threads (~172 TFLOPS)
� BlueGene/P = 3 months� 2048 cores (~7 TFLOPS)
� 4u GPU Server (CUDA) = 3 months� 32cores + 8GPUs (~10.5 TFLOPS)
� Desktop GPU (CUDA) = 1 year� 12core + 2GPU (~2.2 TFLOPS)
� Modern Laptop = 20 years� 4core i7 (~0.15 TFLOPS)
� iPhone, Android = 300 years� (~0.01 TFLOPS)
� Side-chain behaviour
� Hydrophobicity
� Variations in pH conditions
� Side-chain to side-chain interactionseg. S-S bonds in Cysteine
� Salt Bridges
� Long-range electrostatics (eg. PME)
� Nucleic Magnetic Resonance� Small structures, very high resolution.
� Provides some conformational state data
� X-ray crystallography� High resolution
� Requires production of a crystal
� Cryogenic Electron Microscopy� Biologically representative, relatively low resolution
� Based on existing data attained by traditional methods eg:� Nuclear Magentic Resonance (<1.0 Å)
� X-ray crystallography (0.5-10 Å) most ~2.0 Å
� Cryo-EM (~4-20 Å)
� Able to simulate various biological environments
� Place a static structure in a simulated biological environment
X-Ray Crystal
NMR
CryoEM
PubMed search
Polio – 22,922 (c1879 onwards)
Poliovirus – 14,381 (c1951 onwards)
Picornavirus – 30,708 (c1945 onwards)
RCSB records
– Poliovirus (56)
- 41 X-Ray
- 8 EM
- 7 NMR
– Picornaviridae (207)
- 180 X-Ray
- 19 EM
- 8 NMR
� Optimal match selected manually from RCSB-Protein Data Bank and phylogenetic analysis
� Models created using SWISS-Model with template 1HXS (2.2 angstrom resolution, most complete chain information)
� Matrix data used from template to recreate full capsid using VMD multiseq for 3D alignment and mon2poly script to generate new chains
� Custom parameter files generated using SwissParam website
Figure derived from Fields Virology 6th edition and Roberts, JA ., et al DOI 10.1016/j.jmgm.2012.06.009
� Quantum mechanics� Realistic throughput of ~100 atoms
� Ab-initio modelling (eg. protein folding)� μs – ms timeframe with multiple conformations = very
unreliable and computationally expensive.
� Homology Modelling
% Amino Acid Identity
Unreliable OK – Careful checking Good - Reliable Excellent
30% 60% 90% 100%
� Virus reconstruction
� 241 protein chains (52,812 Amino Acids)
� 120 lipids (60 covalent bonded to N-terminus VP4)
� 7.5 kb RNA genome
� Total virus model >1 million atoms
� Simulation size 4.2 million atoms (Cubic PBC)
� More than 3 million water atoms and ions!
� ~15-40 million CPU hours to simulate 1μs!
Rhombic dodecahedron 3.3 Million atoms
Figure derived from Roberts, JA ., et al DOI 10.1016/j.jmgm.2012.06.009
� Topology file generation� Ion placement� Data transfer and analysis
� Files for creation, simulation and analysis
▪ 0.1μs simulation time = 50GB of files (0.1ns steps)
� Simple calculations (RMSD/F) for protein coat
▪ 10ns trajectory data at 0.1ns time points
▪ 52,800 amino acids x 100 intervals
▪ 5.28 million data pointsExcel spread sheets need not apply
Figure derived from Roberts, JA ., et al DOI 10.1016/j.jmgm.2012.06.009
Figure derived from Roberts, JA ., et al DOI 10.1016/j.jmgm.2012.06.009
Figure derived from Roberts, JA ., et al DOI 10.1016/j.jmgm.2012.06.009
A. Pentamer B. Empty Capsid C. Full Virus
Figure derived from Roberts, JA ., et al DOI 10.1016/j.jmgm.2012.06.009
A. Empty Capsid B. Full Virus
Figure derived from Roberts, JA ., et al DOI 10.1016/j.jmgm.2012.06.009
� exaFLOP (1018) computing� Better scaling via architecture and software
improvements� petaFLOP = >10ns/day full virus� exaFLOP ~= >10μs/day full virus� Simulations approaching billions of atoms� Newer force fields to emulate quantum
mechanics� Increased integration of MD in research
� A/Prof. Bruce Thorley - VIDRL
� Dr. Mike Kuiper - VLSCI
� Dr. Andrew Hung - RMIT University
� Prof. Peter Smooker - RMIT University
� WHO Regional Poliomyelitis Reference Lab
� Tom Aitken
� Aishah Ibrahim
� Linda Hobday