
Post on 17-Dec-2015


Page 1: Mark Thompson Planaria Software Seattle, WA  Molecular Docking With ArgusLab

Mark Thompson

Planaria Software

Seattle, WA

http://www.planaria-software.com

Molecular Docking With ArgusLab

Molecular Docking: A complicated search problem

• Find the optimal ligand/protein configurations and accurately (or at least consistently) predict their binding free energy without using formal statistical mechanics approaches.

• Ligand is flexible.

• Protein binding site is flexible (side chains and protein backbone).

• Do this in under ~5 seconds on a commodity processor (ideally in 1 second or less).

Abstract

We have developed two docking engines and an empirical scoring function in ArgusLab 4.0.

• ShapeDock: shape-based method, approximates exhaustive search.

• GADock: Lamarckian genetic algorithm similar to AutoDock.

• AScore: scoring function based on XScore of Wang and coworkers.

Typical ShapeDock times for ligands with 10-15 torsions are < 30 seconds on a 2.4 GHz Pentium laptop computer.

Our docking code is implemented for both interactive docking and screening of ligand databases.

ArgusLab 4.0

Molecular modeling application that runs on Windows platforms.

ShapeDock: Approximates an exhaustive search
(similarities to Fred, Dock, and Glide)

1. Ligand is described as a torsion tree.
   • Nodes are groups of bonded atoms that do not have rotatable bonds; connections between nodes are torsions.
   • Topology of the tree is crucial to efficient docking. A balanced tree with a large central node is best.

2. Construct two grids that overlay the binding site.
   • Grid points are marked as inside or outside the free volume of the binding site.
   • The fine grid is used to determine if atoms of a pose fragment are inside or outside the binding site.
   • The coarse grid is used to establish the search points inside the binding site.

3. At each "search point of interest":
   • The ligand's root node is placed on a search point and a set of diverse and energetically favorable rotations is created.
   • Translations near the search point are allowed to remove bumps with the target.
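The torsion-tree description in step 1 can be illustrated with a minimal Python sketch. It assumes the ligand is given as a list of bonds flagged as rotatable or not, that rotatable bonds never close a ring, and that the root is simply the largest rigid node; the function name `build_torsion_tree` and the input layout are hypothetical and are not taken from ArgusLab.

```python
from collections import defaultdict, deque

def build_torsion_tree(n_atoms, bonds):
    """bonds: list of (i, j, rotatable) tuples over atom indices 0..n_atoms-1.

    Returns the rigid nodes (sorted atom lists) in breadth-first order from the root.
    """
    # Union-find: atoms joined by non-rotatable bonds end up in the same rigid node.
    parent = list(range(n_atoms))

    def find(a):
        while parent[a] != a:
            parent[a] = parent[parent[a]]
            a = parent[a]
        return a

    for i, j, rotatable in bonds:
        if not rotatable:
            parent[find(i)] = find(j)

    nodes = defaultdict(list)
    for atom in range(n_atoms):
        nodes[find(atom)].append(atom)

    # Rotatable bonds become the edges (torsions) between rigid nodes.
    edges = defaultdict(list)
    for i, j, rotatable in bonds:
        if rotatable:
            a, b = find(i), find(j)
            edges[a].append(b)
            edges[b].append(a)

    # Assumption for this sketch: the root is the largest rigid node.
    root = max(nodes, key=lambda k: len(nodes[k]))

    # Breadth-first traversal gives the order in which torsions would be built.
    order, seen, queue = [], {root}, deque([root])
    while queue:
        node = queue.popleft()
        order.append(sorted(nodes[node]))
        for nxt in edges[node]:
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return order
```

For a six-atom toy ligand with a three-atom core, a two-atom side group, and a single-atom side group, the core comes out first, followed by the two side groups.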

ShapeDock

4. For each rotation, construct the torsions in breadth-first order.
   • Use pre-defined torsion values based on the bond order of the central bond.
   • Use the fine grid to test newly added atoms for bumps with the protein and for intra-ligand contacts, to accept or reject the pose fragment.
   • Allow small torsion adjustments to minimize bumps (rings are treated as rigid).

5. Score pose candidates.
   • Pose candidates are those that survive the torsion search.
   • They are ranked to maintain a set of the N lowest-energy poses (N typically 50-150).
   • Clustering poses as they are found maintains diversity in the final set.

6. Optimize the final set of poses.
   • Coarse minimization of all poses.
   • Re-cluster and rank using a more aggressive cluster cutoff.
   • Minimize the 25 lowest-energy poses more aggressively.
   • Stochastic search of the 25 lowest-energy poses to find nearby minima.
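Step 5 (keeping the N lowest-energy poses while clustering them as they are found) might look roughly like the Python sketch below. It assumes poses are compared by plain coordinate RMSD without any symmetry handling, and that a fixed cutoff defines a cluster; `add_pose`, `n_keep`, and `cutoff` are illustrative names, not ArgusLab parameters.

```python
import math

def rmsd(a, b):
    """Root-mean-square deviation between two equal-length lists of (x, y, z)."""
    total = sum((p - q) ** 2 for u, v in zip(a, b) for p, q in zip(u, v))
    return math.sqrt(total / len(a))

def add_pose(best, score, coords, n_keep=100, cutoff=1.0):
    """Insert a pose into `best`, a list of (score, coords) kept sorted by score.

    Stored poses within `cutoff` angstroms of the new pose count as the same
    cluster; only the lowest-scoring member of a cluster is retained.
    """
    near = [k for k, (_, c) in enumerate(best) if rmsd(coords, c) < cutoff]
    if any(best[k][0] <= score for k in near):
        return best                      # an equal or better representative exists
    for k in reversed(near):             # the new pose replaces worse neighbors
        del best[k]
    best.append((score, coords))
    best.sort(key=lambda pose: pose[0])
    del best[n_keep:]                    # keep only the N lowest-energy poses
    return best
```

Clustering at insertion time means near-duplicate poses never compete for the N slots, which is one way to keep the final set diverse.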

GADock: Lamarckian Genetic Algorithm

Genetic Degrees of Freedom
• Translation
• Rigid-body rotation
• Torsions

Search procedure
• Population of individuals
• Fitness of each is its docking score
• Each generation:
   - Select breeding individuals
   - Mutation
   - Crossover
   - Local minimization
   - Elitism
   - Check for convergence
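The search procedure above can be sketched as a toy Lamarckian genetic algorithm in Python. This is not GADock itself: the genes are plain floats rather than translation, rotation, and torsion values, the local minimizer is a crude random descent, a fixed generation count stands in for the convergence check, and every name and default value is an assumption made for illustration.

```python
import random

def local_minimize(genes, score, rng, step=0.1, iters=20):
    """Crude random-perturbation descent; returns the improved genes."""
    best, best_s = list(genes), score(genes)
    for _ in range(iters):
        trial = [g + rng.uniform(-step, step) for g in best]
        s = score(trial)
        if s < best_s:
            best, best_s = trial, s
    return best

def lamarckian_ga(score, n_genes, pop_size=30, generations=40, elite=2,
                  mutation_rate=0.2, seed=0):
    rng = random.Random(seed)
    pop = [[rng.uniform(-5, 5) for _ in range(n_genes)] for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=score)
        next_pop = [list(ind) for ind in pop[:elite]]        # elitism
        while len(next_pop) < pop_size:
            # Tournament selection of two breeding individuals.
            mom = min(rng.sample(pop, 3), key=score)
            dad = min(rng.sample(pop, 3), key=score)
            # Uniform crossover, gene by gene.
            child = [m if rng.random() < 0.5 else d for m, d in zip(mom, dad)]
            # Mutation: Gaussian perturbation of a fraction of the genes.
            child = [g + rng.gauss(0, 0.5) if rng.random() < mutation_rate else g
                     for g in child]
            # Lamarckian step: the minimized genes are written back to the child.
            next_pop.append(local_minimize(child, score, rng))
        pop = next_pop
    return min(pop, key=score)
```

The "Lamarckian" part is the write-back: the result of the local minimization replaces the child's genes, so improvements found during an individual's lifetime are inherited by later generations.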

Compare GADock & ShapeDock

GADock
• Robust & general
• Slow, hard to define convergence
• Not reproducible (stochastic)
• Can get caught in a local minimum

ShapeDock
• Some ligand/binding site types still cause problems
• Fast!
• Reproducible
• Formally explores all minima

Sample Preparation and Run

Ligand
• Hydrogens added
• Hybridization and AScore atom types assigned

Target
• Crystal waters remain
• Hybridization and AScore atom types assigned

Miscellaneous
• Atom charges not required
• All steps are done automatically inside ArgusLab (no user intervention required). However, manual modifications to the above may be made if desired.

Running the docking calculation:
• Select the ligand and binding site.
• Accept default parameters (grid size and resolution) or modify them.
• Run the docking.

ShapeDock: Typical Timings

2.4 GHz Pentium(R) 4 Dell Inspiron laptop

Target   Ligand            Torsions   Time (sec)
1HPV     VX478             14         21
1HVR     XK263             8          16
4DFR     Methotrexate      9          3
1IEP     Gleevec           7          11
1CBX     Benzylsuccinate   5          5
1STP     Biotin            5          3
3PTB     Benzamidine       0          1

AScore: an empirical scoring function

AScore is based on terms taken from the HPScore piece of XScore [1].

$$\Delta G_{bind} = \Delta G_{vdw} + \Delta G_{hydrophobic} + \Delta G_{H\text{-}bond} + \Delta G_{H\text{-}bond(chg)} + \Delta G_{deformation} + \Delta G_{0}$$

$$\Delta G_{vdw} = C_{VDW}\,\mathrm{VDW}$$

$$\Delta G_{hydrophobic} = C_{hydrophobic}\,\mathrm{HP}$$

$$\Delta G_{H\text{-}bond} = C_{H\text{-}bond}\,\mathrm{HB}$$

$$\Delta G_{H\text{-}bond(chg)} = C_{H\text{-}bond(chg)}\,\mathrm{HB} \quad \text{(chg-chg \& chg-neutral pairs)}$$

$$\Delta G_{deformation} = C_{rotor}\,\mathrm{RT}$$

$$\Delta G_{0} = C_{regression}$$

[1] Wang, R., Lai, L., and Wang, S. "Further development and validation of empirical scoring functions for structure-based binding affinity prediction." J. Comp. Aided Mol. Design 16, 11-26, 2002.
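As a rough illustration of how the terms above combine, the following Python sketch sums the weighted contributions. The coefficient container and its field names are hypothetical, and no fitted AScore coefficient values are given in these slides, so any numbers passed in are placeholders.

```python
from dataclasses import dataclass

@dataclass
class AScoreCoefficients:
    # Field names are illustrative; the fitted values are not stated in the slides.
    c_vdw: float
    c_hydrophobic: float
    c_hbond: float
    c_hbond_chg: float
    c_rotor: float
    c_regression: float

def ascore(vdw, hp, hb, hb_chg, rt, c):
    """Weighted sum of the raw terms, giving an estimate in kcal/mol.

    hb is the H-bond sum over neutral pairs and hb_chg the H-bond sum over
    chg-chg and chg-neutral pairs, each with its own coefficient.
    """
    return (c.c_vdw * vdw
            + c.c_hydrophobic * hp
            + c.c_hbond * hb
            + c.c_hbond_chg * hb_chg
            + c.c_rotor * rt
            + c.c_regression)
```

Because the score is linear in the coefficients, refitting them against experimental binding free energies is an ordinary linear regression once the raw terms are tabulated.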

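AScore

$$\mathrm{HP} = \sum_{i}^{\mathrm{ligand}} \sum_{j}^{\mathrm{protein}} f(d_{ij})$$

The sum is over hydrophobic ligand-protein atom pairs.

$$f(d_{ij}) = \begin{cases} 1.0 & d_{ij} < d_{ij,0} + 0.5\ \text{Å} \\ \tfrac{2}{3}\,(d_{ij,0} + 2.0 - d_{ij}) & d_{ij,0} + 0.5\ \text{Å} < d_{ij} \le d_{ij,0} + 2.0\ \text{Å} \\ 0 & d_{ij} > d_{ij,0} + 2.0\ \text{Å} \end{cases}$$

$$\mathrm{VDW} = \sum_{i}^{\mathrm{ligand}} \sum_{j}^{\mathrm{protein}} \left[ \left(\frac{d_{ij,0}}{r_{ij}}\right)^{8} - 2\left(\frac{d_{ij,0}}{r_{ij}}\right)^{4} \right] + \sum_{i}^{\mathrm{ligand}} \sum_{j}^{\mathrm{ligand}} \left[ \left(\frac{d_{ij,0}}{r_{ij}}\right)^{8} - 2\left(\frac{d_{ij,0}}{r_{ij}}\right)^{4} \right]$$

$d_{ij,0}$ is the sum of the vdW radii of atoms i, j.

The intra-ligand VDW sum excludes 1-2 and 1-3 bonded pairs.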
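The pairwise pieces of HP and VDW are simple enough to write out. The Python sketch below assumes the piecewise hydrophobic weight given above and assumes that each VDW pair term has the 8-4 form used by XScore; the function names are illustrative, and the exclusion of 1-2 and 1-3 intra-ligand pairs is left to the caller.

```python
def hp_weight(d, d0):
    """Hydrophobic weight f(d) for one atom pair; d0 is the sum of vdW radii."""
    if d < d0 + 0.5:
        return 1.0
    if d <= d0 + 2.0:
        return (2.0 / 3.0) * (d0 + 2.0 - d)   # linear ramp from 1.0 down to 0.0
    return 0.0

def vdw_pair(r, d0):
    """8-4 pair term (assumed form): minimum of -1 at r == d0, repulsive below."""
    x = (d0 / r) ** 4
    return x * x - 2.0 * x

def hp_sum(pairs):
    """Sum f(d) over (d, d0) tuples for hydrophobic ligand-protein pairs."""
    return sum(hp_weight(d, d0) for d, d0 in pairs)

def vdw_sum(pairs):
    """Sum the pair term over (r, d0) tuples; the caller drops excluded pairs."""
    return sum(vdw_pair(r, d0) for r, d0 in pairs)
```

The ramp in `hp_weight` reaches 1.0 at d0 + 0.5 Å and 0 at d0 + 2.0 Å, so the weight is continuous across both breakpoints.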

AScore

$$\mathrm{RT} = \sum_{i}^{\mathrm{ligand}} \mathrm{RT}_i$$

$$\mathrm{RT}_i = \begin{cases} 0 & \text{atom } i \text{ not involved in any torsion} \\ 0.5 & \text{atom } i \text{ involved in 1 torsion} \\ 1.0 & \text{atom } i \text{ involved in 2 torsions} \\ 0.5 & \text{atom } i \text{ involved in more than 2 torsions} \end{cases}$$

$$\mathrm{HB} = \sum_{i}^{\mathrm{ligand}} \sum_{j}^{\mathrm{protein}} \mathrm{HB}_{ij}$$

$$\mathrm{HB}_{ij} = f(r_{ij})\, f(\theta_{1,ij})\, f(\theta_{2,ij})$$

$r_{ij}$: distance between donor/acceptor atoms

$\theta_{1,ij}$: angle between donor root-donor-acceptor

$\theta_{2,ij}$: angle between donor-acceptor-acceptor root

Each term varies from 1.0 to 0.0 depending on how close it is to its ideal value. A maximum number of H-bonds per donor/acceptor atom is imposed.
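The rotor term and the product form of a single H-bond contribution translate directly into code. In this Python sketch the per-atom torsion counts and the three geometry factors are assumed to be computed elsewhere; the shapes of the individual f(...) functions are not given in the slides, so they appear only as inputs.

```python
def rotor_term(torsions_per_atom):
    """RT: sum of per-atom weights, given the number of torsions each atom is in."""
    weights = {0: 0.0, 1: 0.5, 2: 1.0}
    # Any count above 2 falls through to the 0.5 weight.
    return sum(weights.get(n, 0.5) for n in torsions_per_atom)

def hbond_pair(f_r, f_theta1, f_theta2):
    """HB_ij as the product of three geometry factors, each in [0.0, 1.0]."""
    for f in (f_r, f_theta1, f_theta2):
        if not 0.0 <= f <= 1.0:
            raise ValueError("each geometry factor must lie in [0, 1]")
    return f_r * f_theta1 * f_theta2
```

Because the H-bond contribution is a product, a single poor factor (for example a badly bent angle) drives the whole contribution toward zero even when the distance is ideal.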

AScore: Differences with XScore

• AScore extends XScore to allow it to be used as the docking objective function.

• Separate H-bond term involving charged donor and/or acceptor groups.

• Max. number of H-bonds per donor/acceptor imposed by uniformly scaling the total found to the maximum number allowed for any given ligand atom.

• Ligand has hydrogens added.

• Hydrogens included in the VDW term.

• Crystal waters retained (but hydrogens not added). H-bonds with crystal waters treated as having ideal H-bond geometry but with a scaling factor fit to experiment.

• H-bonds with target metals treated as ideal geometry, but with a scaling factor fit to experiment.

• SH treated as H-bond donor/acceptor, >S treated as H-bond acceptor.

• Intra-ligand VDW energy included.
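One reading of the "uniform scaling" rule for the maximum H-bond count is sketched below in Python: if the H-bond contributions found for a ligand atom add up to more than the allowed maximum, all of them are multiplied by the same factor so that the total equals the maximum. This interpretation and the function name `cap_hbonds` are assumptions; the slides do not spell out the exact procedure.

```python
def cap_hbonds(contributions, max_hbonds):
    """Scale one atom's H-bond contributions so their sum does not exceed max_hbonds.

    contributions: HB_ij values (each between 0 and 1) for a single ligand atom.
    """
    total = sum(contributions)
    if total <= max_hbonds:
        return list(contributions)        # under the limit: leave untouched
    scale = max_hbonds / total            # same factor applied to every H-bond
    return [c * scale for c in contributions]
```

Scaling uniformly, rather than discarding the weakest H-bonds, keeps the score a smooth function of the pose, which matters when the same function is used as the docking objective.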

Parameterization & Validation (in progress)

• Begin with the published XScore parameters.[1]

• Begin with Wang's data set of 100 protein-ligand structures.[2]

• Remove incorrect structures to get a final training set of 84 structures: 39 hydrophilic, 20 hydrophobic, 25 mixed.

• Modify H-bond parameters & other new parameters to improve the correlation between the score of the x-ray pose and the experimental binding free energy.

Structure Type    Correlation (ΔG_bind vs. ΔG_experiment)   RMSD Binding Affinity (kcal/mol)
Hydrophilic       0.53                                      2.3
Hydrophobic       0.84                                      2.0
Mixed             0.70                                      2.1
All Structures    0.70                                      2.2

[1] Wang, R., Lai, L., and Wang, S. "Further development and validation of empirical scoring functions for structure-based binding affinity prediction." J. Comp. Aided Mol. Design 16, 11-26, 2002.

[2] Wang, R., Lu, Y., and Wang, S. "Comparative Evaluation of 11 Scoring Functions for Molecular Docking." J. Med. Chem. 2003, 46, 2287-2303.

Parameterization & Validation

Dock the training set using the ShapeDock engine.

Structure Type    Correlation (ΔG_bind vs. ΔG_experiment)   RMSD Binding Affinity (kcal/mol)   Ave. RMSD (Å)
Hydrophilic       0.43                                      2.4                                1.4
Hydrophobic       0.80                                      2.2                                1.9
Mixed             0.61                                      2.4                                1.7
All Structures    0.64                                      2.3                                1.6
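The two affinity statistics reported in the validation tables can be computed as in the Python sketch below. It assumes that "Correlation" means the Pearson coefficient and that the RMSD is taken over predicted versus experimental binding free energies; neither definition is stated explicitly in the slides, and the function names are illustrative.

```python
import math

def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

def rms_error(predicted, experimental):
    """Root-mean-square difference between predicted and measured values."""
    n = len(predicted)
    return math.sqrt(sum((p - e) ** 2 for p, e in zip(predicted, experimental)) / n)
```

A high correlation with a large RMS error would indicate a systematic offset or scale error in the score rather than scatter, which is why the two numbers are worth reading together.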

Trial Study: Influenza Virus Neuraminidase [1]

• Glycoprotein enzyme that cleaves sialic acid residues from maturing virus particles.
• Eleven conserved residues make up the binding site.
• Dominated by H-bonding & charge-charge group interactions (e.g. carboxyl : guanidino).

DANA    GANA

100,000 x increase in binding affinity

-10.2 kcal/mol    -11.8 kcal/mol    ~3x enhancement

[1] Ingo Muegge. "The Effect of Small Changes in Protein Structure on Predicted Binding Modes of Known Inhibitors of Influenza Virus Neuraminidase: PMF-Scoring in Dock4." Med. Chem. Res. 9, 1999, 490-500.

Neuraminidase Dockings: ShapeDock

9 of the 10 structures reproduced the experimental binding mode.

Correlation of predicted and measured binding affinities: R^2 = 0.70

Ave. RMSD = 1.55 Å

[Plot: AScore score (kcal/mol) versus log IC50]

Docking in ArgusLab 4.0

• ShapeDock and GADock engines (IDockEngine interface, DockEngineFactory, etc).

• AScore scoring function with modifiable parameter set (IScore interface).

• Easy to make the ligand and binding site groups with one mouse click.

• Dock ligand as flexible, rigid, or using only selected torsions.

• Score current pose, optimize current pose, and full docking.

• Scoring function pre-evaluated on one or more scoring grids.

• Database docking supports SDF file as ligand database (IDataSource).

• Efficient reuse of scoring and docking grids allows the user to interactively modify the ligand or choose a new ligand and quickly dock new structures.

• Results summarized in an external file and in a tree-view. User can click on poses to view details.
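The interface names in the list above (IDockEngine, IScore, DockEngineFactory) suggest an engine/scorer separation with a factory in front. The Python sketch below shows that general pattern only; ArgusLab itself is a C++ program, and every method name, signature, and stand-in class here is hypothetical rather than its actual API.

```python
from abc import ABC, abstractmethod

class IScore(ABC):
    """Scoring-function interface (method name is hypothetical)."""
    @abstractmethod
    def score(self, pose):
        ...

class IDockEngine(ABC):
    """Docking-engine interface (method name is hypothetical)."""
    @abstractmethod
    def dock(self, candidate_poses, scorer, n_keep):
        ...

class SumScore(IScore):
    """Stand-in scorer: a pose is a list of numbers and its score is their sum."""
    def score(self, pose):
        return sum(pose)

class SortingEngine(IDockEngine):
    """Stand-in engine: ranks pre-generated poses instead of searching."""
    def dock(self, candidate_poses, scorer, n_keep):
        return sorted(candidate_poses, key=scorer.score)[:n_keep]

class DockEngineFactory:
    """Maps engine names to classes so callers need not know the concrete type."""
    _registry = {}

    @classmethod
    def register(cls, name, engine_cls):
        cls._registry[name] = engine_cls

    @classmethod
    def create(cls, name):
        return cls._registry[name]()

DockEngineFactory.register("sorting", SortingEngine)
```

With this split, swapping the search engine or the scoring function is a one-line change for the caller, which fits the interactive "score, optimize, or fully dock" workflow described in the list.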

ArgusLab Capabilities

• 3D interactive molecule builder & viewer

• Computational experiments
   - QM: Extended Hückel, semi-empirical (MNDO, AM1, PM3), ZINDO, and ab initio (via interface to Gaussian 98/03).
   - MM: Universal Force Field (UFF), CVFF, AMBER, custom force fields for research. Polarizable molecular mechanics, Rappe & Goddard's charge equilibration scheme for UFF.
   - Geometry optimizations, electronic excited states, MD simulations, free-energy perturbation, and potential of mean force.
   - QM/MM and QM/MMpol.
   - Molecular docking.

• Properties & misc.: dipole moments, atom charges, transition properties, surface properties, animate normal modes, view dock poses, ribbons, solvent-accessible surfaces, SCRF solvent effects, explicit solvent, periodic boundary conditions, Ewald sums, etc.

• Manage/organize results: treeview tool for editing structures and viewing results. Results and structures can be saved in an ArgusLab XML file.

ArgusLab Architecture

• Multi-document interface, multi-threaded.

• Written in C++ (some old legacy C code is wrapped in C++).

• Uses OpenGL for graphics, Win32 API for windowing system.

• Garbage collection for graphics objects, events, etc.

• Custom hash tables & containers in addition to use of STL.

• Custom Model-View-Controller (MVC) transport layer.

• 3D editor built on a command processor model (supports undo/redo).

Installed User Base

• ~20,000 downloads/licenses.

• Popular in university teaching programs and with students (free).

• Used in several industrial settings.
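The command processor model mentioned for the 3D editor is a standard way to get undo/redo. The Python sketch below shows the generic pattern only, assuming a molecule represented as a plain list of atom labels; the class names are hypothetical and say nothing about how ArgusLab's C++ editor is actually structured.

```python
class AddAtom:
    """Command that appends one atom label to a molecule (a plain list here)."""
    def __init__(self, molecule, atom):
        self.molecule = molecule
        self.atom = atom

    def execute(self):
        self.molecule.append(self.atom)

    def undo(self):
        self.molecule.pop()   # commands are undone in reverse order, so pop is safe

class CommandProcessor:
    """Runs commands and keeps undo/redo stacks."""
    def __init__(self):
        self._undo = []
        self._redo = []

    def run(self, command):
        command.execute()
        self._undo.append(command)
        self._redo.clear()    # a new edit invalidates the redo history

    def undo(self):
        if self._undo:
            command = self._undo.pop()
            command.undo()
            self._redo.append(command)

    def redo(self):
        if self._redo:
            command = self._redo.pop()
            command.execute()
            self._undo.append(command)
```

Every edit goes through `run`, so the editor never mutates the structure directly, and the history is just the stack of executed commands.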

Score the PDBbind Database

Score the 786 structures from the PDBbind database [1] (14 incorrect structures were removed from the original 800 in the database).

PDBbind Database   Correlation (ΔG_bind vs. ΔG_experiment)   RMSD Binding Affinity (kcal/mol)
786 Structures     0.47                                      2.9

[1] Wang, R., Fang, X., Lu, Y., and Wang, S. "The PDBbind Database: Collection of Binding Affinities for Protein-Ligand Complexes with Known Three-Dimensional Structures." J. Med. Chem. 2004, 47, 2977-2980.