beyond teraflops toward petaflops 2 october 2002 computational science and engineering department...

24
1 Beyond Teraflops toward Petaflops 2 October 2002 onal Science and Engineering Department Daresbury La Beyond Teraflops toward Petaflops Beyond Teraflops toward Petaflops Computational Chemistry: Computational Chemistry: Challenges and Opportunities Challenges and Opportunities Martyn F. Guest and Paul Sherwood Martyn F. Guest and Paul Sherwood CCLRC Daresbury Laboratory CCLRC Daresbury Laboratory [email protected] [email protected]

Upload: randolph-johnston

Post on 31-Dec-2015

218 views

Category:

Documents


0 download

TRANSCRIPT

1 Beyond Teraflops toward Petaflops 2 October 2002

Computational Science and Engineering Department Daresbury Laboratory

Beyond Teraflops toward PetaflopsBeyond Teraflops toward PetaflopsComputational Chemistry:Computational Chemistry:

Challenges and OpportunitiesChallenges and Opportunities

Martyn F. Guest and Paul SherwoodMartyn F. Guest and Paul Sherwood

CCLRC Daresbury LaboratoryCCLRC Daresbury Laboratory

[email protected]@daresbury.ac.uk

2 Beyond Teraflops toward Petaflops 2 October 2002

Computational Science and Engineering Department Daresbury Laboratory

OutlineOutline From Gigaflops to Teraflops - Computational Chemistry TodayFrom Gigaflops to Teraflops - Computational Chemistry Today

Migration from replicated to distributed dataMigration from replicated to distributed data Parallel linear algebra (diagonalisation, FFT etc.)Parallel linear algebra (diagonalisation, FFT etc.) Exploiting multiple length and time scalesExploiting multiple length and time scales

From Teraflops to PetaflopsFrom Teraflops to Petaflops Problem scaling and re-formulation - Four DimensionsProblem scaling and re-formulation - Four Dimensions

Time to Solution, Problem Size, Enhanced Sampling, AccuracyTime to Solution, Problem Size, Enhanced Sampling, Accuracy Long Term - New horizons for simulationLong Term - New horizons for simulation

““Simulation of whole systems, and not just system components”Simulation of whole systems, and not just system components” Scientific Challenges in Key Application areasScientific Challenges in Key Application areas

Catalytic Processes, Biomolecular Simulations, Heavy Particle DynamicsCatalytic Processes, Biomolecular Simulations, Heavy Particle Dynamics““Current Status”, “Towards Petaflops”, “New Horizons”Current Status”, “Towards Petaflops”, “New Horizons”

The Software ChallengeThe Software Challenge Recommendations and SummaryRecommendations and Summary

3 Beyond Teraflops toward Petaflops 2 October 2002

Computational Science and Engineering Department Daresbury Laboratory

1 2 4 8 16 32 64 128 2561

2

4

8

16

32

64

128

256

CPUs per SMP

SM

Ps

per

Clu

ster

How Today’s codes exploit Today’s HardwareHow Today’s codes exploit Today’s Hardware

10GF

100GF

1TF

10TF

Shared Memory

Shared Memory

Distrib

uted

M

emory

Distrib

uted

M

emory

HPCxHPCx

GaussianGaussian

GAMESS-UKGAMESS-UK

NWChemNWChem

DL_POLY 3DL_POLY 3

CHARMMCHARMM

NAMD NAMD

4 Beyond Teraflops toward Petaflops 2 October 2002

Computational Science and Engineering Department Daresbury Laboratory

High-End Computational ChemistryHigh-End Computational ChemistryThe NWChem SoftwareThe NWChem Software

Capabilities Capabilities (Direct, Semi-direct and conventional):(Direct, Semi-direct and conventional): RHF, UHF, ROHFRHF, UHF, ROHF using up to 10,000 basis functions; analytic 1st using up to 10,000 basis functions; analytic 1st

and 2nd derivatives.and 2nd derivatives. DFTDFT with a wide variety of local and non-local XC potentials, using with a wide variety of local and non-local XC potentials, using

up to 10,000 basis functions; analytic 1st and 2nd derivatives.up to 10,000 basis functions; analytic 1st and 2nd derivatives. CASSCFCASSCF; analytic 1st and numerical 2nd derivatives.; analytic 1st and numerical 2nd derivatives. Semi-direct and RI-based MP2Semi-direct and RI-based MP2 calculations for RHF and UHF wave calculations for RHF and UHF wave

functions using up to 3,000 basis functions; analytic 1st derivatives functions using up to 3,000 basis functions; analytic 1st derivatives and numerical 2nd derivatives.and numerical 2nd derivatives.

Coupled cluster, CCSD and CCSD(T)Coupled cluster, CCSD and CCSD(T) using up to 3,000 basis using up to 3,000 basis functions; numerical 1st and 2nd derivatives of the CC energy. functions; numerical 1st and 2nd derivatives of the CC energy.

Classical molecular dynamics and free energy simulations with the forces obtainable from a variety of sources

5 Beyond Teraflops toward Petaflops 2 October 2002

Computational Science and Engineering Department Daresbury Laboratory

Memory-driven Approaches: NWChem - DFT (LDA) Memory-driven Approaches: NWChem - DFT (LDA) Performance on the SGI Origin 3800Performance on the SGI Origin 3800

• DZVP Basis (DZV_A2) and DgaussDZVP Basis (DZV_A2) and Dgauss A1_DFT Fitting basis: A1_DFT Fitting basis:

AO basis: AO basis: 3554 3554 CD basis:CD basis: 1271312713

• MIPS R14k-500 CPUs (Teras)MIPS R14k-500 CPUs (Teras)

Wall time (13 SCF iterations):Wall time (13 SCF iterations):64 CPUs = 5,242 seconds64 CPUs = 5,242 seconds128 CPUs= 3,451 seconds128 CPUs= 3,451 seconds

Est. time on 32 CPUs = Est. time on 32 CPUs = 40,00040,000 secs secs

Zeolite ZSM-5Zeolite ZSM-5

• 3-centre 2e-integrals = 1.00 X 103-centre 2e-integrals = 1.00 X 10 12 12

• Schwarz screening = 5.95 X 10Schwarz screening = 5.95 X 10 10 10

• % 3c 2e-ints. In core = 100%% 3c 2e-ints. In core = 100%

Fock matrix (N = 3888)

0

20

40

60

80

100

120

16 32 64 128 256 512

PeIGS 2.1PeIGS 3.0PDSYEV (Scpk 1.5)PDSYEVD (Scpk 1.7)BFG-Jacobi (DL)

Number of processors

Tim

e (s

ec)

6 Beyond Teraflops toward Petaflops 2 October 2002

Computational Science and Engineering Department Daresbury Laboratory

Multiple Time and Length ScalesMultiple Time and Length Scales

QM/MM - first step towards multiple length scalesQM/MM - first step towards multiple length scales QM treatment of the active siteQM treatment of the active site

reacting centrereacting centre problem structures (e.g. transition metal centres)problem structures (e.g. transition metal centres) excited state processes (e.g. spectroscopy)excited state processes (e.g. spectroscopy)

Classical MM treatment of environmentClassical MM treatment of environment enzyme structure, zeolite framework, explicitenzyme structure, zeolite framework, explicit

and/or dielectric solvent modelsand/or dielectric solvent models

Multiple time scale algorithms for MDMultiple time scale algorithms for MD Recompute different parts of energy expression at different intervals e.g. Recompute different parts of energy expression at different intervals e.g.

variants of the Reference System Propagation Algorithm (RESPA)variants of the Reference System Propagation Algorithm (RESPA)

But to date length / time scales only differ by ~ 1 order of magnitude For an example of an effort to link the atomistic and meso-scales see RealityGrid: http://www.realitygrid.org/information.html

7 Beyond Teraflops toward Petaflops 2 October 2002

Computational Science and Engineering Department Daresbury Laboratory

• QM region 35 atoms (DFT BLYP) – include residues with possible proton donor/acceptor roles – GAMESS-UK, MNDO, TURBOMOLE

• MM region (4,180 atoms + 2 link)– CHARMM force-field, implemented in CHARMM, DL_POLY

Triosephosphate isomerase (TIM)

• Central reaction in glycolysis, catalytic interconversion ofDHAP to GAP

• Demonstration case within QUASI (Partners UZH, and BASF)

Triosephosphate isomerase (TIM)

• Central reaction in glycolysis, catalytic interconversion ofDHAP to GAP

• Demonstration case within QUASI (Partners UZH, and BASF)

QM/MM Applications

1467

1030

1487

714

802

540

778

419

508

308

428

274

196

257

213

0

400

800

1200

1600

8 16 32 64

CS7 AMD K7/1000 + SCI

CS9 P4/2000 + Myrinet 2k

SGI Origin3800/R14k-500

AlphaServer SC ES45/1000

Measured Time (seconds)

T T 128128 (O3800/R14k-500) = 181 secs (O3800/R14k-500) = 181 secs

8 Beyond Teraflops toward Petaflops 2 October 2002

Computational Science and Engineering Department Daresbury Laboratory

From Teraflops to PetaflopsFrom Teraflops to Petaflops Short Term - Problem Scaling and Re-formulationShort Term - Problem Scaling and Re-formulation

Approaches to efficient exploitation of larger systemsApproaches to efficient exploitation of larger systemsOpportunities for more realistic modellingOpportunities for more realistic modellingNeed to avoid dependency on continued scaling of existing algorithmsNeed to avoid dependency on continued scaling of existing algorithms

Scientific Targets: Catalysis, Enzymes and Biomolecules, Heavy Particle Scientific Targets: Catalysis, Enzymes and Biomolecules, Heavy Particle DynamicsDynamics

Long Term - New HorizonsLong Term - New Horizons ““Simulation of whole systems, and not just system components”Simulation of whole systems, and not just system components” Automated problem solutionAutomated problem solution

Focus on parallel supercomputers build with commodity compute Focus on parallel supercomputers build with commodity compute servers tied by high performance communication fabricservers tied by high performance communication fabric

New PetaOPs architecture ProjectsNew PetaOPs architecture Projects IBM’s Blue LightIBM’s Blue Light

cellular architecture with 10cellular architecture with 1055 or more CPUs; intended to be general purpose or more CPUs; intended to be general purpose

IBM’s Blue GeneIBM’s Blue Gene collocation of 32 CPUs, 8MB RAM on same chip (may scale to 10collocation of 32 CPUs, 8MB RAM on same chip (may scale to 1066 CPUs) CPUs) application specific, protein foldingapplication specific, protein folding

9 Beyond Teraflops toward Petaflops 2 October 2002

Computational Science and Engineering Department Daresbury Laboratory

“ “State of the Art”State of the Art”

Tera-flop computingTera-flop computing

~ 1000 Processors~ 1000 Processors

TimeTime

Same model but exploit faster executionSame model but exploit faster execution• Longer timescale for simulations• Interactive exploration! Limited scalability of current algorithms! Limited scalability of current algorithms! Long timescales demand higher accuracy! Long timescales demand higher accuracy

Size Size

More Accurate More Accurate MethodsMethods

• Better forcefields• Increased use of ab-initio methods• Higher level QM• QM/MM, DFT replacing semi-empirical methods• Finer numerical grids

!! Accuracy of Accuracy of methods tends to methods tends to increase slowly increase slowly with costwith cost !! Most major Most major challenges involve challenges involve larger systemslarger systems

Sampling Sampling

Study many configurations or systems at onceStudy many configurations or systems at once• Better Statistics, Free energies• Combinatorial methods• MC Ensembles!! Often satisfied by cheaper, commodity systemsOften satisfied by cheaper, commodity systems

Accuracy Accuracy

Larger problemsLarger problems

• Distributed data methods can exploit large global memories

!! Many algorithms Many algorithms contain serious contain serious bottlenecks, e.g. bottlenecks, e.g. diagonalisation diagonalisation O(NO(N33))

!! Sampling confor-Sampling confor-mational space mational space becomes harder becomes harder with system sizewith system size

10 Beyond Teraflops toward Petaflops 2 October 2002

Computational Science and Engineering Department Daresbury Laboratory

What is Petaflop Computational What is Petaflop Computational ChemistryChemistry

Consider large classical simulationsConsider large classical simulations start from a typical 100,000 particle biomolecule + water start from a typical 100,000 particle biomolecule + water cost per timestep ~50 seconds on 1GF (peak) processorcost per timestep ~50 seconds on 1GF (peak) processor MD simulations of order 0.5MD simulations of order 0.5s will require approx 100,000,000 stepss will require approx 100,000,000 steps Cost 5,000 Pflops per state point i.e. 1.5 hours on a Pflop machineCost 5,000 Pflops per state point i.e. 1.5 hours on a Pflop machine Energy Scales as scales as O(N log (N)) and the equilibration time as Energy Scales as scales as O(N log (N)) and the equilibration time as

O(N O(N 5/35/3)) 1,000,000 particle simulation (for 25 us) will take 840 Pflop hour 1,000,000 particle simulation (for 25 us) will take 840 Pflop hour

(complex membrane protein)(complex membrane protein)

Quantum SimulationsQuantum Simulations Assuming a cost of 5 hours on 1 Gflop processor, 1 day on a 1 Pflop Assuming a cost of 5 hours on 1 Gflop processor, 1 day on a 1 Pflop

resource will simulate 25 nanoseconds of motion, corresponding to the resource will simulate 25 nanoseconds of motion, corresponding to the equilibration time of a 10,000 atom system.equilibration time of a 10,000 atom system.

11 Beyond Teraflops toward Petaflops 2 October 2002

Computational Science and Engineering Department Daresbury Laboratory

Modelling of Catalysis Modelling of Catalysis 1. Current Status1. Current Status

ToolsTools Classical simulationClassical simulation

equilibrium structures and transport propertiesequilibrium structures and transport properties QM simulationsQM simulations

reactivity of molecular models, surface structures, supercellsreactivity of molecular models, surface structures, supercells QM/MM methodsQM/MM methods

solvent, ligand and lattice effects on local chemistrysolvent, ligand and lattice effects on local chemistry

Scientific DriversScientific Drivers From mechanisms to reaction ratesFrom mechanisms to reaction rates From simplified models to multi-component systems, defect sitesFrom simplified models to multi-component systems, defect sites

Collaboration QUASI (Quantum Simulation in Industry) Collaboration QUASI (Quantum Simulation in Industry) see http://www.cse.clrc.ac.uk/qcg/quasisee http://www.cse.clrc.ac.uk/qcg/quasi

12 Beyond Teraflops toward Petaflops 2 October 2002

Computational Science and Engineering Department Daresbury Laboratory

Modelling of CatalysisModelling of Catalysis 2. Towards Petaflops2. Towards Petaflops

Extended dynamical simulation, solvent, counterions etc Extended dynamical simulation, solvent, counterions etc

Advanced forcefields (polarisabilities, cross-terms etc)Advanced forcefields (polarisabilities, cross-terms etc) More extensive use of quantum methods (large DFT More extensive use of quantum methods (large DFT

clusters and periodic supercells)clusters and periodic supercells)

Combinatorial approach to catalyst formulationCombinatorial approach to catalyst formulation Quantum dynamics & tunnelling Quantum dynamics & tunnelling viavia path integral methods path integral methods Finite difference approach to local surface vibrationsFinite difference approach to local surface vibrations

ab-initioab-initio treatment of larger surface domains treatment of larger surface domains

TimeTime

SizeSize

Sampling Sampling

AccuracyAccuracy

13 Beyond Teraflops toward Petaflops 2 October 2002

Computational Science and Engineering Department Daresbury Laboratory

Require integrated models spanning time and length scales.Require integrated models spanning time and length scales. Initially the interface between the length and time scales will be via Initially the interface between the length and time scales will be via

construction of parametric modelsconstruction of parametric models There is also the possibility that lower-level data might be computed on There is also the possibility that lower-level data might be computed on

demand as required to sustain the accuracy of the large-scale simulationdemand as required to sustain the accuracy of the large-scale simulation

Reactor geometriesReactor geometries

Diffusion of reactants and productsDiffusion of reactants and products

Heterogeneous surface structures (films etc)Heterogeneous surface structures (films etc)

Reaction ratesReaction rates

Detailed chemical energeticsDetailed chemical energetics

Modelling of CatalysisModelling of Catalysis3. New Horizons3. New Horizons

14 Beyond Teraflops toward Petaflops 2 October 2002

Computational Science and Engineering Department Daresbury Laboratory

Biomolecular Simulation Biomolecular Simulation 1. Current Status 1. Current Status

ToolsTools Classical SimulationClassical Simulation

Simple but well established forcefields for proteins, nucleic acids, Simple but well established forcefields for proteins, nucleic acids, polysacharrides etc polysacharrides etc

QM and QM/MMQM and QM/MMenzyme reaction energetics, ligand bindingenzyme reaction energetics, ligand binding

Continuum electrostaticsContinuum electrostaticsPoisson-Boltzmann, Generalised BornPoisson-Boltzmann, Generalised Born

Statistical MechanicsStatistical Mechanics Scientific TargetsScientific Targets

Accurate free energies for more complex systemsAccurate free energies for more complex systems Faster and more accurate screening of protein / ligand binding Faster and more accurate screening of protein / ligand binding Membrane proteins (e.g. receptors)Membrane proteins (e.g. receptors) Complex conformational changes (e.g. protein folding)Complex conformational changes (e.g. protein folding) Excited state dynamicsExcited state dynamics

15 Beyond Teraflops toward Petaflops 2 October 2002

Computational Science and Engineering Department Daresbury Laboratory

Biomolecular SimulationBiomolecular Simulation2. Towards Petaflops2. Towards Petaflops

Faster simulations to approach real timescales Faster simulations to approach real timescales !! major scalability problems!! major scalability problems

New forcefields incorporating polarisation, cross-terms, etcNew forcefields incorporating polarisation, cross-terms, etc Increased use of Increased use of ab-initioab-initio methods methods

Tremendous potential due to importance of free energies.Tremendous potential due to importance of free energies. Multiple independent simulationsMultiple independent simulations Replica path - simultaneous minimisation or simulations of an Replica path - simultaneous minimisation or simulations of an

entire reaction pathwayentire reaction pathway Replica exchange - Monte Carlo exchange of configurations Replica exchange - Monte Carlo exchange of configurations

between an ensemble of replicas at different temperaturesbetween an ensemble of replicas at different temperatures Combinatorial approach to ligand bindingCombinatorial approach to ligand binding

membranes, molecular assemblies … membranes, molecular assemblies …

TimeTime

SizeSize

Sampling Sampling

AccuracyAccuracy

16 Beyond Teraflops toward Petaflops 2 October 2002

Computational Science and Engineering Department Daresbury Laboratory

Replica Path MethodsReplica Path Methods

Replica path method - simultaneously optimise a series of Replica path method - simultaneously optimise a series of points defining a reaction path or conformational change, points defining a reaction path or conformational change, subject to path constraints.subject to path constraints.

Suitable for QM and QM/MM HamiltoniansSuitable for QM and QM/MM Hamiltonians

Parallelisation per point Parallelisation per point

Communication is limited to Communication is limited to adjacent points on the path - adjacent points on the path - global sum of energy functionglobal sum of energy function

PP3636

PP44

PP3232 PP3333

PP11PP00

PP3434 PP3535

PP33PP22

EE

Reaction Co-ordinateReaction Co-ordinate

Collaboration with Bernie Brooks (NIH) Collaboration with Bernie Brooks (NIH) http://www.cse.clrc.ac.uk/qcg/chmgukhttp://www.cse.clrc.ac.uk/qcg/chmguk

17 Beyond Teraflops toward Petaflops 2 October 2002

Computational Science and Engineering Department Daresbury Laboratory

Biomolecular Simulation Biomolecular Simulation 3. New Horizons3. New Horizons

Towards full quantum simulation (e.g. Car Parrinello)Towards full quantum simulation (e.g. Car Parrinello)

Towards Whole cell simulationTowards Whole cell simulation Mechanical deformation, Electrical behaviourMechanical deformation, Electrical behaviour

Diffusion of polymeric molecules (e.g. neuro-transmitters) by DPDDiffusion of polymeric molecules (e.g. neuro-transmitters) by DPD

Nanoscale models for supra-molecular Nanoscale models for supra-molecular structures (e.g. actin filaments in muscles)structures (e.g. actin filaments in muscles)

Atomistic Molecular DynamicsAtomistic Molecular Dynamics

Quantum chemistry of reacting sites Quantum chemistry of reacting sites

CCP1 Flagship project - Simulation of Condensed Phase CCP1 Flagship project - Simulation of Condensed Phase Reativity: http://www.ccp1.ac.uk/projects.shtmlReativity: http://www.ccp1.ac.uk/projects.shtml

18 Beyond Teraflops toward Petaflops 2 October 2002

Computational Science and Engineering Department Daresbury Laboratory

Heavy Particle Dynamics Heavy Particle Dynamics 1. Current Status1. Current Status

ToolsTools Many methods require evaluation of energies on a massive multi-Many methods require evaluation of energies on a massive multi-

dimensional griddimensional gridUse high-level computational chemistry methods (e.g. NWChem, Use high-level computational chemistry methods (e.g. NWChem,

MOLPRO), together with task farming.MOLPRO), together with task farming. Complex parameter fittingComplex parameter fitting

Can exploit interactivity (INOLLS) incorporating experimental dataCan exploit interactivity (INOLLS) incorporating experimental data Dynamical Simulation MethodsDynamical Simulation Methods

Wavepacket evolution on a GridWavepacket evolution on a GridClassical path methods (multiple direct dynamics trajectories)Classical path methods (multiple direct dynamics trajectories)Variational solutions for spectroscopic levelsVariational solutions for spectroscopic levels

TargetsTargets Larger species (5 atoms and beyond)Larger species (5 atoms and beyond) Influence of surrounding molecules (master equation)Influence of surrounding molecules (master equation)

CCP6 Collaboration, ChemReact consortium on national HPC facilities CCP6 Collaboration, ChemReact consortium on national HPC facilities

19 Beyond Teraflops toward Petaflops 2 October 2002

Computational Science and Engineering Department Daresbury Laboratory

Heavy Particle DynamicsHeavy Particle Dynamics2. Towards Petaflops2. Towards Petaflops

Interactivity in potential energy surface fitting.Interactivity in potential energy surface fitting. Longer time simulations (slower reactions)Longer time simulations (slower reactions)

PE surfaces from large basis set CCSD(T) etcPE surfaces from large basis set CCSD(T) etc MRCI for excited states and couplingsMRCI for excited states and couplings

Large grids for higher dimensionality systems (5 atoms)Large grids for higher dimensionality systems (5 atoms) Multiple coupled PE surfaces (involvement of excited Multiple coupled PE surfaces (involvement of excited

states) states) J > 0 - additional angular momentum statesJ > 0 - additional angular momentum states

Larger grids for wavepacket evolution Larger grids for wavepacket evolution

TimeTime

SizeSize

Sampling Sampling

AccuracyAccuracy

20 Beyond Teraflops toward Petaflops 2 October 2002

Computational Science and Engineering Department Daresbury Laboratory

Integration with Combustion, detonation, atmospheric models, most Integration with Combustion, detonation, atmospheric models, most likely through detailed parameter and rate constant derivationlikely through detailed parameter and rate constant derivation

CFD simulation (transient or steady state, CFD simulation (transient or steady state,

turbulence models etc)turbulence models etc)

Reduced chemical kineticsReduced chemical kinetics

Reaction rates, sensitivity to pressure and temperatureReaction rates, sensitivity to pressure and temperature

Heavy particle dynamicsHeavy particle dynamics

Heavy Particle DynamicsHeavy Particle Dynamics3. New Horizons3. New Horizons

21 Beyond Teraflops toward Petaflops 2 October 2002

Computational Science and Engineering Department Daresbury Laboratory

The Software ChallengeThe Software Challenge Lack of sustained focus within the Chemical Sciences for responding Lack of sustained focus within the Chemical Sciences for responding

to the challenges of Petascale Computingto the challenges of Petascale Computing The impact of providing such a focus periodically demonstrated at The impact of providing such a focus periodically demonstrated at

points on the road to Terascale Computingpoints on the road to Terascale Computing The HPCC Grand Challenge projectsThe HPCC Grand Challenge projects NWChem - DOE (PNNL) - principally Electronic Structure;NWChem - DOE (PNNL) - principally Electronic Structure; NAMD - NIH funded initiative in bio-molecular sciences and Classical MD.NAMD - NIH funded initiative in bio-molecular sciences and Classical MD.

Such Initiatives demand: Such Initiatives demand: The successful integration of multi-disciplinary teams including Application The successful integration of multi-disciplinary teams including Application

and Computational scientists, Computer Scientists and Mathematicians;and Computational scientists, Computer Scientists and Mathematicians; A long term commitment to the challenge, with funding in place to respond A long term commitment to the challenge, with funding in place to respond

to the inevitable pace of architecture / hardware change.to the inevitable pace of architecture / hardware change. Relying on the efforts of individual groups to overcome this software Relying on the efforts of individual groups to overcome this software

challenge challenge will not workwill not work..

22 Beyond Teraflops toward Petaflops 2 October 2002

Computational Science and Engineering Department Daresbury Laboratory

Problem Solving EnvironmentsProblem Solving EnvironmentsRequirement:Requirement: A comprehensive A comprehensive problem solving environment (PSE) for problem solving environment (PSE) for molecular modeling and simulation. molecular modeling and simulation. Key components include:Key components include:

• common graphical user interfacescommon graphical user interfaces• scientific modelling managementscientific modelling management• seamless transfer of information seamless transfer of information

between applicationsbetween applications• persistent data storagepersistent data storage• integrated scientific data managementintegrated scientific data management• tools for ensuring efficient use of tools for ensuring efficient use of

computing resources across a computing resources across a distributed network i.e. GRIDdistributed network i.e. GRID

• visualization of multi-dimensional data visualization of multi-dimensional data structuresstructures

23 Beyond Teraflops toward Petaflops 2 October 2002

Computational Science and Engineering Department Daresbury Laboratory

Summary and Recommendations 1.Summary and Recommendations 1.

Computational chemistry on Tera-scale resources “needs work”, but Computational chemistry on Tera-scale resources “needs work”, but there are plenty of opportunities to advance there are plenty of opportunities to advance collaborativelycollaboratively chemical chemical sciences “towards petaflops”sciences “towards petaflops”

The short term priority is scaling and adapting current methodologiesThe short term priority is scaling and adapting current methodologies Advancing use of distributed data algorithmsAdvancing use of distributed data algorithms O(N) techniques to remove bottlenecks and enhance scalabilityO(N) techniques to remove bottlenecks and enhance scalability Detailed work on parallel scaling Detailed work on parallel scaling

– Library developmentsLibrary developments

– Performance analysis and prediction toolsPerformance analysis and prediction tools Re-formulation of problems in terms of more weakly interacting Re-formulation of problems in terms of more weakly interacting

ensemblesensembles Parallel implementations of more complex physical modelsParallel implementations of more complex physical models Automation, data handling, PSEs for combinatorial workAutomation, data handling, PSEs for combinatorial work

In the longer term by tackling more integrated problemsIn the longer term by tackling more integrated problems Modularity of softwareModularity of software Science of the time / length scale interfacesScience of the time / length scale interfaces

24 Beyond Teraflops toward Petaflops 2 October 2002

Computational Science and Engineering Department Daresbury Laboratory

Summary and Recommendations 2.Summary and Recommendations 2. Investment in Software: “Code Sharing”Investment in Software: “Code Sharing”

UK has kept a strong applications focus, but has lagged behind the US in UK has kept a strong applications focus, but has lagged behind the US in the radical re-design of simulation packagesthe radical re-design of simulation packages

Sustained investment in Petascale Software DevelopmentSustained investment in Petascale Software Development Current UK and International collaborationsCurrent UK and International collaborations

Scalable QC algorithmsScalable QC algorithmsNWChem, MOLPRO, GAMESS-UK (PNNL, ORNL, SDSC, DL, NWChem, MOLPRO, GAMESS-UK (PNNL, ORNL, SDSC, DL,

CCP1/5/6); CCP1/5/6); Replica path methods in CHARMM/GAMESS-UKReplica path methods in CHARMM/GAMESS-UK

DL collaboration with NIH, PSC; DL collaboration with NIH, PSC; Flexible QM/MM models incorporating classical polarisation Flexible QM/MM models incorporating classical polarisation

ChemShell / GULP / GAMESS-UK / ChemShell / GULP / GAMESS-UK / NAMDNAMD;; Distributed data classical models, electrostatic models Distributed data classical models, electrostatic models

DL_POLY, NWChem – CCP1/5.DL_POLY, NWChem – CCP1/5. Stronger Links between UK initiatives and US programs SciDAC and Stronger Links between UK initiatives and US programs SciDAC and

NSF PACI, NPACI etc.NSF PACI, NPACI etc.