university of toledo national lab day hpc plenary panellanl vision of simultaneous excellence:...
TRANSCRIPT
Managed by Triad National Security, LLC for the U.S. Department of Energy's NNSA
University of Toledo National Lab Day
Irene QualtersAssociate Lab Director
Simulation and Computation
October 11, 2019
Los Alamos National Laboratory
HPC Plenary Panel
LA-UR-19-31231
2
• The senior DOE Federally Funded Research and Development Center (FFRDC) and operated by Triad National Security, LLC under contract to the National Nuclear Security Administration (NNSA)
• People: >11,000; >1,100 students and > 400 post-docs; 19% PhD.; 47% minorities; 33% female
• Budget: $2.55B• Location: 1,000 buildings on 35 acres
35 miles NW of Santa Fe, NM
Los Alamos National Laboratory Delivering science and technology to protect our nation and promote world stability
Los Alamos National Laboratory
3
LANL Vision of Simultaneous excellence: Balance between operations and mission
Excellence in Mission Operations
Excellence in Nuclear Security
Excellencein Mission-Focused
Science, Technology,
and Engineering
Excellence in Community
Relations
Execute sustained operations that are reliable and responsive to mission needs
Sustain and enhance LANL’s partnership with
regional and national communities
Design, produce, and certify current and future nuclear weapons and reduce global nuclear threats
Deliver scientific discovery and technical
breakthroughs that support DOE/NNSA
missions
Los Alamos National Laboratory
4
IBM 4051943
MANIAC I1952
(MPP/Data Parallel) (Vector machine) (SMP) (Massively process parallel) (commodity computing)
CDC 6600 1966
HPC’s mission at the Lab has always been two-fold:• Provide HPC to execute scientific mission• Advance HPC to help execute future scientific mission
(Hybrid architecture)
(Large core memory)
IBM Stretch begat IBM 360 1962
Cray 11976
Cray XMP1983
TMC CM5 1992
SGI Blue Mountain
1998
LNXI Lightning
2004
Roadrunner 2007
Trinity 2016
(Massive Memory/Burst Buffers)
DwaveIsing 2016
(Quantum Annealing)
(Tubes)
(Transistors)
(App, Workflow, Code Exploitation Efficiency)
2012
LANL: Leading edge of computing in service to the mission
(Mechanical)
Los Alamos National Laboratory
5
“Under ASC, computer simulation capabilities are developed to analyze and predict the performance, safety, and reliability of nuclear weapons and to certify their functionality.”
ASC provides simulation-based confidence in the U.S. stockpile
DOE/NNSA Advanced Simulation and Computing (ASC) program provides the computational surrogate for testing
Los Alamos National Laboratory
6
LANL provides major HPC systems (and significant broader capability) for mission execution
Classified ASC HPC SystemsFire: Penguin, 1390 teraflop/sIce: Penguin, 1390 teraflop/sCyclone: Penguin, 1390 teraflopsViewmaster2 : Appro, VisualizationTrinity: Cray, 41,800 teraflop/s
Cielo ASC FundedInstitutionally Funded
Unclassified HPC SystemsGrizzly: Penguin, 1890 teraflop/sSnow: Penguin, 464 teraflop/sLightshow: Appro, VisualizationKodiak: Penguin, gpu, 1850 teraflopsTrinitite: Cray, 364 teraflopsBadger: Penguin, 450 teraflopsWoodchuck: Dell, cloud/container 60 teraflopsIsing, DwaveDarwin: test bed for technology/architecture exploration
7
Co-design: Hardware, software, applied mathematics & domain science work together to enable scientific discovery
• Co-design is a process, not a goal unto itself
• Directly underpins our broad national security mission
• Crosscuts a wide swath of LANL scientists from STE, Weapons, Global Security
APP!
SW!HW!
Domain Workload!
Exascale Simulation
Environment!
The process of co-design as presented in the ECDC White Paper written by
five exascale co-design centers
Co-design … it’s in our DNA
Suites of Experiment,
Data
High-Performance Computing, Simulation,
Visualization
Theory, Modeling,
Data Analytics
Accelerating Discovery, Innovation, Impact for
Present & Future Complex Missions
Prediction & Control
Los Alamos National Laboratory
8
Exascale is required to address both urgent and longer-term NNSA needs• Urgent national security exascale drivers:
– Aging and manufacturing challenges in current stockpile systems
– Changing threat environment for both current and future systems
• Longer-term national security issues requiring exascale for resolution include:– Nuclear device disablement– 3D assessment of surety options– Understanding plutonium– Determining the effects of hydrodynamic
mixing on performance• Similar motivations exist for
many science fields– e.g., earth systems modeling
Simulation scale
Teraflops
Petaflops
2D Microns of Cu and Al
Microns ofTa and Fe
Microns ofU and Ce
3D
Exaflops
Mod
el c
ompl
exity
Los Alamos National Laboratory
9
While our confidence in the stockpile remains high, the approach to underwrite that confidence has changed
The need for resolution and fidelity at scale drives our need for increased computing capability
Los Alamos National Laboratory
10
Energy Exascale Earth System Model (E3SM) coastal focus
Average Ocean Surface Nitrate
Low resolution Enhanced resolution along North American coasts
Los Alamos National Laboratory
11
Predictive understanding of human and environmental coastal dynamics for society and national security
New DOE projects
Joint research across DOE/BER, National Laboratories, Academia
Integrated Coastal Modeling (ICoM)
NOAA Arctic Report CardInterdisciplinary Research for Arctic Coastal Environments (InteRFACE)
LANL’s Coastal Grand Challenge
New Science for Multisector
Adaptation (NeMSA)
USGS
Lori Murray
Los Alamos National Laboratory
12
LANL’s strategic computing priorities extend beyond exascale
• Data aware computing for mission– AI/Machine learning, complex workflows, visualization, programming models
• Current Focus: Domain science-informed machine learning– Experimental-facility-centric data streams– Informed by theory; integrated with simulation; trustworthy– Cyber-physical systems; optimization methodology
• Novel computing architectures, methods, and algorithms– Quantum, Neuromorphic, …– Hybrid and heterogeneous
• Revolutionary approaches to extreme scale computing– Blurring the boundaries of applications, algorithms, software & ASICs post-exascale
(“Co-Design on steroids”)– 3D methods; multiphysics/multiscale approaches– Engineering agility into the software platforms
Los Alamos National Laboratory
13
LANL’s Data Science at Scale strategy leverages historic strengths to impact emerging directions• Data Science at Scale seeks to exploit the extremely large datasets
and extremely high-rate data-streams from sensors, experimental and observational, embedded computing, and traditional high-performance computing
• Proven Leadership Capabilities– Statistics/UQ …for AI trustworthiness– Visualization & Data Analytics– Robust understanding of physical models
based on theory and experiment
• Emerging Thrusts– Scientific Machine Learning– Data Science for Experimental and Design Workflows Image and
Signal Analysis
Visualizationand DataAnalytics
Statistics/UQ
Los Alamos National Laboratory
Machine Learning
14
LANL is developing machine learning for science and security missions
• ML has expanded rapidly at LANL• Converging communities of multidisciplinary
researchers• Leveraging historical strengths in computational
physics – “physics-informed ML”• Investing with LDRD program to develop capability
and teams for mission • Engaging community through workshops and
conferences• Developing pipeline with schools and student
internships.
Los Alamos National Laboratory
92 AIML publications in 2018
15
Applications to Mission Science
OE/GMLC: AE, A, T DOE/NCI: B, T, CCS
Electric Grid
ISO/RTO/BAA
DSO DSO DSO
PrimaryDistribution Substation
PrimaryDistribution Substation
PrimaryDistribution Substation
PrimaryDistribution Substation
Community Solar
RDistribution Scale Storage
Prosumer DER
B2G Services
SDirect Inter-DSO Power Flow
UPFC R
Prosumer DERB2G Services
Merchant Storage
Coordinator node
Laminar terminal node
Inter-domain Communication bus
Electrical connection
Substationcomms agg Substation
comms agg
Grid Edge
Utility Edge
DEV
DER
Comm bus
Comm bus
Comm bus
DER
Agg
Grid Edge
Utility Edge
DEV
DER
Comm bus
Comm bus
Comm bus
DER
Agg
Subs
tatio
n
Feeder
Section
DO
Comm bus Utility FAN
Feeder
Section
SubstationSubstation
Comm bus
Layered decomposition node allocation
Communication network models(Graphical String Notation)
Graphical models for optimal power flow
Cancer
Deep learning for MD, drug response, personalized treatment
High Performance Computing
NNSA/ASC: HPC
Detected Anomalies
Process system logs to identify & correct network, storage & application issues.
Quantum Chemistry and Molecular
Dynamics
Use ML to construct potentials based upon large datasets of quantum calculations
NNSA/ASC
Los Alamos National Laboratory
16
The Center for Nonlinear Studies (CNLS) stimulates vibrant intellectual exchange in key areas of interest
Mechanistic Studies of Human Disease
Theory and Computation of Quantum Systems
Machine Learning Enhanced Modeling
Dynamics of Systems Far From Equilibrium
Angel E. Garcia, DirectorLos Alamos National Laboratory
17
• A robust and diverse postdoctoral fellowships that pairs postdoctoral fellows with laboratory staff
• A graduate student program during the calendar year and a summer student program (27 in 2015; 25 in 2016; 24 students in 2017, 23 in 2018)
• A large visitors program (~200 visitors/ year from academia and industry)• Active colloquium and seminar series (Colloquium, Q-mat seminar, Quantum lunch,
Postdoctoral fellows seminar, and other visitors seminars)• Organization of international conferences that explore science at the interfaces• The Ulam Scholar program that hosts long term (sabbatical) visitors at CNLS
CNLS meets its goals by maintaining:
Los Alamos National Laboratory
18
Information Science and Technology Institute (ISTI )Summer Schools address strategic areas to fill recruiting pipelines
Summer School Program• 2nd Quantum Computing Summer School
• 3rd Applied Machine Learning Research Internship
• 3rd Cyber Toaster• 4th Parallel Computing School• 9th Computational Co-Design School
• 9th Data Science at Scale School
• 13th Computer Systems, Clusters, and Networking Summer Institute
• Computational Physics Summer Workshop (affiliate)
Co-sponsors• Advanced Simulation and Computing Program• Center for Nonlinear Studies• Center for Space and Earth Science• Office of the Chief Information Officer
103 offers
Los Alamos National Laboratory
19
High performance computing
• Traditional HPC for National Need• Advanced Solutions for Exascale
Computation• Novel Computing Applications
Los Alamos National Laboratory