nvidia dgx a100...10+ years of ai innovation 100+ gpu-optimized software and tools on ngc 10,000+ of...

1
10+ Years of AI Innovation 100+ GPU-Optimized Software and Tools on NGC 10,000+ of AI-Fluent Practitioners Around the Globe Unlock AI in Your Organization Today www.nvidia.com/DGXA100 © 2020 NVIDIA Corporation. All rights reserved. NVIDIA, the NVIDIA logo, DGX, DGX SuperPOD, Mellanox, Mellanox Quantum, NGC, NVLink, and NVSwitch are trademarks and/or registered trademarks of NVIDIA Corporation in the U.S. and other countries. All other trademarks and copyrights are the property of their respective owners. DIRECT ACCESS TO A GLOBAL TEAM OF NVIDIA DGXPERTS GET UNMATCHED AI EXPERTISE WITH EVERY DGX SYSTEM NVIDIA With You Every Step of the Way DESIGN / PLAN / BUILD / TEST / DEPLOY / OPERATE / MONITOR INSIGHTS INSPIRATION ? WOOL COAT CUSTOMERS ALSO BOUGHT 1 ACCELERATE AI, ACCELERATE ROI DEPLOY FLEXIBLE AI INFRASTRUCTURE THAT REINS IN CAPEX 25 developers the performance equivalent of 2 dedicated GPUs each or 6 dedicated 28-core dual CPU servers each THE MOST POWERFUL TOOL FOR DATA SCIENCE TEAMS GIVE EVERY DEVELOPER THE POWER TO EXPLORE ACCELERATION FOR EVERY AI WORKLOAD RUN DATA ANALYTICS, TRAINING, AND INFERENCE WORKLOADS ON THE SAME SYSTEM Multi-Instance GPU (MIG) in a single DGX A100 gives: NVIDIA DGX A100 THE UNIVERSAL SYSTEM FOR AI INFRASTRUCTURE $11 Million | 25 Racks | 630 kW $1 Million | 1 Rack | 32.5 kW 5 DGX A100 Systems for AI Training and Inference 1/10 th THE COST 1/20 th THE POWER TODAY’S AI DATA CENTER DGX A100 DATA CENTER 1/25 th THE SPACE THE BUILDING BLOCK OF THE AI DATA CENTER AI INFRASTRUCTURE REIMAGINED, OPTIMIZED, AND READY FOR ENTERPRISE AI 500 GB/s Peak Bi-directional Bandwidth 10x NVIDIA ® Mellanox ® ConnectX-6 200 Gb/s Network Interface 3.2X More Cores to Power the Most Intensive AI Jobs Dual 64-Core AMD Rome CPUs 2 TB RAM 4.8 TB/s Bi-directional Bandwidth 2X More than Previous-Generation NVSwitch 6x NVIDIA NVSwitches 50 GB/s Peak Bandwidth 2X Faster than Gen3 NVME SSDs 30 TB Gen4 NVME SSD Up to 640 GB Total GPU Memory 12 NVIDIA NVLinks per GPU 600 GB/s GPU-to-GPU Bi-directional Bandwidth 8x NVIDIA A100 Tensor Core GPUs 1 TRILLION Transistors 1 KILOMETER of Traces 5 PETAFLOPS of AI Performance 1 MILLION Drill Holes 30 THOUSAND Components 50 THOUSAND Connector Pins DGX A100 INFRASTRUCTURE IS AGILE Run any workload on any system for maximized utilization. TRADITIONAL INFRASTRUCTURE IS CONSTRAINED Infrastructure silos starve AI workloads or waste capacity. Optimal Over Under CLUSTER 1: TRAINING CLUSTER 2: ANALYTICS CLUSTER 3: INFERENCE NVIDIA DGX A100 Systems NVIDIA Mellanox Quantum HDR 200 Gb/s InfiniBand Smart Switches Petabytes of All-Flash Storage Modular, with 20 Node Scalable Units Optimized Software from NVIDIA NGC Deployable in Weeks, Instead of Months UNIFIED AI INFRASTRUCTURE 0% 10% 20% 30% 40% 50% 60% 70% 80% Today’s Workload Tomorrow’s Workload Next Week’s Workload INFERENCE ANALYTICS TRAINING GAME-CHANGING PERFORMANCE FOR INNOVATORS NVIDIA DGX SUPERPOD: UNLOCKING AI AT SCALE FOR EVERY ENTERPRISE

Upload: others

Post on 03-Jan-2021

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: NVIDIA DGX A100...10+ Years of AI Innovation 100+ GPU-Optimized Software and Tools on NGC 10,000+ of AI-Fluent Practitioners Around the Globe Unlock AI in Your Organization Today www

10+Years of AIInnovation

100+GPU-Optimized Software

and Tools on NGC

10,000+of AI-Fluent Practitioners

Around the Globe

Unlock AI in Your Organization Todaywww.nvidia.com/DGXA100

© 2020 NVIDIA Corporation. All rights reserved.NVIDIA, the NVIDIA logo, DGX, DGX SuperPOD, Mellanox, Mellanox Quantum, NGC, NVLink, and NVSwitch are trademarks and/or registered trademarks of NVIDIA Corporation in the U.S. and other countries. All other trademarks and copyrights are the property of their respective owners.

DIRECT ACCESS TO A GLOBAL TEAM OF NVIDIA DGXPERTSGET UNMATCHED AI EXPERTISE WITH EVERY DGX SYSTEM

NVIDIA With You Every Step of the WayDESIGN / PLAN / BUILD / TEST / DEPLOY / OPERATE / MONITOR

INSIGHTSINSPIRATION

? WOOL COAT

CUSTOMERS ALSO BOUGHT

1

ACCELERATE AI, ACCELERATE ROIDEPLOY FLEXIBLE AI INFRASTRUCTURE THAT REINS IN CAPEX

25developers the

performance equivalent of

2dedicated GPUs each or

6dedicated 28-core dual

CPU servers each

THE MOST POWERFUL TOOL FOR DATA SCIENCE TEAMSGIVE EVERY DEVELOPER THE POWER TO EXPLORE

ACCELERATION FOR EVERY AI WORKLOADRUN DATA ANALYTICS, TRAINING, AND INFERENCE WORKLOADS ON THE SAME SYSTEM

Multi-Instance GPU (MIG)in a single DGX A100 gives:

NVIDIA DGX A100THE UNIVERSAL SYSTEM FOR AI INFRASTRUCTURE

$11 Million | 25 Racks | 630 kW $1 Million | 1 Rack | 32.5 kW5 DGX A100 Systems for AI Training and Inference

1/10th

THE COST

1/20th

THE POWER

TODAY’S AI DATA CENTER DGX A100 DATA CENTER

1/25th

THE SPACE

THE BUILDING BLOCK OF THE AI DATA CENTERAI INFRASTRUCTURE REIMAGINED, OPTIMIZED, AND READY FOR ENTERPRISE AI

500 GB/s Peak Bi-directional Bandwidth

10x NVIDIA® Mellanox® ConnectX-6200 Gb/s Network Interface

3.2X More Cores to Power the Most Intensive AI Jobs

Dual 64-Core AMD Rome CPUs2 TB RAM

4.8 TB/s Bi-directional Bandwidth

2X More than Previous-Generation NVSwitch

6x NVIDIA NVSwitches™

50 GB/s Peak Bandwidth

2X Faster than Gen3 NVME SSDs

30 TB Gen4 NVME SSD

Up to 640 GB Total GPU Memory

12 NVIDIA NVLinks™ per GPU

600 GB/s GPU-to-GPU Bi-directional Bandwidth

8x NVIDIA A100 Tensor Core GPUs

1TRILLIONTransistors

1KILOMETER

of Traces

5PETAFLOPS

of AI Performance

1MILLIONDrill Holes

30THOUSAND

Components

50 THOUSANDConnector Pins

DGX A100 INFRASTRUCTURE IS AGILERun any workload on any system for maximized utilization.

TRADITIONAL INFRASTRUCTURE IS CONSTRAINEDInfrastructure silos starve AI workloads or waste capacity.

Optimal Over Under

CLUSTER 1: TRAINING CLUSTER 2: ANALYTICS CLUSTER 3: INFERENCE

NVIDIA DGX™

A100 SystemsNVIDIA Mellanox Quantum™ HDR200 Gb/s InfiniBand Smart Switches

Petabytes ofAll-Flash Storage

Modular, with 20 Node Scalable Units

Optimized Softwarefrom NVIDIA NGC™

Deployable in Weeks, Instead of Months

UNIFIED AI INFRASTRUCTURE

0%

10%

20%

30%

40%

50%

60%

70%

80%

Today’sWorkload

Tomorrow’sWorkload

Next Week’sWorkload

INFERENCE

ANALYTICS

TRAINING

GAME-CHANGING PERFORMANCE FOR INNOVATORSNVIDIA DGX SUPERPOD: UNLOCKING AI AT SCALE FOR EVERY ENTERPRISE