september pacific wave and prp/grp big news for big data · cenic: california’s research &...

Post on 12-Jun-2020

0 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

PacificWaveandPRP/GRPBigNewsforBigData

DaveReese

29TH NORDUNET CONFERENCEHELSINKI,FINLANDSEPTEMBER 22,2016

Six Charter Associates:

• California K-12 System

• California Community Colleges

• California State University System

• Stanford, Caltech, USC

• University of California

• California Public Libraries

• CENIC is a non-profit created to serve California’s K-20 research & education institutions with cost-effective, high-bandwidth networking

CENIC: California’s Research & Education Network• 3,800+milesofopticalfiber• US$75Mannualoperatingbudget• Membersinall58countiesconnectvia

fiber-opticcableorleasedcircuitsfromtelecomcarriers

• Over10,000 sitesconnecttoCENIC• 20,000,000 CaliforniansuseCENIC• Governedbymembersonthesegmental

level• Collaboratewithover500privatesector

partners• 88 other peering partners

(Google, Microsoft, Amazon …)• Enables worldwide collaboration

PacificWave• Beganasfirstgeographicallydistributedexchangein2004

• PacificWaveisanopenexchangesupportingbothcommercialandR&Epeers

• Currentlyserves29countriespeeringacrossthePacificandWesternUnitedStates

• WithPNWGPandTransPac,announcedthefirst100GbpsTrans-Pacific linkfromTokyotoSeattlein2015

PacificWaveandWRN

• PacificWaveandtheWesternRegionNetworkprovidefora100GbpsnetworkspanningtheWesternUnitedStatesservingPNWGP,CENIC,FRGP,ABQGPandUH.

• PacificWaveandNSFIRNCawardeePIREN(Univ ofHawaii)worktogethersupportingAARNet linkstoCaliforniaandWashingtonandexpansionofhigh-speedservicethroughthePacificIslandsRegion

www. p n w - g i g a p o p . n e t

Nx100GAcrossthePacific• CURRENT:

– TransPac/PacificWave(Tokyo-Seattle)– SINGAREN/Internet2(Singapore-LosAngeles)– SINET/SoftBank/PacificWave(Tokyo-LosAngeles)– AARNET/PIREN/PacificWave(Australia-SEA)– AARNET/PIREN/PacificWave(Australia-LA)

• FUTURE:– UH/PIREN/PacificWave(Guam-Hawaii-LA)

PacificWaveandNSF/IRNC• PacificWavehasbeenpartiallysupportedthroughthreeseparatefive-yearNationalScienceFoundationgrantssupportinggrowth,connectivityandinnovation

• Currentawardpromotes100GexpansionandimplementationofSDXcapabilitieswithinPacificWave(ACI-1451050)

SDX=SDN+IXP

9

AS A Router

ASCRouter

ASB Router

BGPSession

SDNSwitch

SDXControllerSDX

AbstractionLayer(FlowSpace Firewall)OpenFlowSwitches

On-rampLocations(Ethernet/virtualcircuits)

NetworkTestbedEnivironments

CircuitBuilding(NSI)

SDXmiddlewareOpenFlow Controllers

(plural)

Testbed Resources/OtherUses(DTNs) ScienceGroupApplications /Uses

Pacific Wave SDX Testbed Control Plane

Vision: Creating a Pacific Research Platform

Use Optical Fiber Networks to Connect All Data Generators and Consumers,

Creating a “Big Data” Freeway System

“The Bisection Bandwidth of a Cluster Interconnect, but Deployed on a 20-Campus Scale.”

This Vision Has Been Building for 15 Years

Creating a “Big Data” Freeway on Campus:NSF-Funded Prism@UCSD and CHeruB Grants

Prism@UCSD, Phil Papadopoulos, SDSC, Calit2, PI (2013-15)CHERuB, Mike Norman, SDSC PI

CHERuB

These Are Twoof Over

100 NSF Campus Cyberinfrastructure

GrantsMade in the Last 4 Years

How Prism@UCSD Transforms Big Data Microbiome Science:Preparing for Knight/Smarr 1 Million Core-Hour Analysis

12 Cores/GPU128 GB RAM3.5 TB SSD48TB Disk

10Gbps NIC

Knight Lab

10Gbps

Gordon

Prism@UCSD

Data Oasis7.5PB,

200GB/s

Knight 1024 ClusterIn SDSC Co-Lo

CHERuB100Gbps

Emperor & Other Vis Tools

64Mpixel Data Analysis Wall

120Gbps

40Gbps

1.3Tbps

Next Step: The Pacific Research Platform Creates a Regional End-to-End Science-Driven “Big Data Freeway System”

NSF CC*DNI Grant$5M 10/2015-10/2020

PI: Larry Smarr, UC San Diego Calit2Co-Pis:• Camille Crittenden, UC Berkeley CITRIS, • Tom DeFanti, UC San Diego Calit2, • Philip Papadopoulos, UC San Diego SDSC, • Frank Wuerthwein, UC San Diego Physics and

SDSC

ThePacificResearchPlatform(PRP)• NSFCC-NIEandsimilarprojectsrepresentsignificant investmentsincampus

infrastructureincluding SDN,ScienceDMZ’s (~130projects)

• Butthescientistsarestillstruggling withthecomplexityofusingthenetworkandinteroperabilitybetweendifferentimplementations ofScienceDMZ’s

• PRPfocusesonenabling thesciencecommunitiesacrossthePacificregion tomakeeffectiveuseof thehighperformance infrastructure

• Kick-off inDecember2014:takeadvantageoftheregionalinfrastructure;perfSONAR formeasurement/analysisandMaDDashforvisualization

• IncludeDTN’s:useacommonsoftwaresuitefordatamovement; reflectdisk-to-diskperformanceonMaDDash

• Demonstratedasaproof-of-conceptattheCENICSpringmeeting (March2015)

DOE ESnet’s Science DMZ: A Scalable Network Design Model for Optimizing Science Data Transfers

A Science DMZ integrates four key concepts into a unified whole:

– A network architecture designed for high-performance applications, with the science network distinct from the general-purpose network

– The use of dedicated systems for data transfer

– Performance measurement and network testing systems that are regularly used to characterize and troubleshoot the network

– Security policies and enforcement mechanisms that are tailored for high performance science environments

http://fasterdata.es.net/science-dmz/

PRPv0 - An experiment including:

CaltechCENIC / Pacific WaveESnet / LBNLNASA Ames / NRENSan Diego State UniversitySDSCStanford UniversityUniversity of WashingtonUSC

UC BerkeleyUC DavisUC IrvineUC Los AngelesUC RiversideUC San DiegoUC Santa Cruz

17

18

PRPv0 ExperimentThe PRPv0 experiment concentrated on the regional aspects of the research data movement challenge.

§ High-performance interconnection among campus Science DMZs

§ A mesh of perfSONAR toolkit instances§ perfSONAR MaDDash -- Measurement

and Debugging Dashboard§ Flash I/O Network Appliances (FIONAs)

and Data Transfer Nodes (DTNs)§ GridFTP file transfers to quantify

throughput, with results reflected on MaDDash

§ CalREN HPR / AS2153§ A partial mesh of bilateral BGP

sessions across the Pacific Wave distributed exchange

FIONA – Flash I/O Network Appliance:Linux PCs Optimized for Big Data on DMZs

FIONAs Are Science DMZ Data Transfer Nodes (DTNs) &

Optical Network Termination DevicesUCSD CC-NIE Prism Award & UCOPPhil Papadopoulos & Tom DeFantiJoe Keefe & John Graham

Cost $8,000 $20,000IntelXeonHaswell E5-1650v36-Core 2xE5-2697v314-Core

RAM 128GB 256GBSSD SATA3.8TB SATA3.8TB

NetworkInterface 10/40GbEMellanox 2x40GbEChelsi+MellanoxGPU NVIDIATeslaK80

RAIDDrives0to112TB(add~$100/TB)

UCOP Rack-Mount Build: Source:JohnGrahamandTomDeFanti,Calit2

§ DTNs loaded with Globus Connect Server suite to obtain GridFTP tools.

§ cron-scheduled transfers using globus-url-copy.

§ ESnet-contributed script parses GridFTP transfer log and loads results in an esmond measurement archive.

§ FDT – developed by Caltech in collaboration with PolytehnicaBucharest

20

As of 3/9/15, the Pacific Research Platform (PRPv0) as a facility, logs rather good performance: From To Measured

Bandwidth Data Transfer Utility

San Diego State Univ. UC Los Angeles 5Gb/s out of 10 GridFTP UC Riverside UC Los Angeles 9Gb/s out of 10 GridFTP UC Berkeley UC San Diego 9.6Gb/s out of 10 GridFTP UC Davis UC San Diego 9.6Gb/s out of 10 GridFTP UC Irvine UC Los Angeles 9.6Gb/s out of 10 GridFTP UC Santa Cruz UC San Diego 9.6Gb/s out of 10 FDT Stanford UC San Diego 12Gb/s out of 40 FDT Univ. of Washington UC San Diego 12Gb/s out of 40 FDT UC Los Angeles UC San Diego 36Gb/s out of 40 FDT Caltech UC San Diego 36Gb/s out of 40 FDT Table I.2.1: Bandwidth of flash disk-to-flash disk file transfers shown between several sites

for the existing experimental facility “PRPv0.”

January 29, 2016 PRPV1 (L3)

PRP Point-to-Point Bandwidth MapGridFTP File Transfers-Note Huge Improvement in Last Six Months

June 6, 2016 PRPV1 (L3)Green is Disk-to-DiskIn Excess of 5Gbps

Troubleshooting Unidirectional Performance Issues

Measuring performance – IPv6

Measuring Performance – IPv4

25

PRP Timeline

• PRPv1– A routed Layer 3 architecture – Tested, Measured, Optimized, With Multi-domain Science Data– Bring Many Of Our Science Teams Up – Each Community Thus Will Have Its Own Certificate-Based Access

To its Specific Federated Data Infrastructure.

• PRPv2– Incorporating SDN/SDX, AutoGOLE / NSI– Advanced IPv6-Only Version with Robust Security Features

– e.g. Trusted Platform Module Hardware and SDN/SDX Software– Support Rates up to 100Gb/s in Bursts And Streams– Develop Means to Operate a Shared Federation of Caches– Cooperating Research Groups

Resources

www. p n w - g i g a p o p . n e t

Pacific Wavehttp://www.pacificwave.net/https://ps-dashboard.pacificwave.net

CENIChttp://www.cenic.org/https://ps-dashboard.cenic.net

Pacific Research Platformhttp://prp.ucsd.edu/http://cenic.org/files/publications/PRP_Overview_%C6%92.pdfhttp://prp-maddash.calit2.optiputer.net/maddash-webui/

Calit2http://www.calit2.net/

CITRIShttp://citris-uc.org/

ESnethttp://www.es.net/http://fasterdata.es.net/http://ps-dashboard.es.net/

Invitation-Only PRP Workshop Held in Calit2’s Qualcomm InstituteOctober 14-16, 2015

• 130 Attendees From 40 organizations – Ten UC Campuses, as well as UCOP Plus 11 Additional US Universities– Four International Organizations (from Amsterdam, Canada, Korea, and Japan) – Five Members of Industry Plus NSF

CMS

Pacific Research PlatformDriven by Data-Intensive Research

EarthquakeEngineering

Biomedical‘omics

ParticlePhysics

TelescopeSurveys

Visualization, Virtual Reality, Collaboration

Cancer Genomics Hub (UCSC) is Housed in SDSC:Large Data Flows to End Users at UCSC, UCB, UCSF, …

1G

8G

Data Source: David Haussler, Brad Smith, UCSC

15GJan 2016

30,000 TBPer Year

Two Automated Telescope SurveysCreating Huge Datasets Will Drive PRP

300 images per night. 100MB per raw image

30GB per night

120GB per night

250 images per night. 530MB per raw image

150 GB per night

800GB per nightWhen processed

at NERSC Increased by 4x

Source: Peter Nugent, Division Deputy for Scientific Engagement, LBLProfessor of Astronomy, UC Berkeley

Precursors to LSST and NCSA

PRP Allows Researchersto Bring Datasets from NERSC

to Their Local Clusters for In-Depth Science Analysis

Data Flows Over HPWREN

Global Scientific Instruments Will Produce Ultralarge Datasets Continuously Requiring Dedicated Optic Fiber and Supercomputers

https://tnc15.terena.org/getfile/1939

Square Kilometer Array Large Synoptic Survey Telescope

https://tnc15.terena.org/getfile/1939 www.lsst.org/sites/def ault/files/document s/DM%20Introduction%20-%20K antor.pdf

Tracks ~40B Objects,Creates 10M Alerts/Night

Within 1 Minute of Observing

2x40Gb/s

We are Experimenting with the PRP for Large Hadron Collider Data Analysis Using The West Coast Open Science Grid on 10-100Gbps Optical Networks

Crossed 100 Million

Core-Hours/MonthIn Dec 2015

Over 1 Billion Data Transfers

Moved200 Petabytes

In 2015

Supported Over200 Million Jobs

In 2015

Source: Miron Livny, Frank Wuerthwein, OSG

ATLAS

CMS

40G FIONAs

20x40G PRP-connected

WAVE@UC San Diego

PRP LinksCreates Distributed Virtual Reality

PRP

CAVE@UC Merced

DanCayanUSGSWaterResourcesDiscipline

ScrippsInstitutionofOceanography,UCSanDiegomuchsupport fromMaryTyree,MikeDettinger,Guido Francoandothercolleagues

NCARUpgradingto10GbpsLinkOverWestnetfromWyomingandBouldertoCENIC/PRP

Sponsors:California EnergyCommissionNOAARISAprogramCaliforniaDWR,DOE,NSF

PlanningforclimatechangeinCaliforniasubstantialshiftsontopofalreadyhighclimatevariability

UCSD Campus Climate Researchers Need to Download Results from NCAR Remote Supercomputer Simulations

to Make Regional Climate Change Forecasts

average summer afternoon temperature

average summer afternoon temperature

Downscaling Supercomputer Climate SimulationsTo Provide High Res Predictions for California Over Next 50 Years

36

Source: Hugo Hidalgo, Tapash Das, Mike Dettinger

approximately 50 miles: Note: locations are approximate

to CI andPEMEX

Extending PRP/CENIC Optical Backplane via High Speed Wireless Research and Education Network

Real-Time Network Cameras on Mountains for Environmental Observations

Source: Hans Werner Braun, HPWREN PI

14 May 2014: 9 Simultaneous Active Fires in San Diego County

San Diego County Red Mountain Fire Cameras• Southeast (left) “Highway” Fire• Southwest (center rear) “Poinsettia” Fire• West (right) “Tomahawk” Fire

Interactive Virtual Reality of San Diego CountyIncludes Live Feeds From 150 Met Stations

TourCAVE at Calit2’s Qualcomm Institute

HPWREN Users and Public Safety ClientsGain Redundancy and Resilience from PRP Upgrade

San Diego CountywideSensors and Camera

ResourcesUCSD & SDSU

Data & ComputeResources UCSD

UCR

SDSU

UCI

UCI & UCRData Replication

and PRP FIONA Anchorsas HPWREN Expands

Northward

10X Increase During Wildfires

Data From Hans-Werner Braun

• PRP CENIC 10G Link UCSD to SDSU– DTN FIONAs Endpoints– Data Redundancy – Disaster Recovery – High Availability – Network Redundancy

NSF Has Funded Over 100 Campuses to Build Local Big Data Freeways:Imagine Linking All of Them Like the Pacific Research Platform

Red 2012 CC-NIE AwardeesYellow 2013 CC-NIE AwardeesGreen 2014 CC*IIE AwardeesBlue 2015 CC*DNI AwardeesPurple Multiple Time Awardees

Source: NSF

Next Step: Global Research PlatformBuilding on CENIC/Pacific Wave and GLIF

Current InternationalGRP Partners

Questions?

top related