the open cloud fpga testbed · tag line an moc-style catapult testbed and so much more • enhanced...
TRANSCRIPT
![Page 1: The Open Cloud FPGA Testbed · Tag line An MOC-style Catapult testbed and so much more • Enhanced with programmable hardware (FPGA) capabilities not present in other facilities](https://reader033.vdocuments.us/reader033/viewer/2022050305/5f6df09055facb2e490c7e6b/html5/thumbnails/1.jpg)
The Open Cloud FPGA Testbed –
Supporting Experiments on
Emerging Datacenter Configurations*
Martin Herbordt Miriam Leeser
* Funded by the National Science Foundation through the Computer
Community Research Infrastructure CCRI Grand Program
![Page 2: The Open Cloud FPGA Testbed · Tag line An MOC-style Catapult testbed and so much more • Enhanced with programmable hardware (FPGA) capabilities not present in other facilities](https://reader033.vdocuments.us/reader033/viewer/2022050305/5f6df09055facb2e490c7e6b/html5/thumbnails/2.jpg)
Motivation & Overview
![Page 3: The Open Cloud FPGA Testbed · Tag line An MOC-style Catapult testbed and so much more • Enhanced with programmable hardware (FPGA) capabilities not present in other facilities](https://reader033.vdocuments.us/reader033/viewer/2022050305/5f6df09055facb2e490c7e6b/html5/thumbnails/3.jpg)
Motivation (1/3) – Millions of FPGAs in the Cloud
for provider use – e.g. Microsoft Catapult
Provider system uses
• SDN
• Instrumentation and Metering
Provider internal applications
• Compression
• Encryption
Provider external applications
• Security and Privacy
• Machine Learning
• Other big-data analytics
![Page 4: The Open Cloud FPGA Testbed · Tag line An MOC-style Catapult testbed and so much more • Enhanced with programmable hardware (FPGA) capabilities not present in other facilities](https://reader033.vdocuments.us/reader033/viewer/2022050305/5f6df09055facb2e490c7e6b/html5/thumbnails/4.jpg)
Motivation (2/3) - FPGAs Everywhere in the
Datacenter – Various Observations
![Page 5: The Open Cloud FPGA Testbed · Tag line An MOC-style Catapult testbed and so much more • Enhanced with programmable hardware (FPGA) capabilities not present in other facilities](https://reader033.vdocuments.us/reader033/viewer/2022050305/5f6df09055facb2e490c7e6b/html5/thumbnails/5.jpg)
Motivation (3/3) – Potential of Millions of FPGAs in
Datacenters for HPC – e.g. Intel COPA
![Page 6: The Open Cloud FPGA Testbed · Tag line An MOC-style Catapult testbed and so much more • Enhanced with programmable hardware (FPGA) capabilities not present in other facilities](https://reader033.vdocuments.us/reader033/viewer/2022050305/5f6df09055facb2e490c7e6b/html5/thumbnails/6.jpg)
The Open Cloud Testbed
• Funded by National Science Foundation CCRI Grand Program
– Computer Community Research Infrastructure
• Building on Existing Infrastructure:
– MGHPCC: Massachusetts Green High Performance Computing Center
– MOC: Massachusetts Open Cloud
– OpenCloudLab
• What’s new:
– FPGAs for the user community
• Collaboration among– UMass Amherst, Boston University, Northeastern University
![Page 7: The Open Cloud FPGA Testbed · Tag line An MOC-style Catapult testbed and so much more • Enhanced with programmable hardware (FPGA) capabilities not present in other facilities](https://reader033.vdocuments.us/reader033/viewer/2022050305/5f6df09055facb2e490c7e6b/html5/thumbnails/7.jpg)
OCFT Context & Ecosystem
![Page 8: The Open Cloud FPGA Testbed · Tag line An MOC-style Catapult testbed and so much more • Enhanced with programmable hardware (FPGA) capabilities not present in other facilities](https://reader033.vdocuments.us/reader033/viewer/2022050305/5f6df09055facb2e490c7e6b/html5/thumbnails/8.jpg)
Core Team
![Page 9: The Open Cloud FPGA Testbed · Tag line An MOC-style Catapult testbed and so much more • Enhanced with programmable hardware (FPGA) capabilities not present in other facilities](https://reader033.vdocuments.us/reader033/viewer/2022050305/5f6df09055facb2e490c7e6b/html5/thumbnails/9.jpg)
MGHPCC: Massachusetts Green High Performance
Computer Center
Mass Open Cloud
![Page 10: The Open Cloud FPGA Testbed · Tag line An MOC-style Catapult testbed and so much more • Enhanced with programmable hardware (FPGA) capabilities not present in other facilities](https://reader033.vdocuments.us/reader033/viewer/2022050305/5f6df09055facb2e490c7e6b/html5/thumbnails/10.jpg)
MOC: Massachusetts Open Cloud
• Funded by Commonwealth, Industry partners and universities
• Thousands users, many thousands of users of services
• New Harvard/BU research IT plan to create a production service:
– consistent infrastructure, operations team, research facilitators, buy-in model
• Connection to NSF NESE (20+PB), NSF NE Cyberteam, Harvard Dataverse
• Sustainability through:
– integration with research IT and support for end-users
– industry support for cloud: interoperability lab, exposing new innovation, visibility into
usage
– extensive experience upstreaming with large industry driven open source communities
• Support smaller institutions: new MTC proposal & NE Cyberteam
• Used by regional “friends and family” CISE researchers: cybersecurity (MACS), systems,
data science …
![Page 11: The Open Cloud FPGA Testbed · Tag line An MOC-style Catapult testbed and so much more • Enhanced with programmable hardware (FPGA) capabilities not present in other facilities](https://reader033.vdocuments.us/reader033/viewer/2022050305/5f6df09055facb2e490c7e6b/html5/thumbnails/11.jpg)
What is Massachusetts Open Cloud (MOC)?
![Page 12: The Open Cloud FPGA Testbed · Tag line An MOC-style Catapult testbed and so much more • Enhanced with programmable hardware (FPGA) capabilities not present in other facilities](https://reader033.vdocuments.us/reader033/viewer/2022050305/5f6df09055facb2e490c7e6b/html5/thumbnails/12.jpg)
MOC supports
– real users
– access to real data sets
– can provide traces of real usage
– can allow services to be exposed to end-users (e.g.,
TTP)
– has access to production services at scale (e.g., NESE)
– infrastructure and services provided by industry
partners
![Page 13: The Open Cloud FPGA Testbed · Tag line An MOC-style Catapult testbed and so much more • Enhanced with programmable hardware (FPGA) capabilities not present in other facilities](https://reader033.vdocuments.us/reader033/viewer/2022050305/5f6df09055facb2e490c7e6b/html5/thumbnails/13.jpg)
• Scientific infrastructure for cloud research
• Three clusters (Utah, Wisconsin, and Clemson), which offer 15,000
cores
– Each cluster has a different focus: storage and networking (using hardware
from Cisco, Seagate, and HP), high-memory computing (Dell), and energy-
efficient computing (HP).
• Designed specifically for reproducible research
• Hard isolation to create many parallel “slices”
![Page 14: The Open Cloud FPGA Testbed · Tag line An MOC-style Catapult testbed and so much more • Enhanced with programmable hardware (FPGA) capabilities not present in other facilities](https://reader033.vdocuments.us/reader033/viewer/2022050305/5f6df09055facb2e490c7e6b/html5/thumbnails/14.jpg)
What is CloudLab?
![Page 15: The Open Cloud FPGA Testbed · Tag line An MOC-style Catapult testbed and so much more • Enhanced with programmable hardware (FPGA) capabilities not present in other facilities](https://reader033.vdocuments.us/reader033/viewer/2022050305/5f6df09055facb2e490c7e6b/html5/thumbnails/15.jpg)
Open CloudLab Concept
![Page 16: The Open Cloud FPGA Testbed · Tag line An MOC-style Catapult testbed and so much more • Enhanced with programmable hardware (FPGA) capabilities not present in other facilities](https://reader033.vdocuments.us/reader033/viewer/2022050305/5f6df09055facb2e490c7e6b/html5/thumbnails/16.jpg)
Research "in" the MOC
logs/usage data
Cloud Users
NESE
Cloud ResearcherCloud Researchers
MOC production cloud
ESI NERC
![Page 17: The Open Cloud FPGA Testbed · Tag line An MOC-style Catapult testbed and so much more • Enhanced with programmable hardware (FPGA) capabilities not present in other facilities](https://reader033.vdocuments.us/reader033/viewer/2022050305/5f6df09055facb2e490c7e6b/html5/thumbnails/17.jpg)
The Open Cloud FPGA Testbed
![Page 18: The Open Cloud FPGA Testbed · Tag line An MOC-style Catapult testbed and so much more • Enhanced with programmable hardware (FPGA) capabilities not present in other facilities](https://reader033.vdocuments.us/reader033/viewer/2022050305/5f6df09055facb2e490c7e6b/html5/thumbnails/18.jpg)
The Open Cloud FPGA Testbed - OCFT
Tag line An MOC-style Catapult testbed and so much more
• Enhanced with programmable hardware (FPGA) capabilities not
present in other facilities available to researchers today
Current FPGAs in the Datacenter: why do we need the OCFT?
• Microsoft Catapult
– No user access
• AWS F1 instances >> and Baidu, Chameleon, TACC, etc.
– Available to users as accelerators, but interactions are restricted
• Various FPGA-centric clusters >> BU, Paderborn, Riken, TACC, Tsukuba
– Very difficult to bring on line, even for a single institution
– Even more difficult to maintain
– HPC-specific rather than general datacenter
![Page 19: The Open Cloud FPGA Testbed · Tag line An MOC-style Catapult testbed and so much more • Enhanced with programmable hardware (FPGA) capabilities not present in other facilities](https://reader033.vdocuments.us/reader033/viewer/2022050305/5f6df09055facb2e490c7e6b/html5/thumbnails/19.jpg)
OCFT for FPGAs in the Datacenter
![Page 20: The Open Cloud FPGA Testbed · Tag line An MOC-style Catapult testbed and so much more • Enhanced with programmable hardware (FPGA) capabilities not present in other facilities](https://reader033.vdocuments.us/reader033/viewer/2022050305/5f6df09055facb2e490c7e6b/html5/thumbnails/20.jpg)
Why OCFT will work
Funding for FPGA-specific system management and customer service
• FTE FPGA engineer
Integration into existing cloud ecosystem
Broader community will be pitching in
• Industry partners, advisory board, beta users
![Page 21: The Open Cloud FPGA Testbed · Tag line An MOC-style Catapult testbed and so much more • Enhanced with programmable hardware (FPGA) capabilities not present in other facilities](https://reader033.vdocuments.us/reader033/viewer/2022050305/5f6df09055facb2e490c7e6b/html5/thumbnails/21.jpg)
Sample Projects
![Page 22: The Open Cloud FPGA Testbed · Tag line An MOC-style Catapult testbed and so much more • Enhanced with programmable hardware (FPGA) capabilities not present in other facilities](https://reader033.vdocuments.us/reader033/viewer/2022050305/5f6df09055facb2e490c7e6b/html5/thumbnails/22.jpg)
How OCFT will be used –
Sample Projects
• Hardware operating system (on the FPGAs)• Drivers, Multitenancy, Handling “Pass-through” system communication
• Development environment• Enable access and programming by system and application developers
• System applications• Compression, security, privacy preserving computation
• User applications – in the node to across the datacenter• Middleware offload – MPI
• Application-aware I/O support through lossy compression
• Massively parallel applications – Large scale physical simulations
• Distributed machine learning
![Page 23: The Open Cloud FPGA Testbed · Tag line An MOC-style Catapult testbed and so much more • Enhanced with programmable hardware (FPGA) capabilities not present in other facilities](https://reader033.vdocuments.us/reader033/viewer/2022050305/5f6df09055facb2e490c7e6b/html5/thumbnails/23.jpg)
The analyst learns only
and nothing else.
Applications: Privacy
• Garbled Circuits in an FPGA cloud. Supported by NSF SaTC grant:
– Massively Scalable Secure Computation Infrastructure Using FPGAs
– In collaboration with Stratis Ioannidis, Northeastern University
• Provide privacy guarantees:
• Good match for FPGAs
• Currently targeting
AWS F1 instances
![Page 24: The Open Cloud FPGA Testbed · Tag line An MOC-style Catapult testbed and so much more • Enhanced with programmable hardware (FPGA) capabilities not present in other facilities](https://reader033.vdocuments.us/reader033/viewer/2022050305/5f6df09055facb2e490c7e6b/html5/thumbnails/24.jpg)
Yao’s Garbled Circuits: Protocol Overview
GARBLER EVALUATOR INPUT OWNERS
TRANSMIT
PROXY OBLIVIOUS TRANSFER Private Inputs
GA
RB
LE
Keys
Garbled Circuit
EVALUATE
PH
AS
E I
PH
AS
E I
IP
HA
SE
III
![Page 25: The Open Cloud FPGA Testbed · Tag line An MOC-style Catapult testbed and so much more • Enhanced with programmable hardware (FPGA) capabilities not present in other facilities](https://reader033.vdocuments.us/reader033/viewer/2022050305/5f6df09055facb2e490c7e6b/html5/thumbnails/25.jpg)
Survey Results
email us if you would like to participate
![Page 26: The Open Cloud FPGA Testbed · Tag line An MOC-style Catapult testbed and so much more • Enhanced with programmable hardware (FPGA) capabilities not present in other facilities](https://reader033.vdocuments.us/reader033/viewer/2022050305/5f6df09055facb2e490c7e6b/html5/thumbnails/26.jpg)
OCFT – Survey Results• Alpha cohort – Herbordt & Leeser research groups, Red Hat
• Beta cohort – Established FPGA/Cloud/HPC research groups. Survey is for Beta cohort.
• Gamma cohort – broader community with certain attributes, particularly the experience to be
able to use this rather than other infrastructure.
Initial list of potential users by affiliationUniversities Replies
• Boston University 2
• Brown
• BYU
• Cornell
• CMU x
• MIT x
• NCSU x
• Northeastern
• Penn
• Stevens
• Tufts x
• U. Arkansas x
• U. Alabama x
• UCSD
• U. Florida x
• U. Miami of Ohio x
• U. Massachusetts x
Universities, cont. Replies
• UNCC
• U. Pittsburgh
• U. Tennessee
• Worcester Polytechnic
• Wash. U. St. Louis x
• W. Michigan
• Yale
National Labs Replies
• Argonne x
• Lawrence Berkeley
• Pacific Northwest x
Industry Replies
• AlgoLogic
• Atomic Rules x
• Comma Corp x
• Gray Research LLC
• Red Hat x
![Page 27: The Open Cloud FPGA Testbed · Tag line An MOC-style Catapult testbed and so much more • Enhanced with programmable hardware (FPGA) capabilities not present in other facilities](https://reader033.vdocuments.us/reader033/viewer/2022050305/5f6df09055facb2e490c7e6b/html5/thumbnails/27.jpg)
Beta user configuration priority
We can’t support everything … What should be the priorities?
First Choice Total
FE1: Catapult2-like – Bump-in-the-Wire 10 10
FE2: Programmable NIC 2 2
FE3: FPGA is the node 0 0
BE1: Bare-metal back-end processor 1 2
BE2: Tightly coupled back-end processor (CCIX) 2 4
BE3: Cluster of directly connected FPGAs 2 5
![Page 28: The Open Cloud FPGA Testbed · Tag line An MOC-style Catapult testbed and so much more • Enhanced with programmable hardware (FPGA) capabilities not present in other facilities](https://reader033.vdocuments.us/reader033/viewer/2022050305/5f6df09055facb2e490c7e6b/html5/thumbnails/28.jpg)
Beta user project types
What will you use the OCFT for?
Project Type first choice total
Cloud and Operating System 6 6
Middleware 2 5
FPGA systems 3 4
FPGA tools 3 6
Provider applications 1 3
Tenant applications 2 3
![Page 29: The Open Cloud FPGA Testbed · Tag line An MOC-style Catapult testbed and so much more • Enhanced with programmable hardware (FPGA) capabilities not present in other facilities](https://reader033.vdocuments.us/reader033/viewer/2022050305/5f6df09055facb2e490c7e6b/html5/thumbnails/29.jpg)
Miscellaneous
Enthusiasm for OCFT (17 replies)13/17 gave as part of their answer some variation of very interested
4/17 gave practical responses of what they would do with OCFT
Tools preference (17 replies)Intel – 11 Xilinx – 12 Generic – 1
Both or would switch – 12/17
HBM? (17 replies)Yes = 8 No = 1 “Nice but” = 2 No reply re HBM = 6
What board? (17 replies)no reply = 2 no preference = 7 Xilinx = 2 Intel = 4 Both = 2
![Page 30: The Open Cloud FPGA Testbed · Tag line An MOC-style Catapult testbed and so much more • Enhanced with programmable hardware (FPGA) capabilities not present in other facilities](https://reader033.vdocuments.us/reader033/viewer/2022050305/5f6df09055facb2e490c7e6b/html5/thumbnails/30.jpg)
FPGA Options
![Page 31: The Open Cloud FPGA Testbed · Tag line An MOC-style Catapult testbed and so much more • Enhanced with programmable hardware (FPGA) capabilities not present in other facilities](https://reader033.vdocuments.us/reader033/viewer/2022050305/5f6df09055facb2e490c7e6b/html5/thumbnails/31.jpg)
Xilinx Alveo Cards for data centers: https://www.xilinx.com/products/boards-and
kits/alveo/u280.html#specifications
![Page 32: The Open Cloud FPGA Testbed · Tag line An MOC-style Catapult testbed and so much more • Enhanced with programmable hardware (FPGA) capabilities not present in other facilities](https://reader033.vdocuments.us/reader033/viewer/2022050305/5f6df09055facb2e490c7e6b/html5/thumbnails/32.jpg)
Intel D5005: https://www.intel.com/content/www/us/en/progra
mmable/products/boards_and_kits/dev-
kits/altera/intel-fpga-pac-d5005/overview.html
![Page 33: The Open Cloud FPGA Testbed · Tag line An MOC-style Catapult testbed and so much more • Enhanced with programmable hardware (FPGA) capabilities not present in other facilities](https://reader033.vdocuments.us/reader033/viewer/2022050305/5f6df09055facb2e490c7e6b/html5/thumbnails/33.jpg)
Advantages and Disadvantages
• Xilinx Alveo 280
– + High Bandwidth Memory (HBM)
– -- Only 2 QSFP28 connections
– Programming: Xilinx Vitis Tool
• Intel D5005
– + 4 QSFP28 connections
– -- No HBM
– Intel OneAPI
![Page 34: The Open Cloud FPGA Testbed · Tag line An MOC-style Catapult testbed and so much more • Enhanced with programmable hardware (FPGA) capabilities not present in other facilities](https://reader033.vdocuments.us/reader033/viewer/2022050305/5f6df09055facb2e490c7e6b/html5/thumbnails/34.jpg)
Responsive to User Community
We want to build what you, our users, want
Discussion, comments, questions, …