Recent Trends in Operating Systems and their Applicability to HPC Arthur Maccabe, Patrick Bridges Ron Brightwell, Rolf Riesen University of New Mexico Sandia National Laboratories May 11, 2006 Lugano, Switzerland


Recent Trends in Operating Systems and their Applicability to HPC

Arthur Maccabe, Patrick Bridges

Ron Brightwell, Rolf Riesen

University of New Mexico, Sandia National Laboratories

May 11, 2006, Lugano, Switzerland

Currently: Director, Computer Science and Mathematics Division, Oak Ridge National Laboratory

Variety, variety, variety

• lightweight kernels: Catamount, CNK

• hypervisors: Xen, VMware

• microkernels: L4, Mach


Microkernels

• Minimal services

  • policy versus mechanism

  • address spaces, control (threads), message passing

• Servers

  • trampoline

[Figure: applications and servers (Server 1, Server 2) run on top of the microkernel (Mach, L4), which runs on the hardware.]
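
The split between kernel mechanism and server policy can be illustrated with a small sketch. This is a toy model, not code from Mach or L4: the `Microkernel`, `FileServer`, and `app_read` names, the mailbox scheme, and the file contents are all assumptions made for illustration. The kernel only delivers messages; the decision about what a "read" means lives in a user-level server.

```python
from collections import deque


class Microkernel:
    """Mechanism only: registers endpoints and moves messages between them."""

    def __init__(self):
        self.mailboxes = {}  # endpoint name -> queue of (sender, payload)

    def register(self, name):
        self.mailboxes[name] = deque()

    def send(self, sender, receiver, payload):
        # The kernel never interprets the payload; it only delivers it.
        self.mailboxes[receiver].append((sender, payload))

    def receive(self, name):
        box = self.mailboxes[name]
        return box.popleft() if box else None


class FileServer:
    """Policy lives in a user-level server (hypothetical example service)."""

    def __init__(self, kernel, name="fs"):
        self.kernel = kernel
        self.name = name
        self.files = {"/etc/motd": "hello"}  # made-up contents
        kernel.register(name)

    def serve_one(self):
        msg = self.kernel.receive(self.name)
        if msg is None:
            return False
        sender, (op, path) = msg
        reply = self.files.get(path, "") if op == "read" else "unsupported"
        self.kernel.send(self.name, sender, reply)
        return True


def app_read(kernel, app, server, path):
    """Application-side stub: send a request, let the server run, take the reply."""
    kernel.send(app, server.name, ("read", path))
    server.serve_one()
    return kernel.receive(app)[1]


kernel = Microkernel()
fs = FileServer(kernel)
kernel.register("app1")
result = app_read(kernel, "app1", fs, "/etc/motd")
```

Replacing `FileServer` with a different server changes the policy without touching `Microkernel`, which is the point of the separation.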


Hypervisors

[Figure: the hypervisor (Xen) runs on the hardware. Domain 0 (host OS) and guest OSes 1 through n (Domains 1 through n), each running its own apps, run on top of the hypervisor.]

• Hypervisor virtualizes hardware

  • goal is to run multiple OSes

  • direct access to hardware is preferred

• Xen (para)virtualizes Processor, MMU, and basic I/O

• Additional I/O virtualization done by Domain 0
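
The division of labor described above can be sketched as follows. This is a simplified model under stated assumptions, not Xen code: the `Hypervisor` and `Domain0` classes, their method names, and the page and frame numbers are hypothetical. Guests ask the hypervisor to update their memory mappings, and I/O that the hypervisor does not handle itself is passed to Domain 0.

```python
class Domain0:
    """Hypothetical privileged domain that owns the real device drivers."""

    def __init__(self):
        self.log = []  # (guest domain id, device, data) for each request seen

    def handle_io(self, dom_id, device, data):
        self.log.append((dom_id, device, data))
        return len(data)  # pretend every byte was written


class Hypervisor:
    """Toy hypervisor: tracks one page table per domain, forwards extra I/O."""

    def __init__(self, dom0):
        self.dom0 = dom0
        self.page_tables = {0: {}}  # domain id -> {virtual page: machine frame}

    def create_guest(self, dom_id):
        self.page_tables[dom_id] = {}

    def mmu_update(self, dom_id, vpage, frame):
        # A paravirtualized guest requests the mapping change from the
        # hypervisor instead of writing its page tables directly.
        self.page_tables[dom_id][vpage] = frame

    def io_request(self, dom_id, device, data):
        # Additional I/O virtualization is delegated to Domain 0.
        return self.dom0.handle_io(dom_id, device, data)


dom0 = Domain0()
xen = Hypervisor(dom0)
xen.create_guest(1)
xen.create_guest(2)
xen.mmu_update(1, vpage=0x10, frame=0x8000)
xen.mmu_update(2, vpage=0x10, frame=0x9000)
written = xen.io_request(1, "disk", b"block")
```

The two guests map the same virtual page to different frames, which is the isolation that lets several OSes share one machine.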


Lightweight Operating Systems

• Catamount

  • SUNMOS, Puma/Cougar

  • Catamount, Portals

• Blue Gene/L

  • Compute Node Kernel (CNK)

  • I/O Nodes (Linux)


Blue Gene/L CNK

[Figure: two groups of nodes, each with one I/O node running Linux and three compute nodes, each running an App on CNK on its own hardware.]

• I/O nodes

  • run Linux

  • have storage resources

  • separate I/O network

• Compute nodes

  • run lightweight kernel

  • high-speed, partitionable network


Catamount

• QK – mechanism

  • communication

  • address spaces

• PCT – policy

  • finding servers

• Wrapper lib

  • wrapper for stdio calls

  • RPC to I/O node

[Figure: the QK (Quintessential Kernel) runs on the hardware. The PCT (Process Control Thread) and Applications 1 through n, each linked with a wrap lib, run on top of the QK. An I/O node (Linux) is shown alongside.]
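
The wrapper-library idea (stdio-style calls on the compute node turned into RPCs to an I/O node) can be sketched as below. This is an illustration only, not Catamount code: `WrapLib`, `IONode`, the request tuple format, and the file path are hypothetical, and the "RPC" is a direct function call standing in for a network transport.

```python
class IONode:
    """Stand-in for the Linux I/O node; keeps files in memory."""

    def __init__(self):
        self.files = {}  # path -> bytearray

    def handle(self, request):
        op, path, data = request
        if op == "write":
            self.files.setdefault(path, bytearray()).extend(data)
            return len(data)
        if op == "read":
            return bytes(self.files.get(path, b""))
        raise ValueError(f"unsupported operation: {op}")


class WrapLib:
    """Compute-node side: looks like local file I/O, but ships every call out."""

    def __init__(self, rpc):
        self.rpc = rpc  # callable that delivers a request to the I/O node

    def fwrite(self, path, data):
        return self.rpc(("write", path, data))

    def fread(self, path):
        return self.rpc(("read", path, None))


io_node = IONode()
lib = WrapLib(io_node.handle)  # direct call in place of a real network RPC
n = lib.fwrite("/scratch/out.dat", b"result=42\n")
contents = lib.fread("/scratch/out.dat")
```

Because the application only sees `fwrite` and `fread`, the compute-node kernel needs no file system of its own.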


Linux, the 800 Pound Penguin

• Imagine that you are a “small” computer company in the US

• One customer believes in lightweight OSes

• Another demands Linux

• You can’t afford to support the code bases for two OSes

• What do you do?

• The world is waiting for your answer....


What does Linux do?

[Figure: hourglass with Linux at the waist. Below: video cards, disk controllers, network cards, I/O buses, I/O devices. Above: TCP/IP, MPI, MPI application, telnet, ssh, POSIX, emacs, terminal, mail client, gcc, glibc.]

• Provides a wide range of services

  • libraries

  • development environment

  • work environment

• Works on a wide range of hardware

  • graphics cards

  • I/O buses

  • flaky stuff.....

• Hourglass design


What does Linux do in HPC?

[Figure: the same hourglass with far less hardware below Linux: one network card, one I/O bus, one I/O device. Above: TCP/IP, MPI, MPI application, telnet, ssh, POSIX, emacs, terminal, mail client, gcc, glibc.]

• Don’t really have that many devices

  • No disks

  • none of it is flaky :)

• Must be the services

  • Probably not mail, emacs, or the terminal emulator....

  • “Real men read their mail on a Paragon”


Lightweight Linux?

I’m busy planning to rule the world!

Well, good luck with that.


Running Linux on BG/L

• Seems like a “no-brainer”

  • some people will tell you that BG/L already runs Linux....

• It’s not.....

  • “exec” is reasonable, but what does “fork” mean?

  • what is the right tradeoff for resources allocated to Linux?

• Is that really Linux on the I/O nodes?

[Figure: the Blue Gene/L layout again: two groups of nodes, each with one I/O node running Linux and three compute nodes, each running an App on CNK on its own hardware.]


Linux on Catamount

• Basic idea

  • QK == Xen

  • PCT == Dom 0

• QK virtualization

  • PCT builds address spaces

  • PCT can run contexts

  • Portals for network

• Use XenoLinux

  • emulate Xen hypercalls

  • no modification of XenoLinux

[Figure: the QK runs on the hardware. Above it run the PCT (Process Control Thread), Applications 1 and 2, and a Linux guest hosting its own apps.]


Xen Hypercalls

Hypercall          Meaning
set_callbacks      normal and “failsafe” handlers
sched_op_new       yield, block, shutdown, poll
mmu_update         update page table entries
stack_switch       change the stack
fpu_taskswitch     next use of FPU faults
memory_op          increase/decrease memory allocation
event_channel_op   inter-domain event-channel management
physdev_op         BIOS replacement
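
Emulating hypercalls for an unmodified XenoLinux amounts to a dispatch table from hypercall name to a handler on the host side. The sketch below is a toy model under stated assumptions: the `Wedge` class, its argument formats, its return values, and the `build_address_space` and `quit_quantum` trace strings are hypothetical and do not follow the real Xen ABI or the Catamount interfaces. Only three of the hypercalls in the table are handled; the rest raise an error.

```python
class Wedge:
    """Toy emulation layer: maps Xen-style hypercall names onto host operations."""

    def __init__(self):
        self.page_table = {}  # virtual page -> frame
        self.callbacks = {}
        self.trace = []       # host-side requests the wedge would issue
        self.handlers = {
            "set_callbacks": self.set_callbacks,
            "mmu_update": self.mmu_update,
            "sched_op_new": self.sched_op_new,
        }

    def hypercall(self, name, *args):
        handler = self.handlers.get(name)
        if handler is None:
            raise NotImplementedError(name)
        return handler(*args)

    def set_callbacks(self, normal, failsafe):
        # Remember the guest's normal and "failsafe" handlers.
        self.callbacks = {"normal": normal, "failsafe": failsafe}
        return 0

    def mmu_update(self, updates):
        # Apply page table entry updates, then note the host request.
        for vpage, frame in updates:
            self.page_table[vpage] = frame
        self.trace.append("build_address_space")
        return len(updates)

    def sched_op_new(self, op):
        if op == "yield":
            self.trace.append("quit_quantum")
            return 0
        raise NotImplementedError(op)


wedge = Wedge()
applied = wedge.hypercall("mmu_update", [(1, 100), (2, 200)])
status = wedge.hypercall("sched_op_new", "yield")
```

Keeping the table in one place means a new XenoLinux release that adds or changes a hypercall only requires a new handler entry.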


Catamount Mechanisms

[Figure: the QK runs on the hardware. The PCT (Process Control Thread) and Applications 1 and 2, each with a wrap lib and an MB, run on top of the QK. Interface labels: “Portals + Quit Quantum, Illegal instruction, Illegal address”; “Portals + Build Address Space, Run Context”; “Interrupts: Quantum, Network”; “illegal instr, addr”; “quit quantum”; “portals”.]


A more realistic picture

• Start with XenoLinux

  • minimize modifications

  • build a wedge to provide QK interface

  • wedge could support page table construction

• Extend PCT and QK to support XenoLinux

  • minimize impact on Catamount applications

  • minimize changes to QK

[Figure: QK' runs on the hardware. Above it sit the PCT with a PCT wedge, XenoLinux on a Xen wedge hosting Linux apps, and a Catamount application. An MB is also shown.]


Space Sharing

[Figure: two nodes, each with a PCT and a Xen wedge on QK' on its own hardware. One node runs XenoLinux (with a PCT wedge and an MB) hosting a Linux app; the other runs a Catamount application.]

Never forget that the real goal is to run a single application per node (multiple processes, multiple threads)


Why Linux on Catamount?

• Linux is not free

• Initial port and optimization

• Linux evolves and requires updates

• Does “lightweight” Linux exist?

• Catamount currently works and scales

• not clear that Linux will scale

• Catamount doesn’t evolve :) :)

• Use XenoLinux on Catamount

• XenoLinux will evolve: evolve wedge, then PCT; QK only when necessary

• Minimal number of supported code bases


FAST-OS

Forum to Address Scalable Technology for runtime and Operating Systems


Project          Activity
Colony           Virtualization on minimal Linux with SSI services
Config           Combine micro services to build app-specific OS
DAiSES           Adaptation of OS based on Kperfmon & Kerninst
K42              Enhance applicability of K42 for HEC OS research
MOLAR            Modules to config and adapt Linux + RAS & fSM
Peta-Scale SSI   Intersection of big (SMP) and small (node) kernels
Right-Weight     Build application-specific Linux/Plan 9 kernels
Scalable FT      Implicit, explicit, incremental checkpointing & resilience
SmartApps        Vertical integration between SmartApps and K42
ZeptoOS          Ultralight Linux, collective runtime, measure & FT


FAST-OS

H = High, M = Medium

Areas (columns): Linux, Virtualization, Adaptability, Usage Models, Metrics, Fault Handling, Common API, SSI, Collective RT, I/O, OS Noise

Ratings per project (blank cells omitted):

Colony           H M H M H M H
ConfigOS         H M H M M M
DAiSES           H H M
K42              H H H M M
MOLAR            H H H H M M
Peta-Scale SSI   H H H H H
Rightweight      M H M M H
Scalable FT      H M H M
SmartApps        M H H M
ZeptoOS          H H H H H


Partners

Project        Lead        Academic             Industrial
Colony         LLNL        UIUC                 IBM
Config         SNL         UNM, Caltech
DAiSES         UTEP        Wisconsin            IBM
K42            LBNL        Toronto, UNM         IBM
MOLAR          ORNL        LaTech, OSU, NCSU    Cray
SSI            ORNL        Rice                 HP, CFS, SGI, Intel
Right-Weight   LANL                             Bell Labs
Scalable FT    PNNL        LANL, UIUC           Quadrics, Intel
SmartApps      Texas A&M   LLNL                 IBM
ZeptoOS        ANL         Oregon


FAST-OS

• PI meeting/workshop (open meeting)

• with USENIX in Boston, May 30 & 31

• http://www.cs.unm.edu/~fastos

• Most recent issue of ACM OSR

“Linux’s cleverness is not in the software, but in the development model”

Rob Pike, “Systems Software Research is Irrelevant,” 2/2000