operational cosmo demonstrator opcode

24
Federal Department of Home Affairs FDHA Federal Office of Meteorology and Climatology MeteoSwiss Operational COSMO Demonstrator OPCODE André Walser and Oliver Fuhrer MeteoSwiss COSMO-GM, Rome, 5-9 September 2011

Upload: vinnie

Post on 07-Jan-2016

43 views

Category:

Documents


0 download

DESCRIPTION

Operational COSMO Demonstrator OPCODE. COSMO-GM, Rome, 5-9 September 2011. André Walser and Oliver Fuhrer MeteoSwiss. Project overview. Additional proposal to the Swiss HP2C initiative to build an “ OP erational CO SMO DE monstrator (OPCODE) ” Project proposal accepted by end of May - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Operational COSMO Demonstrator OPCODE

Federal Department of Home Affairs FDHAFederal Office of Meteorology and Climatology MeteoSwiss

Operational COSMO Demonstrator OPCODE

André Walser and Oliver Fuhrer

MeteoSwiss

COSMO-GM, Rome, 5-9 September 2011

Page 2: Operational COSMO Demonstrator OPCODE

2 OPCODE | COSMO-GM 2011André Walser ([email protected])

Project overview

• Additional proposal to the Swiss HP2C initiative to build an “OPerational COSMO DEmonstrator (OPCODE)”

• Project proposal accepted by end of May• Start of project 1 June 2011 until end of 2012• Project resources:

• second contract with IT company SCS to continue collaboration until end of 2012

• 2 new positions at MeteoSwiss for about 1 year• Swiss HPC center CSCS• C2SM (collaboration with ETH Zurich and others)

Page 3: Operational COSMO Demonstrator OPCODE

3 OPCODE | COSMO-GM 2011André Walser ([email protected])

Main goals• Leverage the research results of the ongoing HP2C COSMO

project• Prototyp implementation of the COSMO production suite of

MeteoSwiss making aggressive use of GPU technology• MeteoSwiss ready to buy a GPU based hardware for the 2015

production machine• Same time-to-solution on substantially cheaper hardware:

Cray XT4 (3 cabinets)

GPU based hardware(a few rack units)

Page 4: Operational COSMO Demonstrator OPCODE

4 OPCODE | COSMO-GM 2011André Walser ([email protected])

GPU perspectives

GFLOPS per Watt is expected to increase strongly in the next years

Page 5: Operational COSMO Demonstrator OPCODE

5 OPCODE | COSMO-GM 2011André Walser ([email protected])

Workflow on demonstrator

Page 6: Operational COSMO Demonstrator OPCODE

6 OPCODE | COSMO-GM 2011André Walser ([email protected])

COSMO-7 / COSMO-2 suite:

0 34

3h

ass

imila

tion

(2

1 U

TC

)

0-2

4h

fo

reca

st (

00

UT

C)

an

d T

C p

rod

uct

s

Elapsed timein min

3h

ass

imila

tion

(2

1 U

TC

)

0-2

4h

fo

reca

st (

00

UT

C)

an

d T

C p

rod

uct

s

25

-72

h f

ore

cast

(0

0 U

TC

)a

nd

TC

pro

du

cts

1 7 11 46 49

COSMO-2 forecast

COSMO-7 assimilation

COSMO-7 forecast

COSMO-2 assimilation

COSMO-2 TC products

COSMO-7 TC products

61

Current production scheme

• Time-critical post-processing takes about 15 minutes longer than forecasts for both COSMO-2 and COSMO-7

• current bottleneck is post-processing tool fieldextraentire suite has to be optimized for demonstrator

Page 7: Operational COSMO Demonstrator OPCODE

7 OPCODE | COSMO-GM 2011André Walser ([email protected])

Two workpages

• Workpage A: Porting remainig parts of opr COSMO code @ MeteoSwiss to demonstrator

• Workpage B: Porting suite to demonstrator, optimize it, and operate it

Page 8: Operational COSMO Demonstrator OPCODE

8 OPCODE | COSMO-GM 2011André Walser ([email protected])

To use full speed-up, data has to remain on GPU within a time step; sent to CPU for I/O only

Work package A

COSMO workflow:

Input

Physics

Dynamics

Assimilation

Boundary Conditions

Diagnostics

Output

What’s still missing for a full GPU implementation?

Page 9: Operational COSMO Demonstrator OPCODE

9 OPCODE | COSMO-GM 2011André Walser ([email protected])

To use full speed-up, data has to remain on GPU within a time step; sent to CPU for I/O only

Work package A

COSMO workflow:

Input

Physics HPC2

Dynamics HPC2

Assimilation

Boundary Conditions

Diagnostics

Output

What’s still missing for a full GPU implementation?

Page 10: Operational COSMO Demonstrator OPCODE

10 OPCODE | COSMO-GM 2011André Walser ([email protected])

Tasks Work Package A

Task

A1. Dynamical Core: complete/update HP2C code SCS

Page 11: Operational COSMO Demonstrator OPCODE

11 OPCODE | COSMO-GM 2011André Walser ([email protected])

Task A2: Inter-/intra-GPU parallelization

• COSMO requires a communication library with halo-update as well as several other communications (e.g. global reduce, gather, scatter)

• e.g. peer-to-peer:

Page 12: Operational COSMO Demonstrator OPCODE

12 OPCODE | COSMO-GM 2011André Walser ([email protected])

Work Package A

Task

A1. Dynamical Core: complete/update HP2C code SCS

A2. Inter-GPU parallelization: library for halo-updates, global reductions, scatters, gathers CSCS

Page 13: Operational COSMO Demonstrator OPCODE

13 OPCODE | COSMO-GM 2011André Walser ([email protected])

Work Package A

Task

A1. Dynamical Core: complete/update HP2C code SCS

A2. Inter-GPU parallelization: library for halo-updates, global reductions, scatters, gathers CSCS

A3. Interoperability C++/CUDA/Fortran: common compile system, Unified Virtual Addressing SCS

Page 14: Operational COSMO Demonstrator OPCODE

14 OPCODE | COSMO-GM 2011André Walser ([email protected])

A4. Data Assimilation: Porting to GPU

Assimilation part is a huge code!

Page 15: Operational COSMO Demonstrator OPCODE

15 OPCODE | COSMO-GM 2011André Walser ([email protected])

Work Package A

Task

A1. Dynamical Core: complete/update HP2C code SCS

A2. Inter-GPU parallelization: library for halo-updates, global reductions, scatters, gathers CSCS

A3. Interoperability C++/CUDA/Fortran: common compile system, Unified Virtual Addressing SCS

A4. Data Assimilation: Porting to GPU MeteoSwiss

Page 16: Operational COSMO Demonstrator OPCODE

16 OPCODE | COSMO-GM 2011André Walser ([email protected])

Work Package A

Task

A1. Dynamical Core: complete/update HP2C code SCS

A2. Inter-GPU parallelization: library for halo-updates, global reductions, scatters, gathers CSCS

A3. Interoperability C++/CUDA/Fortran: common compile system, Unified Virtual Addressing SCS

A4. Data Assimilation: Porting to GPU MeteoSwiss

A5. I/O: Software layer controling copying of fields from CPU to GPU and vice versa for I/O C2SM?

Page 17: Operational COSMO Demonstrator OPCODE

17 OPCODE | COSMO-GM 2011André Walser ([email protected])

Work Package A

Task

A1. Dynamical Core: complete/update HP2C code SCS

A2. Inter-GPU parallelization: library for halo-updates, global reductions, scatters, gathers CSCS

A3. Interoperability C++/CUDA/Fortran: common compile system, Unified Virtual Addressing SCS

A4. Data Assimilation: Porting to GPU MeteoSwiss

A5. I/O: Software layer controling copying of fields from CPU to GPU and vice versa for I/O C2SM?

A6. Porting other code parts (BC, diagnostics) to GPU SCS

Page 18: Operational COSMO Demonstrator OPCODE

18 OPCODE | COSMO-GM 2011André Walser ([email protected])

Work Package B

Task

B1. Hardware CSCS

B2. System Software CSCS

Page 19: Operational COSMO Demonstrator OPCODE

19 OPCODE | COSMO-GM 2011André Walser ([email protected])

Work Package B

Task

B1. Hardware CSCS

B2. System Software CSCS

B3. COSMO-Package: Porting and optimization of steering scripts MeteoSwiss

Page 20: Operational COSMO Demonstrator OPCODE

20 OPCODE | COSMO-GM 2011André Walser ([email protected])

Work Package B

Task

B1. Hardware CSCS

B2. System Software CSCS

B3. COSMO-Package: Porting and optimization of steering scripts MeteoSwiss

B4. Post-processing: Parallelization of post-processing tools, additional work in fieldextra (partly paid by “COSMO license money”) MeteoSwiss

Page 21: Operational COSMO Demonstrator OPCODE

21 OPCODE | COSMO-GM 2011André Walser ([email protected])

Work Package B

Task

B1. Hardware CSCS

B2. System Software CSCS

B3. COSMO-Package: Porting and optimization of steering scripts MeteoSwiss

B4. Post-processing: Parallelization of post-processing tools, additional work in fieldextra (partly paid by “COSMO license money”) MeteoSwiss

B5. Setup and Testing MeteoSwiss

Page 22: Operational COSMO Demonstrator OPCODE

22 OPCODE | COSMO-GM 2011André Walser ([email protected])

Organization

1.7 FTESCS, CSCS, C2SM

0.9 FTEnew position @MeteoSwiss1 yearstill open

1.9 FTE new collaborator @MeteoSwiss 15 months, CSCS

Page 23: Operational COSMO Demonstrator OPCODE

23 OPCODE | COSMO-GM 2011André Walser ([email protected])

Schedule

Page 24: Operational COSMO Demonstrator OPCODE

24 OPCODE | COSMO-GM 2011André Walser ([email protected])

Thank you !