high end computing at cardiff university focus on campus grids james osborne

30
High End Computing at Cardiff University Focus on Campus Grids James Osborne

Upload: lynn-campbell

Post on 23-Dec-2015

212 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: High End Computing at Cardiff University Focus on Campus Grids James Osborne

High End Computing at Cardiff University

Focus on Campus Grids

James Osborne

Page 2: High End Computing at Cardiff University Focus on Campus Grids James Osborne

Contents

High End Computing SpectrumFacilities at CardiffCondor at CardiffSuccess StoriesHigh End Computing FuturesQuestions

Page 3: High End Computing at Cardiff University Focus on Campus Grids James Osborne

High End Computing Spectrum

Page 4: High End Computing at Cardiff University Focus on Campus Grids James Osborne

The HEC Spectrum

HPCTightly Coupled

Supercomputers

NUMA Machines£ Million+

HTCLoosely Coupled

Small Clusters

Campus Grids£ Thousand

£ H Thousand

Large Clusters

SMP£ H Thousand

£ Million

Page 5: High End Computing at Cardiff University Focus on Campus Grids James Osborne

The HPC End

HPCTightly Coupled

Supercomputers

Bluegene L

131,072 CPUs

Page 6: High End Computing at Cardiff University Focus on Campus Grids James Osborne

The HTC End

HTCLoosely Coupled

Campus Grids

Condor@Cardiff

600+ CPUs

Page 7: High End Computing at Cardiff University Focus on Campus Grids James Osborne

Facilities at Cardiff

Page 8: High End Computing at Cardiff University Focus on Campus Grids James Osborne

Facilities at Cardiff - Helix

Large ClustersHelix

200 CPUs

Owned by PHARM, CHEMY, EARTH, BIOSI

Page 9: High End Computing at Cardiff University Focus on Campus Grids James Osborne

Facilities at Cardiff - SGI

Small ClustersSGI Origin 300

32 CPUs

Owned by WeSC

Page 10: High End Computing at Cardiff University Focus on Campus Grids James Osborne

Facilities at Cardiff - Condor

Campus GridsCondor@Cardiff

600+ CPUs

Owned by insrv

Page 11: High End Computing at Cardiff University Focus on Campus Grids James Osborne

Condor at Cardiff

Page 12: High End Computing at Cardiff University Focus on Campus Grids James Osborne

What is Condor ?

Condor is a software system that creates a High-Throughput Computing (HTC) environment

Condor effectively utilizes the computing power of workstations that communicate over a network

Condor's power comes from the ability to effectively harness resources under distributed ownership

Page 13: High End Computing at Cardiff University Focus on Campus Grids James Osborne

What is a Condor Pool ?

A pool is a collection of workstations that communicate over a network

Central Manager

master

collector

negotiator

schedd

startd

= ClassAd Communication Pathway

= Process Spawned

Submit-Only

master

schedd

Execute-Only

master

startd

Execute-Only

master

startd

Page 14: High End Computing at Cardiff University Focus on Campus Grids James Osborne

What is a Condor Job ?

A command line windows executable All files in a self-contained directory structure Condor runs jobs in a sandbox ..\execute\... Condor runs jobs as user condor-reuse-vm1

One or more input filesOne or more output filesA submit script

One or more logs – useful for debugging

Page 15: High End Computing at Cardiff University Focus on Campus Grids James Osborne

What Goes In A Submit Script ?

Running myprog 100 timesuniverse = vanilla

executable = myprog.exe

input = myin.$(PROCESS)

output = myout.$(PROCESS)

error = myerr.$(PROCESS)

queue 100

Page 16: High End Computing at Cardiff University Focus on Campus Grids James Osborne

What Else Can Go In ?

root_dir = c:\mydirectory

transfer_files = ALWAYS

transfer_input_files = $(ROOT_DIR)\afile.txt

transfer_output_files = $(ROOT_DIR)\afile.txt

log = mylog.$(PROCESS)

notification = NEVER | ERROR

arguments = -arg1 -arg2

Page 17: High End Computing at Cardiff University Focus on Campus Grids James Osborne

What Else Can Go In ?

requirements = OpSys == “WINNT51”

Machine == “hostname.cf.ac.uk”

Page 18: High End Computing at Cardiff University Focus on Campus Grids James Osborne

How Do I Submit A Job ?

In the first instance by sending all your files to [email protected] to allow us to tailor your jobs to our environment

In time by seeking permission to submit your own jobs to [email protected] to allow us to enable your workstation as a submit host Currently requires IP address change

Page 19: High End Computing at Cardiff University Focus on Campus Grids James Osborne

How Do I Submit A Job ?

Submitting your jobcondor_submit myscript.sub

Checking your job’s progresscondor_q

Checking the poolcondor_status

Page 20: High End Computing at Cardiff University Focus on Campus Grids James Osborne

Terms of Use

Any local researcher can use the campus grid on the proviso that they… write a short summary of their research that

we can use to publicise their use of the campus grid

provide references to journal articles and conference proceedings containing appropriate acknowledgements

Page 21: High End Computing at Cardiff University Focus on Campus Grids James Osborne

Success Stories

Page 22: High End Computing at Cardiff University Focus on Campus Grids James Osborne

Prof Tim Wess

OPTOMX-Ray DiffractionDetermine shape of moleculesTime on a single workstation = 2-3 DaysTime on the campus grid = 2-3 HoursSpeed-up factor of ~20

Chair of Non-Crystalline Diffraction Community & Chair of CCP13 for Non-Crystalline Materials

Page 23: High End Computing at Cardiff University Focus on Campus Grids James Osborne

Prof Tim Wess

“This capability provides the final link in the chain that Cardiff has established to solve macromolecular structures”

“Our involvement with synchrotron sources such as DIAMOND … and the residence of CCP 13 … ensures that we are well placed to be in the vanguard of structure determination”

Chair of Non-Crystalline Diffraction Community & Chair of CCP13 for Non-Crystalline Materials

Page 24: High End Computing at Cardiff University Focus on Campus Grids James Osborne

Soyeon Lee

CARBSMontecarlo Simulation20,000 parameters for 90 different modelsTime on a single workstation = 42 DaysTime on the campus grid = 2 DaysSpeed-up factor of ~20

Research Student

Page 25: High End Computing at Cardiff University Focus on Campus Grids James Osborne

Dr Kevin Ashelford

BIOSIDistributed SearchIdentify corrupt records in a DNA databaseTime on a single workstation = 2.4 YearsTime on the campus grid = 2.6 WeeksSpeed-up factor of ~50

Research Fellow

Page 26: High End Computing at Cardiff University Focus on Campus Grids James Osborne

Dr Kevin Ashelford

“This is a significant contribution to microbial research and will hopefully be the required impetus for the world-wide research community to improve current methods”

Page 27: High End Computing at Cardiff University Focus on Campus Grids James Osborne

High End Computing Futures

Page 28: High End Computing at Cardiff University Focus on Campus Grids James Osborne

The HEC Spectrum

HPCTightly Coupled

Supercomputers

NUMA Machines£ Million+

HTCLoosely Coupled

Small Clusters

Campus Grids£ Thousand

£ H Thousand

Large Clusters

SMP£ H Thousand

£ Million

SGI Origin 300

Helix Condor@Cardiff

Page 29: High End Computing at Cardiff University Focus on Campus Grids James Osborne

The HEC Spectrum

HPCTightly Coupled

Supercomputers

NUMA Machines£ Million+

HTCLoosely Coupled

Small Clusters

Campus Grids£ Thousand

£ H Thousand

Large Clusters

SMP£ H Thousand

£ Million

SGI Origin 300

SRIF 3

Helix Condor@Cardiff