tony doyle - university of glasgow introduction. tony doyle - university of glasgow 31 august...

36
Tony Doyle - University of Glasgow Introduction

Upload: charity-douglas

Post on 27-Dec-2015

214 views

Category:

Documents


1 download

TRANSCRIPT

Tony Doyle - University of Glasgow

Introduction

31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow

Expression of Interest

Universities of Aberdeen, Dundee, Edinburgh, Glasgow (lead),

Heriot-Watt, St. Andrews, Stirling, Strathclyde1. wide variety of disciplines 2. computing-enabled research3. portals4. training 5. accounting 6. infrastructure management7. distributed computing provision8. three-year project 9. approximately £5m from SFC 10.comparable equipment resources from

partners

31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow

Objectives

• Added Value: a grid-enabled software infrastructure, providing added value to institutional hardware

• Integration: integrated with existing projects in the UK, Europe and internationally

• Leverage: standardised middleware will be exploited

• Virtual Communities: nurture and support• Outreach and Training: reach new

research communities (Scottish companies will also be able to benefit)

31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow

Development Plans

• Pilot phase: prototype grid service (www.scotgrid.ac.uk)

• Boost: R&D at NeSC (www.nesc.ac.uk) • Prototype: e-infrastructure in operation for

four years• Growth: support for growing number of

activities• Evolution: technology development required• Challenges:

– Grid accessible to non-computing researchers– development of discipline-specific portals– information management– integration of different authentication systems

31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow

Institutional Resources

• Substantial investments in computing resources

• ScotGrid will integrate theseDundee: £1m life sciences HPC facility (currently 200

processors, > 100TB), upgrade plannedEdinburgh: £2m Computing and Data Facility (ECDF)

plannedGlasgow: £1m dedicated grid facility (first

components arrived this week) 1 MSI2k, 100 TBStirling: £850k in central servers Aberdeen, Heriot-Watt, St. Andrews and Strathclyde

are supportive

31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow

Accounting and Monitoring

• Ensure fairness and balance of the institutional resources between the different research communities

• Effective distributed accounting provides the basis by which partners can have confidence in resource sharing and in additional resource provision and consumption

• Annual ScotGrid accounts provided to SFC will enable each partner institute to determine their usage charges, providing a basis for long-term sustainability via research grants and the rolling equipment programmes in each university

31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow

SFC Support • Institutional equipment investments are comparable to the support

required from SFC for staff to develop and realise the full potential of ScotGrid

• In addition, support will be needed for meetings, travel, etc• The full economic cost over three years, starting July 2007, is

estimated at around £5m• Project Management: (1.2 FTE) A dedicated project director plus a

fraction of the PI to coordinate the complex technical developments and specialised discipline requirements

• System Management: (4 FTE) Local technical management of the key facilities forming part of ScotGrid

• Grid Operation: (6 FTE) A distributed team of technicians to achieve agreed levels of service across the entire ScotGrid infrastructure

• Accounting Support: (1 FTE) Development of accounting support to ensure fair and well-managed use of ScotGrid facilities

• Portal Development: (4 FTE) A team of specialists to develop the specialised portals required by client disciplines

• Training and Outreach: (2 FTE) A team of specialists to determine the development and training needs of client disciplines

31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow

Sustainability

• There is a long-term demand for a Grid over the next ten years

• Medium-term sustainability will depend upon close co-operation

• The project is designed to be fully inclusive• The ScotGrid resource will help sustain the leading position

Scotland has in e-Science and Grid research, and ensure that the Scottish research community is well positioned for future research funding opportunities

• continued support via grant applications for new research growing out of the project and from the rolling equipment programmes in each University

• A summary report of research income generated and/or supported using the infrastructure will be compiled in 2009 to enable SFC and the partners to determine the most appropriate sustainable funding model from 2010 and beyond

31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow

SFC feedback (17/7/06)

• The proposal was considered outwith the discussion of proposals to the SRDG periodic competition

• The Panel considered that the proposal met a strategic need for Scotland

• It would be ideally suitable for consideration for funding under the Enhancing Excellence (pooling) funding stream of SRDG

• Two key issues for the proposal: (i) what parts of infrastructure are provided

already at the UK level(ii) be clear on the pan-Scottish dimension

31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow

SFC further feedback (26/7/06)

A. need to understand the relationship and synergy with the Informatics Pooling bid

B. Highlight the "service" aspects of the ScotGrid proposition

C. The outcome of the proposal would be for assessment next year

D. Institutions involved should consider how they can contribute

E. Scottish Enterprise could be involved particularly in knowledge transfer and core technology aspects

F. Approach Abertay w.r.t. their formal training (linked to informatics pooling)

G. New senior appointments in this area should be considered to build up the area of Grid research

31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow

A further set of points were identified:4. Industrial liaison should be incorporated -

testbed encouraged5. A gap analysis should be performed6. Investigate Grid-associated studentships 7. Service existing research pooling efforts8. An overview document should be prepared 9. Indicate how many users are anticipated

31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow

Broader Issues

a. Technical issues should be discussed (in the appendices)

b. International review committee will judge

c. Develop links with co-funders (e.g. Research Councils)

d. EPSRC must be approached re: supporting academic investments. (PPARC are already aware).

31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow

Benefits

Establishment of a Research e-Infrastructure leading to:

1. Expansion in terms of partners and services2. Robust operations and management procedures3. Interoperability with other national and

international e-infrastructures4. Integration of key data sources5. Improved understanding of data provenance6. Improved and measured ScotGrid reliability7. Supported research across a range of domains8. Access by non-Grid focused researchers9. Service definitions for a sustainable infrastructure10.Industrial partners testing/utilising Grid services

31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow

Timeline

In terms of defining a timeline:i. need to identify contacts at all Institutesii. SFC (David Gani) will meet with partners on

November 6th iii. focus on enhanced service aspects and identifying

the core Grid technologies • Three meetings currently planned:

– Today Edinburgh– November 6th Glasgow– End-December TBC

Internal aim (for discussion): appendices by the end of the yearproposal by SFC-requested timescale

31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow

Summary

• Expression of Interest submitted at end of May 2006

• Draft proposal developed up to mid-May 2006 was used to guide internal discussions

• Received prompt, positive feedback to EoI from SFC in July

• Need to build collaboration and review draft proposal in light of SFC feedback and today’s inputs

Talk Title

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

Grids and Business – key points

• Trust - Not used to sharing resources

• Security - Sensitive data with sensitive applications

• Business models – what can be charged for as a service

• Guaranteed QoS – Service Level Agreements

• Accounting - tracking resources usage in multi-admin context

• Standards – to encourage long-term investment

• Applications – need to support legacy applications

• Portability – across multiple platforms and implementations

• Open source support – robust reference implementation

• Software license management – how to generate revenue ina grid context

EGEE-II provides an excellent framework for collaborating with business on these subjects

Tony Doyle - University of Glasgow

Background

31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow

News: First Meeting of ScotGrid Partners to develop Pan-Scottish Grid

News: Glasgow prepares with new Grid servers

31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow

Monday

Wednesday

31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow

Accounting

June 2002 June 2006

Past CPU performance •3,061,049 CPU hours •383,858 completed jobs.•> 100 users, >30 VOsNeed to:•Publish data automatically •Agree the conversion rates £: [CPU, disk, manpower]•Agree internal charging

31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow

31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow

PP Grid OverviewAim: by 2008 (full year’s data

taking)- CPU ~100MSi2k (100,000

CPUs)- Storage ~80PB - Involving >100 institutes

worldwide

- Build on complex middleware being developed in advanced Grid technology projects, both in Europe (Glite) and in the USA (VDT)

1. Prototype went live in September 2003 in 12 countries

2. Extensively tested by the LHC experiments in September 2004

3. 197 sites, 13,797 CPUs, 5PB storage in September 2005

4. Today: 177 active sites, 28,574 CPUs, 45PB storage

31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow

MiddlewareRequirement

Storage Element

Basic File Transfer

Reliable File Transfer

Catalogue Services

Data Management tools

Compute Element

Workload Management

VO Agents

VO Membership Services

DataBase Services

Posix-like I/O

Application Software Installation Tools

Job Monitoring

Reliable Messaging

Information System

15 Baseline Services for a functional Grid

Scottish (and National) Grid Service will rely upon gLite components

This middleware (now at v3.0) builds upon VDT (Globus and Condor) and meets the requirements of all the low-level, basic scientific use cases:

1. Green (amber) areas are (almost) agreed as part of the shared generic middleware stack by each of the application areas

2. Red are areas where generic middleware competes with application-specific software.

www.glite.org

31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow

Glasgow Edinburgh

Leicester Oxford

London

Netherlands

Publically Curated Data

Private data

Private data

Private data

Private data

Private data

Private data

CFG Virtual Organisation Ensembl

MGI

HUGO

OMIM

SWISS-PROT

… DATA HUB

RGD

Middleware

Generic User

Services

31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow

1. Glasgow/Edinburgh/Durham/ (Dundee) Project

2. Compute-intensive jobs performed at Glasgow and Durham

3. Data-intensive jobs performed at Edinburgh

4. First applications using the Grid. Meeting real requirements of Grid applications: currently Particle Physics, Bioinformatics,

Computing Science, Electrical Engineering

5. Glasgow leading R&D in Grid Data Management, Security, Portals

6. Edinburgh leading User Training, R&D in Networking and Storage

7. Management part of a worldwide Grid infrastructure through GridPP and EGEE

What was?

31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow

What are the Grid challenges?

Data Management, Security and

Sharing

1. Software process2. Software efficiency3. Deployment

planning 4. Link centres

5. Share data

6. Manage data7. Install software8. Analyse data9. Accounting

10. Policies

31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow

31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow

Summary• ScotGrid is part of an evolving National

and International Grid• NGS, EGEE and OMII-UK are important

partner organisations• Deployment, middleware and application

support will each raise issues for ScotGrid• External developments place constraints

on the hardware and software• Timeline is ~right for spin-off

Tony Doyle - University of Glasgow

Planning…

31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow

Work Plan• We are developing a more detailed work plan…• International leadership of the participating departments, their

proven ability to collaborate successfully, their needs for large-scale computing, and their stated intentions to exploit a Scottish Grid

• Each discipline has its own application needs, pressing timescales, and expectations of large-scale computing

• The project will provide the framework needed for synergy and economies of scale

• Deployment plans to: – expand the user base– include new provider sites– create a training programme– enhance current service provision and establish user

support• This will be integrated with the work of the UK and European Grid

services • A top-down approach will draw on institutional strengths and

strategic plans for computing• A bottom-up approach will focus on requirements of each discipline,

building unified services across Scotland

31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow

Stakeholders

Stakeholder Interest / stake Importance University Computing Services Road mapping and resource provision High Higher Education Colleges Accessibility of facilities Medium Researchers Standard interfaces and access to

resources of appropriate scale High

Middleware Providers Interoperability standards, established via deployment practice

Medium

Industrialists Using (open) standards to establish commercial systems

High

Research Councils Optimising use of infrastructure investments

High

Scottish Funding Council Developing national e-infrastructure

High

Community and project infrastructure projects Interoperability and sustainability High

UKERNA/JISC Grid as a service above JANET network provision

Medium

31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow

Project Management

31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow

Risk analysisName Likelihood

(1-4) Impact

(1-4) Risk

(L x I) Action to Mitigate Risk

Staffing 2 3 6 Build on existing expertise. Organisational 3 3 9 Define/build/agree collaboration

structure. Clarify the role of ScotGrid and its interactions.

Technical 3 2 6 Adopt a conservative approach to technology deployment.

External suppliers 3 1 3 Work with developers and partner infrastructures.

Insufficient support for a focussed e-infrastructure

2 4 8 Ensure funding is in place early and work to increase the scale and relevance of ScotGrid.

Service under utilised 3 2 6 Establish user board and survey initial requirements.

Service over subscribed 3 2 6 Work with partners and affiliates to increase hardware via annual review.

Inadequate support infrastructure

2 3 6 Monitor performance of support activities via pre-defined metrics.

Lack of international compatibility

1 4 4 Active engagement in NGF, GGF, EGEE.

Failure to integrate e-Infrastructure authentication

2 3 6 Review authentication early in the project – agree processes, recommend UK CA.

Security compromise 1 4 4 Work with other e-Infrastructure providers. Limit capability through portals. Key part of user training.

31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow

Middleware:

Baseline Services

Requirement OMII VDT/GT gLite Other Comment 1. Storage Element

Access to data/storage

through OGSA-DAI functionality

Yes SRM via dCache or

DPM

gLite includes Storage Resource Management capability

2. Basic File Transfer

Yes GridFTP Yes gLite and OMII include GridFTP

3. Reliable File Transfer

Remote File Transfer

File Transfer Service

FTS being tested by OMII

4. Catalogue Services

UDDI registry via

GRIMOIRES

RLS LCG File Catalogue

Central catalogues adequate, high throughput needed

5. Data Management tools

OMII incorporates OGSA-DAI

LCG tools (replica

management)

SRB gLite File Placement Service under development

6. Compute Element

OMII Job Service

Gatekeeper Yes gLite uses Globus (GT2)

7. Workload Management

GridSAM and manual job submission

Condor-G Resource Broker

RB builds on Globus, Condor-G. Migrate to GT4?

8. VO Agents Perform localised activities on behalf of VO

9. VO Membership Services

Tools for account

management, no

GridMapFile equivalent

CAS VOMS CAS does not provide all the needed functionality

10. DataBase Services

OGSA-DAI MySQL, PostgreSQL, ORACLE

Off–the-shelf offerings are adequate

11. Posix-like I/O

GFAL Xrootd Application access to data

12. Application Software Installation Tools

PACMAN Yes Tools already exist in EGEE

13. Job Monitoring

Monalisa, Netlogger

Logging & Bookkeeping

service, R-GMA

14. Reliable Messaging

Supported through FIRMS

Tools such as Jabber are used

15. Information System

MDS (GLUE)

Yes BDII EGEE based on BDII and GLUE schema

31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow

External Relations

Applications

Application Specific Middleware

Generic Grid Middleware

Infrastructure

BRIDGES

OMII

GridPP

SBRN

Scot Grid

PPA

BIO

VOTES

CDF ATLAS

NGS

EEE

IR NW

EGEE …

nanoCMOS

Vis

31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow

Management

Steering

Committee

Management Board

User Board

Technical Board

Need a Shadow

Management

Structure(to prepare proposal)