data management & information systems

37
EGEE-II INFSO-RI- 031688 Enabling Grids for E-sciencE www.eu-egee.org www.glite.org EGEE and gLite are registered trademarks Data Management & Information Systems Markus Schulz – SA3 - CERN OGF - EGEE-II User Forum Manchester - 9 May 2007

Upload: shania

Post on 12-Jan-2016

24 views

Category:

Documents


0 download

DESCRIPTION

Data Management & Information Systems. Markus Schulz – SA3 - CERN OGF - EGEE-II User Forum Manchester - 9 May 2007. Disclaimer. Material that went into this presentation has been provided by many developers from inside JRA1, SA3, NA4 and external contributors Many thanks Ask questions!. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Data Management &  Information Systems

EGEE-II INFSO-RI-031688

Enabling Grids for E-sciencE

www.eu-egee.orgwww.glite.org

EGEE and gLite are registered

trademarks

Data Management & Information Systems Markus Schulz – SA3 - CERN

OGF - EGEE-II User Forum

Manchester - 9 May 2007

Page 2: Data Management &  Information Systems

OGF - EGEE II User Forum - Manchester - 9 May 2007 2

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

OGF - EGEE II User Forum - Manchester - 9 May 2007 2EGEE-II INFSO-RI-031688

Disclaimer• Material that went into this presentation has been provided by many

developers from inside JRA1, SA3, NA4 and external contributors

– Many thanks

Ask questions!

Page 3: Data Management &  Information Systems

OGF - EGEE II User Forum - Manchester - 9 May 2007 4

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

OGF - EGEE II User Forum - Manchester - 9 May 2007 4EGEE-II INFSO-RI-031688

EGEE Data Management

lcg_utilsFTS

Vendor Specific

APIs

GFAL Cataloging Storage Data transfer

Data Management

User ToolsVOFrameworks

(RLS) LFC SRM(Classic

SE)gridftp RFIO

Page 4: Data Management &  Information Systems

EGEE-II INFSO-RI-031688

Enabling Grids for E-sciencE

www.eu-egee.orgwww.glite.org

EGEE and gLite are registered

trademarks

LFCLCG File Catalog

LHC Computing Grid File Catalog

Large Hadron Collider Computing Grid File Catalog

Page 5: Data Management &  Information Systems

OGF - EGEE II User Forum - Manchester - 9 May 2007 6

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

OGF - EGEE II User Forum - Manchester - 9 May 2007 6EGEE-II INFSO-RI-031688

LCG “File” Catalog

• The LFC stores mappings between– Users’ file names– File locations on the Grid

• The LFC is accessible via– CLI, C API, Python interface, Perl interface

Supports sessions and bulk operations

– Data Location Interface (DLI) Web Service used for match making:

• given a GUID, returns physical file location

• ORACLE backend for high performance applications– Read-only replication support

File replica 2GUID

File replica 1

File replica m

LFC file name 1

LFC file name n

Page 6: Data Management &  Information Systems

OGF - EGEE II User Forum - Manchester - 9 May 2007 7

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

OGF - EGEE II User Forum - Manchester - 9 May 2007 7EGEE-II INFSO-RI-031688

LFC features

lfc-ls –l /grid/vo/

• Hierarchical Namespace• GSI security• Permissions and ownership• ACLs (based on VOMS)• Virtual ids

– Each user is mapped to (uid, gid)

• VOMS support– To each VOMS group/role corresponds a virtual gid

/grid

/vo

/data

fileLFC

DLIlfc-getacl /grid/vo/data

Page 7: Data Management &  Information Systems

OGF - EGEE II User Forum - Manchester - 9 May 2007 8

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

OGF - EGEE II User Forum - Manchester - 9 May 2007 8EGEE-II INFSO-RI-031688

What's new ?

• LFC bulk operations– New method: lfc_getreplicas

– Greatly improves replicas listing performance

• Secondary groups support– Since LFC version 1.6.3 (in production)

Page 8: Data Management &  Information Systems

OGF - EGEE II User Forum - Manchester - 9 May 2007 10

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

OGF - EGEE II User Forum - Manchester - 9 May 2007 10EGEE-II INFSO-RI-031688

Since LFC version 1.6.3

LFC dir 1

775 (uid1, gid1)

User 1 from VO 1

Mapped to (uid1, gid1)

NEW

User 2 from VO1 with VOMS Role

Mapped to (uid2, gid2, gid1)

1. Creates directory

2. T

ries t

o cr

eate fil

e

Also belongs to VO1

• With secondary groups– User 2 can register a file in dir1

– As (s)he belongs to gid2 and gid1

– But: User 1 cannot register a file in a directory created by User 2, if (s)he does not have the same VOMS Role !

Page 9: Data Management &  Information Systems

EGEE-II INFSO-RI-031688

Enabling Grids for E-sciencE

www.eu-egee.orgwww.glite.org

EGEE and gLite are registered

trademarks

DPM (Disk Pool Manager)

Page 10: Data Management &  Information Systems

OGF - EGEE II User Forum - Manchester - 9 May 2007 12

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

OGF - EGEE II User Forum - Manchester - 9 May 2007 12EGEE-II INFSO-RI-031688

Storage Element

• Storage Resource Manager (SRM) – hides the storage system implementation (disk or active tape)– handles authorization– translates SURLs (Storage URL) to TURLs (Transfer URLs)– disk-based: DPM, dCache,+; tape-based: Castor, dCache

• File I/O: posix-like access from local nodes or the grid GFAL (Grid File Access Layer)

Page 11: Data Management &  Information Systems

OGF - EGEE II User Forum - Manchester - 9 May 2007 13

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

OGF - EGEE II User Forum - Manchester - 9 May 2007 13EGEE-II INFSO-RI-031688

What is a DPM ?

• Disk Pool Manager– Manages storage on disk servers

– SRM support 1.1 2.1 (for backward compatibility) 2.2 (released in DPM version 1.6.3)

– GSI security

– ACLs

– VOMS support

– Secondary groups support (see LFC)

Page 12: Data Management &  Information Systems

OGF - EGEE II User Forum - Manchester - 9 May 2007 14

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

OGF - EGEE II User Forum - Manchester - 9 May 2007 14EGEE-II INFSO-RI-031688

DPM strengths

• Easy to use– Hierarchical namespace

$ dpns-ls /dpm/cern.ch/home/vo/data

• Easy to administrate– Easy to install and configure

– Low maintenance effort

– Easy to add/drain/remove disk servers

• Target: small to medium sites– Single disks --> several disk servers

Page 13: Data Management &  Information Systems

OGF - EGEE II User Forum - Manchester - 9 May 2007 15

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

OGF - EGEE II User Forum - Manchester - 9 May 2007 15EGEE-II INFSO-RI-031688

/vo

DPM: user's point of view

/dpm

/domain

/home

DPM

head node file

(uid, gid1, …)

DPM

disk servers

• DPM Name Server– Namespace

– Authorization

– Physical files location

• Disk Servers– Physical files

• Direct data transfer from/to disk server (no bottleneck)

• External transfers via gridFTP

CLI, C API,

SRM-enabled client, etc. data transfer

Page 14: Data Management &  Information Systems

OGF - EGEE II User Forum - Manchester - 9 May 2007 16

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

OGF - EGEE II User Forum - Manchester - 9 May 2007 16EGEE-II INFSO-RI-031688

GFAL & lcg_util

• Data management access libs.– Shield users from complexity– Interacts with information system, catalogue and SRM-SEs

• GFAL – Posix like C API for file access– SRMv2.2 support– User space tokens correspond to

A certain retention policy (custodial/replica) A certain access latency (online/nearline)

• lcg_util (command line + C API )– Replication, catalogue interaction etc.

Page 15: Data Management &  Information Systems

OGF - EGEE II User Forum - Manchester - 9 May 2007 17

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

OGF - EGEE II User Forum - Manchester - 9 May 2007 17EGEE-II INFSO-RI-031688

LFC & DPM deployment status

• EGEE Catalog– 110 LFCs in production

37 central LFCs 73 local LFCs

• EGEE SRM Storage Elements– CASTOR– dCache– DPM

96 DPMs in production Supporting 135 VOs

• LFC and DPM– Stable and reliable production quality services– Well established services– Require low support effort from administrators and developers

DPM

dCache

CASTOR

Storage Element instances

published in EGEE’s Top BDII

Page 16: Data Management &  Information Systems

OGF - EGEE II User Forum - Manchester - 9 May 2007 18

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

OGF - EGEE II User Forum - Manchester - 9 May 2007 18EGEE-II INFSO-RI-031688

FTS overview

• gLite File Transfer Service is a reliable data movement fabric service (batch for file transfers)– FTS performs bulk file transfers between multiple sites– Transfers are made between any SRM-compliant storage

elements (both SRM 1.1 and 2.2 supported)– It is a multi-VO service,

used to balance usage of site resources according to the SLAs agreedbetween a site and theVOs it supports

– VOMS aware

Page 17: Data Management &  Information Systems

OGF - EGEE II User Forum - Manchester - 9 May 2007 19

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

OGF - EGEE II User Forum - Manchester - 9 May 2007 19EGEE-II INFSO-RI-031688

FTS

• Why is it needed ?– For the user, the service it provides is the reliable point to point

movement of Storage URLs (SURLs) and ensures you get your share of the sites’ resources

– For the site manager, it provides a reliable and manageable way of serving file movement requests from their VOs and an easy way to discover problems with the overall service delivered to the users

– For the VO production manager, it provides ability to control requests coming from his users

• Re-ordering, prioritization,…

– The focus is on the “service” delivered to the user• It makes it easy to do these things well with minimal manpower

Page 18: Data Management &  Information Systems

OGF - EGEE II User Forum - Manchester - 9 May 2007 20

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

OGF - EGEE II User Forum - Manchester - 9 May 2007 20EGEE-II INFSO-RI-031688

FTS: key points

• Reliability– It handles the retries in case of

storage / network failures– VO customizable retry logic

– Service designed for high-availability deployment

• Security– All data is transferred securely using delegated credentials with

SRM / gridFTP– Service audits all user / admin operations

• Service and performance– Service stability: it is designed to efficiently use the available

storage and network resources without overloading them– Service recovery: integration of monitoring to detect service-level

degradation

Page 19: Data Management &  Information Systems

OGF - EGEE II User Forum - Manchester - 9 May 2007 21

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

OGF - EGEE II User Forum - Manchester - 9 May 2007 21EGEE-II INFSO-RI-031688

Service scale

• Designed to scale up to the transfer needs of very data intensive applications

• Currently deployed in production at CERN– Running the production WLCG

tier-0 data export– Target rate is ~1 Gbyte/sec 24/7– Over 9 petabytes transferred in last 6 months > 10 million files

• Also deployed at ~10 tier-1 sites running a mesh of transfers across WLCG– Inter-tier1 and tier-1 to tier-2 transfers– Each tier-1 has transferred around 0.2 – 0.5 petabytes of data

Page 20: Data Management &  Information Systems

EGEE-II INFSO-RI-031688

Enabling Grids for E-sciencE

www.eu-egee.orgwww.glite.org

EGEE and gLite are registered

trademarks

The gLite AMGA Metadata Catalogue

Page 21: Data Management &  Information Systems

OGF - EGEE II User Forum - Manchester - 9 May 2007

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

OGF - EGEE II User Forum - Manchester - 9 May 2007 23EGEE-II INFSO-RI-031688

Metadata in EGEE• Metadata is information about data stored in files

– usually lives in relational databases• AMGA is a joint JRA1-NA4 development

– Used by several application domains ( BioMed, HEP, EarthObs….)• Implementation:

– SOAP and Text front-ends– Streamed Bulk Operations ----> performance– Supports single calls, sessions & connections– SSL security with grid certs (X509)

and others, passwords, Kerberos Own User & Group management + VOMS

– PostgreSQL, Oracle, MySQL, SQLite backends

• Query parser supports good fraction of SQL:– Access permissions per directory/entry via ACLs

• AMGA integrates support for replication of metadata– Asynchronous replication: Ideal for WAN

Page 22: Data Management &  Information Systems

OGF - EGEE II User Forum - Manchester - 9 May 2007

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

OGF - EGEE II User Forum - Manchester - 9 May 2007 24EGEE-II INFSO-RI-031688

AMGA Clients & APIs

• AMGA Clients (for setup, administration)– Shell-like client– Graphical Browser (Python)

• Many Programming APIs– Diverse user community requested/provided

C/C++, Java, Python, Perl, PHP

• SOAP interface– Works with gSOAP, Axis, PySOAP

Page 23: Data Management &  Information Systems

OGF - EGEE II User Forum - Manchester - 9 May 2007

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

OGF - EGEE II User Forum - Manchester - 9 May 2007 25EGEE-II INFSO-RI-031688

Performance

• Performance comparable to direct DB access – C++, TCP streaming protocol, very fast SSL sessions

Throughput comparison between AMGA and direct access via JDBC reading same table on a LAN

100

1000

10000

100000

1e+06

1 10 100

Throughput [entries/s]

# clients

AMGA 1000 rowsJDBC 1000 rowsAMGA 1 rowJDBC 1 row

Logarithmic Scale!

Page 24: Data Management &  Information Systems

OGF - EGEE II User Forum - Manchester - 9 May 2007

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

OGF - EGEE II User Forum - Manchester - 9 May 2007 26EGEE-II INFSO-RI-031688

Scale

• LHCb (HEP VO use case)– 100 Million entries successfully tested!– 150GB data– 100 000 entries/day insert rate expected– 10 entries/second read-rate– Uses ORACLE RAC backend

For most demanding use cases

Page 25: Data Management &  Information Systems

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

OGF - EGEE II User Forum - Manchester - 9 May 2007 28EGEE-II INFSO-RI-031688

Encrypted Data Storage

Page 26: Data Management &  Information Systems

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

OGF - EGEE II User Forum - Manchester - 9 May 2007 29EGEE-II INFSO-RI-031688

Motivation

Medical community as the principal user• large amount of images• privacy concerns vs. processing needs• ease of use (image production and application)

Strong security requirements• anonymity (patient data is separate)• fine grained access control (only selected individuals)• privacy (even storage administrator cannot read)

Legacy service in use, based on gLite-1.5

Described components are under development

Page 27: Data Management &  Information Systems

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

OGF - EGEE II User Forum - Manchester - 9 May 2007 30EGEE-II INFSO-RI-031688

Building Blocks

• Hospitals: – DICOM = Digital Image and COmmunication in Medicine

• Grid: SE = SRM + gridftp + I/O– and a client (application processing an image)

Goal: data access at any location

SE

SRM

gridftp

I/O

DICOM

Page 28: Data Management &  Information Systems

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

OGF - EGEE II User Forum - Manchester - 9 May 2007 31EGEE-II INFSO-RI-031688

Exporting Images

“wrapping” DICOM :– anonymity: patient data is separated and stored in AMGA– access control: ACL information on individual files in SE (DPM)– privacy: per-file keys

distributed among several Hydra key servers fine grained access control

Image is retrieved from DICOM and processed to be “exported” to the grid.

DICOM-SE

SRMv2

gridftp

I/O

DICOM

trigger

HydraKeyStore

HydraKeyStore

HydraKeyStore

AMGAmetadata

image

patient data

file ACL

keys

Page 29: Data Management &  Information Systems

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

OGF - EGEE II User Forum - Manchester - 9 May 2007 32EGEE-II INFSO-RI-031688

Accessing Images

• image ID is located by AMGA• key is retrieved from the Hydra key servers• file is accessed by SRM (access control in DPM)• data is read and decrypted block-by-block

in memory only (GFAL and hydra-cli)---> useful for allStill to be solved:• ACL synchronization among SEs

DICOM-SE

SRMv2

gridftp

I/O

DICOM

HydraKeyStore

HydraKeyStore

HydraKeyStore

AMGAmetadata

image

1. patient look-up

3. get TURL

2. keys

4. read

GFAL

Page 30: Data Management &  Information Systems

OGF - EGEE II User Forum - Manchester - 9 May 2007 33

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

OGF - EGEE II User Forum - Manchester - 9 May 2007 33EGEE-II INFSO-RI-031688

Information Systems

• R-GMA

• BDII (ldap based information system)

Page 31: Data Management &  Information Systems

OGF - EGEE II User Forum - Manchester - 9 May 2007 34

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

OGF - EGEE II User Forum - Manchester - 9 May 2007 34EGEE-II INFSO-RI-031688

Relational Grid Monitoring Architecture

R-GMA

• For users R-GMA appears similar to a single relational database.• Implementation of GGF’s Grid Monitoring Architecture (GMA)• Rich set of APIs (WebBrowsers, Java, C/C++, Python)• Backbone of EGEE monitoring (almost every activity leaves traces)

– See Dashboard, Realtime Monitor ++++++ about 20 tools

• Used by EGEE accounting as transport

ProducerService

RegistryService

ConsumerService

AP

IA

PI

SchemaService

Consumerapplication

Producerapplication

Publish Tuples

Send Query

Receive Tuples

Register

LocateQ

uery

Tuples

SQL “CREATE TABLE”

SQL “INSERT”

SQL “SELECT”

Page 32: Data Management &  Information Systems

OGF - EGEE II User Forum - Manchester - 9 May 2007 35

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

OGF - EGEE II User Forum - Manchester - 9 May 2007 35EGEE-II INFSO-RI-031688

Service discovery

• SD provides simple methods for locating services– hides underlying information system (simplified use)– plug-ins for R-GMA, BDII and XML files – API available for Java, C/C++ and command line tools

Page 33: Data Management &  Information Systems

OGF - EGEE II User Forum - Manchester - 9 May 2007 36

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

OGF - EGEE II User Forum - Manchester - 9 May 2007 36EGEE-II INFSO-RI-031688

The Information System

BDII

top-level

BDII

site-level

BDII

resource

MDS

GRIS

provider provider

WMS

WN

UI

FTS

FCR

Queries

(15HZ)

Site

•Based on ldap

•Standardized information provider (GIP)

•GLUE-1.3 schema

•Used with 230+ sites

•Roughly 60 instances in EGEE

•Top level BDII at CERN 15HZ query rate

•>20MByte of data

VO specific filter, based on live status

•2 minutes

Berkeley Data Base Information Index

Page 34: Data Management &  Information Systems

OGF - EGEE II User Forum - Manchester - 9 May 2007 38

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

OGF - EGEE II User Forum - Manchester - 9 May 2007 38EGEE-II INFSO-RI-031688

Inside A BDII

2171LDAP

2172LDAP

2173LDAP

2170Port Fwd

Update DB&

Modify DB

2170Port Fwd

Swap DBs

Write to cache Write to cache

Write to cache Write to cache

Write to cache ldapsearch

FCR

Page 35: Data Management &  Information Systems

OGF - EGEE II User Forum - Manchester - 9 May 2007 39

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

OGF - EGEE II User Forum - Manchester - 9 May 2007 39EGEE-II INFSO-RI-031688

Load Balanced BDII

BDII2170

BDII2170

BDII2170

BDII2170

BDII2170

BDII2170

DNS Round

Robin Alias

Queries

Page 36: Data Management &  Information Systems

OGF - EGEE II User Forum - Manchester - 9 May 2007 43

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

OGF - EGEE II User Forum - Manchester - 9 May 2007 43EGEE-II INFSO-RI-031688

GIN BDII

Generic Information Provider

Pro

vide

r E

GE

E

Pro

vide

r O

SG

Pro

vide

r N

DG

F

GIN

BDIIARC

BDII

Pro

vide

r N

areg

i

Pro

vide

r T

erag

rid

Pro

vide

r P

ragm

a

EGEE

Site

OSG

Site

NDGF

Site

Naregi

Grid

Teragrid

Grid

Pragma

Grid

•Used by the GIN group

Page 37: Data Management &  Information Systems

OGF - EGEE II User Forum - Manchester - 9 May 2007 44

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

OGF - EGEE II User Forum - Manchester - 9 May 2007 44EGEE-II INFSO-RI-031688

Information Systems

• Current problems• SLAPD demons on loaded systems can starve

– CEs drop out of the system– Move info provider from CE

• SiteBDIIs co-hosted on busy systems time out– Loss of an entire site in the info system – Move on large sites to low load node

• Improve reliability by fail back top level BDIIs– Needs work in the clients

• Scalability tests indicate limits (1-2 years time)– Cache static data more aggressively– Smarter schema (OGF- GLUE)– Change underlying technology

Simple insulation API needed ------> Standardization (OGF SAGA?)