data management & information systems
DESCRIPTION
Data Management & Information Systems. Markus Schulz – SA3 - CERN OGF - EGEE-II User Forum Manchester - 9 May 2007. Disclaimer. Material that went into this presentation has been provided by many developers from inside JRA1, SA3, NA4 and external contributors Many thanks Ask questions!. - PowerPoint PPT PresentationTRANSCRIPT
![Page 1: Data Management & Information Systems](https://reader033.vdocuments.us/reader033/viewer/2022051401/56814548550346895db215f1/html5/thumbnails/1.jpg)
EGEE-II INFSO-RI-031688
Enabling Grids for E-sciencE
www.eu-egee.orgwww.glite.org
EGEE and gLite are registered
trademarks
Data Management & Information Systems Markus Schulz – SA3 - CERN
OGF - EGEE-II User Forum
Manchester - 9 May 2007
![Page 2: Data Management & Information Systems](https://reader033.vdocuments.us/reader033/viewer/2022051401/56814548550346895db215f1/html5/thumbnails/2.jpg)
OGF - EGEE II User Forum - Manchester - 9 May 2007 2
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
OGF - EGEE II User Forum - Manchester - 9 May 2007 2EGEE-II INFSO-RI-031688
Disclaimer• Material that went into this presentation has been provided by many
developers from inside JRA1, SA3, NA4 and external contributors
– Many thanks
Ask questions!
![Page 3: Data Management & Information Systems](https://reader033.vdocuments.us/reader033/viewer/2022051401/56814548550346895db215f1/html5/thumbnails/3.jpg)
OGF - EGEE II User Forum - Manchester - 9 May 2007 4
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
OGF - EGEE II User Forum - Manchester - 9 May 2007 4EGEE-II INFSO-RI-031688
EGEE Data Management
lcg_utilsFTS
Vendor Specific
APIs
GFAL Cataloging Storage Data transfer
Data Management
User ToolsVOFrameworks
(RLS) LFC SRM(Classic
SE)gridftp RFIO
![Page 4: Data Management & Information Systems](https://reader033.vdocuments.us/reader033/viewer/2022051401/56814548550346895db215f1/html5/thumbnails/4.jpg)
EGEE-II INFSO-RI-031688
Enabling Grids for E-sciencE
www.eu-egee.orgwww.glite.org
EGEE and gLite are registered
trademarks
LFCLCG File Catalog
LHC Computing Grid File Catalog
Large Hadron Collider Computing Grid File Catalog
![Page 5: Data Management & Information Systems](https://reader033.vdocuments.us/reader033/viewer/2022051401/56814548550346895db215f1/html5/thumbnails/5.jpg)
OGF - EGEE II User Forum - Manchester - 9 May 2007 6
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
OGF - EGEE II User Forum - Manchester - 9 May 2007 6EGEE-II INFSO-RI-031688
LCG “File” Catalog
• The LFC stores mappings between– Users’ file names– File locations on the Grid
• The LFC is accessible via– CLI, C API, Python interface, Perl interface
Supports sessions and bulk operations
– Data Location Interface (DLI) Web Service used for match making:
• given a GUID, returns physical file location
• ORACLE backend for high performance applications– Read-only replication support
…
File replica 2GUID
File replica 1
File replica m
LFC file name 1
LFC file name n
…
![Page 6: Data Management & Information Systems](https://reader033.vdocuments.us/reader033/viewer/2022051401/56814548550346895db215f1/html5/thumbnails/6.jpg)
OGF - EGEE II User Forum - Manchester - 9 May 2007 7
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
OGF - EGEE II User Forum - Manchester - 9 May 2007 7EGEE-II INFSO-RI-031688
LFC features
lfc-ls –l /grid/vo/
• Hierarchical Namespace• GSI security• Permissions and ownership• ACLs (based on VOMS)• Virtual ids
– Each user is mapped to (uid, gid)
• VOMS support– To each VOMS group/role corresponds a virtual gid
/grid
/vo
/data
fileLFC
DLIlfc-getacl /grid/vo/data
![Page 7: Data Management & Information Systems](https://reader033.vdocuments.us/reader033/viewer/2022051401/56814548550346895db215f1/html5/thumbnails/7.jpg)
OGF - EGEE II User Forum - Manchester - 9 May 2007 8
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
OGF - EGEE II User Forum - Manchester - 9 May 2007 8EGEE-II INFSO-RI-031688
What's new ?
• LFC bulk operations– New method: lfc_getreplicas
– Greatly improves replicas listing performance
• Secondary groups support– Since LFC version 1.6.3 (in production)
![Page 8: Data Management & Information Systems](https://reader033.vdocuments.us/reader033/viewer/2022051401/56814548550346895db215f1/html5/thumbnails/8.jpg)
OGF - EGEE II User Forum - Manchester - 9 May 2007 10
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
OGF - EGEE II User Forum - Manchester - 9 May 2007 10EGEE-II INFSO-RI-031688
Since LFC version 1.6.3
LFC dir 1
775 (uid1, gid1)
User 1 from VO 1
Mapped to (uid1, gid1)
NEW
User 2 from VO1 with VOMS Role
Mapped to (uid2, gid2, gid1)
1. Creates directory
2. T
ries t
o cr
eate fil
e
Also belongs to VO1
• With secondary groups– User 2 can register a file in dir1
– As (s)he belongs to gid2 and gid1
– But: User 1 cannot register a file in a directory created by User 2, if (s)he does not have the same VOMS Role !
![Page 9: Data Management & Information Systems](https://reader033.vdocuments.us/reader033/viewer/2022051401/56814548550346895db215f1/html5/thumbnails/9.jpg)
EGEE-II INFSO-RI-031688
Enabling Grids for E-sciencE
www.eu-egee.orgwww.glite.org
EGEE and gLite are registered
trademarks
DPM (Disk Pool Manager)
![Page 10: Data Management & Information Systems](https://reader033.vdocuments.us/reader033/viewer/2022051401/56814548550346895db215f1/html5/thumbnails/10.jpg)
OGF - EGEE II User Forum - Manchester - 9 May 2007 12
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
OGF - EGEE II User Forum - Manchester - 9 May 2007 12EGEE-II INFSO-RI-031688
Storage Element
• Storage Resource Manager (SRM) – hides the storage system implementation (disk or active tape)– handles authorization– translates SURLs (Storage URL) to TURLs (Transfer URLs)– disk-based: DPM, dCache,+; tape-based: Castor, dCache
• File I/O: posix-like access from local nodes or the grid GFAL (Grid File Access Layer)
![Page 11: Data Management & Information Systems](https://reader033.vdocuments.us/reader033/viewer/2022051401/56814548550346895db215f1/html5/thumbnails/11.jpg)
OGF - EGEE II User Forum - Manchester - 9 May 2007 13
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
OGF - EGEE II User Forum - Manchester - 9 May 2007 13EGEE-II INFSO-RI-031688
What is a DPM ?
• Disk Pool Manager– Manages storage on disk servers
– SRM support 1.1 2.1 (for backward compatibility) 2.2 (released in DPM version 1.6.3)
– GSI security
– ACLs
– VOMS support
– Secondary groups support (see LFC)
![Page 12: Data Management & Information Systems](https://reader033.vdocuments.us/reader033/viewer/2022051401/56814548550346895db215f1/html5/thumbnails/12.jpg)
OGF - EGEE II User Forum - Manchester - 9 May 2007 14
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
OGF - EGEE II User Forum - Manchester - 9 May 2007 14EGEE-II INFSO-RI-031688
DPM strengths
• Easy to use– Hierarchical namespace
$ dpns-ls /dpm/cern.ch/home/vo/data
• Easy to administrate– Easy to install and configure
– Low maintenance effort
– Easy to add/drain/remove disk servers
• Target: small to medium sites– Single disks --> several disk servers
![Page 13: Data Management & Information Systems](https://reader033.vdocuments.us/reader033/viewer/2022051401/56814548550346895db215f1/html5/thumbnails/13.jpg)
OGF - EGEE II User Forum - Manchester - 9 May 2007 15
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
OGF - EGEE II User Forum - Manchester - 9 May 2007 15EGEE-II INFSO-RI-031688
/vo
DPM: user's point of view
/dpm
/domain
/home
DPM
head node file
(uid, gid1, …)
DPM
disk servers
• DPM Name Server– Namespace
– Authorization
– Physical files location
• Disk Servers– Physical files
• Direct data transfer from/to disk server (no bottleneck)
• External transfers via gridFTP
CLI, C API,
SRM-enabled client, etc. data transfer
![Page 14: Data Management & Information Systems](https://reader033.vdocuments.us/reader033/viewer/2022051401/56814548550346895db215f1/html5/thumbnails/14.jpg)
OGF - EGEE II User Forum - Manchester - 9 May 2007 16
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
OGF - EGEE II User Forum - Manchester - 9 May 2007 16EGEE-II INFSO-RI-031688
GFAL & lcg_util
• Data management access libs.– Shield users from complexity– Interacts with information system, catalogue and SRM-SEs
• GFAL – Posix like C API for file access– SRMv2.2 support– User space tokens correspond to
A certain retention policy (custodial/replica) A certain access latency (online/nearline)
• lcg_util (command line + C API )– Replication, catalogue interaction etc.
![Page 15: Data Management & Information Systems](https://reader033.vdocuments.us/reader033/viewer/2022051401/56814548550346895db215f1/html5/thumbnails/15.jpg)
OGF - EGEE II User Forum - Manchester - 9 May 2007 17
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
OGF - EGEE II User Forum - Manchester - 9 May 2007 17EGEE-II INFSO-RI-031688
LFC & DPM deployment status
• EGEE Catalog– 110 LFCs in production
37 central LFCs 73 local LFCs
• EGEE SRM Storage Elements– CASTOR– dCache– DPM
96 DPMs in production Supporting 135 VOs
• LFC and DPM– Stable and reliable production quality services– Well established services– Require low support effort from administrators and developers
DPM
dCache
CASTOR
Storage Element instances
published in EGEE’s Top BDII
![Page 16: Data Management & Information Systems](https://reader033.vdocuments.us/reader033/viewer/2022051401/56814548550346895db215f1/html5/thumbnails/16.jpg)
OGF - EGEE II User Forum - Manchester - 9 May 2007 18
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
OGF - EGEE II User Forum - Manchester - 9 May 2007 18EGEE-II INFSO-RI-031688
FTS overview
• gLite File Transfer Service is a reliable data movement fabric service (batch for file transfers)– FTS performs bulk file transfers between multiple sites– Transfers are made between any SRM-compliant storage
elements (both SRM 1.1 and 2.2 supported)– It is a multi-VO service,
used to balance usage of site resources according to the SLAs agreedbetween a site and theVOs it supports
– VOMS aware
![Page 17: Data Management & Information Systems](https://reader033.vdocuments.us/reader033/viewer/2022051401/56814548550346895db215f1/html5/thumbnails/17.jpg)
OGF - EGEE II User Forum - Manchester - 9 May 2007 19
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
OGF - EGEE II User Forum - Manchester - 9 May 2007 19EGEE-II INFSO-RI-031688
FTS
• Why is it needed ?– For the user, the service it provides is the reliable point to point
movement of Storage URLs (SURLs) and ensures you get your share of the sites’ resources
– For the site manager, it provides a reliable and manageable way of serving file movement requests from their VOs and an easy way to discover problems with the overall service delivered to the users
– For the VO production manager, it provides ability to control requests coming from his users
• Re-ordering, prioritization,…
– The focus is on the “service” delivered to the user• It makes it easy to do these things well with minimal manpower
![Page 18: Data Management & Information Systems](https://reader033.vdocuments.us/reader033/viewer/2022051401/56814548550346895db215f1/html5/thumbnails/18.jpg)
OGF - EGEE II User Forum - Manchester - 9 May 2007 20
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
OGF - EGEE II User Forum - Manchester - 9 May 2007 20EGEE-II INFSO-RI-031688
FTS: key points
• Reliability– It handles the retries in case of
storage / network failures– VO customizable retry logic
– Service designed for high-availability deployment
• Security– All data is transferred securely using delegated credentials with
SRM / gridFTP– Service audits all user / admin operations
• Service and performance– Service stability: it is designed to efficiently use the available
storage and network resources without overloading them– Service recovery: integration of monitoring to detect service-level
degradation
![Page 19: Data Management & Information Systems](https://reader033.vdocuments.us/reader033/viewer/2022051401/56814548550346895db215f1/html5/thumbnails/19.jpg)
OGF - EGEE II User Forum - Manchester - 9 May 2007 21
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
OGF - EGEE II User Forum - Manchester - 9 May 2007 21EGEE-II INFSO-RI-031688
Service scale
• Designed to scale up to the transfer needs of very data intensive applications
• Currently deployed in production at CERN– Running the production WLCG
tier-0 data export– Target rate is ~1 Gbyte/sec 24/7– Over 9 petabytes transferred in last 6 months > 10 million files
• Also deployed at ~10 tier-1 sites running a mesh of transfers across WLCG– Inter-tier1 and tier-1 to tier-2 transfers– Each tier-1 has transferred around 0.2 – 0.5 petabytes of data
![Page 20: Data Management & Information Systems](https://reader033.vdocuments.us/reader033/viewer/2022051401/56814548550346895db215f1/html5/thumbnails/20.jpg)
EGEE-II INFSO-RI-031688
Enabling Grids for E-sciencE
www.eu-egee.orgwww.glite.org
EGEE and gLite are registered
trademarks
The gLite AMGA Metadata Catalogue
![Page 21: Data Management & Information Systems](https://reader033.vdocuments.us/reader033/viewer/2022051401/56814548550346895db215f1/html5/thumbnails/21.jpg)
OGF - EGEE II User Forum - Manchester - 9 May 2007
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
OGF - EGEE II User Forum - Manchester - 9 May 2007 23EGEE-II INFSO-RI-031688
Metadata in EGEE• Metadata is information about data stored in files
– usually lives in relational databases• AMGA is a joint JRA1-NA4 development
– Used by several application domains ( BioMed, HEP, EarthObs….)• Implementation:
– SOAP and Text front-ends– Streamed Bulk Operations ----> performance– Supports single calls, sessions & connections– SSL security with grid certs (X509)
and others, passwords, Kerberos Own User & Group management + VOMS
– PostgreSQL, Oracle, MySQL, SQLite backends
• Query parser supports good fraction of SQL:– Access permissions per directory/entry via ACLs
• AMGA integrates support for replication of metadata– Asynchronous replication: Ideal for WAN
![Page 22: Data Management & Information Systems](https://reader033.vdocuments.us/reader033/viewer/2022051401/56814548550346895db215f1/html5/thumbnails/22.jpg)
OGF - EGEE II User Forum - Manchester - 9 May 2007
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
OGF - EGEE II User Forum - Manchester - 9 May 2007 24EGEE-II INFSO-RI-031688
AMGA Clients & APIs
• AMGA Clients (for setup, administration)– Shell-like client– Graphical Browser (Python)
• Many Programming APIs– Diverse user community requested/provided
C/C++, Java, Python, Perl, PHP
• SOAP interface– Works with gSOAP, Axis, PySOAP
![Page 23: Data Management & Information Systems](https://reader033.vdocuments.us/reader033/viewer/2022051401/56814548550346895db215f1/html5/thumbnails/23.jpg)
OGF - EGEE II User Forum - Manchester - 9 May 2007
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
OGF - EGEE II User Forum - Manchester - 9 May 2007 25EGEE-II INFSO-RI-031688
Performance
• Performance comparable to direct DB access – C++, TCP streaming protocol, very fast SSL sessions
Throughput comparison between AMGA and direct access via JDBC reading same table on a LAN
100
1000
10000
100000
1e+06
1 10 100
Throughput [entries/s]
# clients
AMGA 1000 rowsJDBC 1000 rowsAMGA 1 rowJDBC 1 row
Logarithmic Scale!
![Page 24: Data Management & Information Systems](https://reader033.vdocuments.us/reader033/viewer/2022051401/56814548550346895db215f1/html5/thumbnails/24.jpg)
OGF - EGEE II User Forum - Manchester - 9 May 2007
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
OGF - EGEE II User Forum - Manchester - 9 May 2007 26EGEE-II INFSO-RI-031688
Scale
• LHCb (HEP VO use case)– 100 Million entries successfully tested!– 150GB data– 100 000 entries/day insert rate expected– 10 entries/second read-rate– Uses ORACLE RAC backend
For most demanding use cases
![Page 25: Data Management & Information Systems](https://reader033.vdocuments.us/reader033/viewer/2022051401/56814548550346895db215f1/html5/thumbnails/25.jpg)
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
OGF - EGEE II User Forum - Manchester - 9 May 2007 28EGEE-II INFSO-RI-031688
Encrypted Data Storage
![Page 26: Data Management & Information Systems](https://reader033.vdocuments.us/reader033/viewer/2022051401/56814548550346895db215f1/html5/thumbnails/26.jpg)
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
OGF - EGEE II User Forum - Manchester - 9 May 2007 29EGEE-II INFSO-RI-031688
Motivation
Medical community as the principal user• large amount of images• privacy concerns vs. processing needs• ease of use (image production and application)
Strong security requirements• anonymity (patient data is separate)• fine grained access control (only selected individuals)• privacy (even storage administrator cannot read)
Legacy service in use, based on gLite-1.5
Described components are under development
![Page 27: Data Management & Information Systems](https://reader033.vdocuments.us/reader033/viewer/2022051401/56814548550346895db215f1/html5/thumbnails/27.jpg)
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
OGF - EGEE II User Forum - Manchester - 9 May 2007 30EGEE-II INFSO-RI-031688
Building Blocks
• Hospitals: – DICOM = Digital Image and COmmunication in Medicine
• Grid: SE = SRM + gridftp + I/O– and a client (application processing an image)
Goal: data access at any location
SE
SRM
gridftp
I/O
DICOM
![Page 28: Data Management & Information Systems](https://reader033.vdocuments.us/reader033/viewer/2022051401/56814548550346895db215f1/html5/thumbnails/28.jpg)
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
OGF - EGEE II User Forum - Manchester - 9 May 2007 31EGEE-II INFSO-RI-031688
Exporting Images
“wrapping” DICOM :– anonymity: patient data is separated and stored in AMGA– access control: ACL information on individual files in SE (DPM)– privacy: per-file keys
distributed among several Hydra key servers fine grained access control
Image is retrieved from DICOM and processed to be “exported” to the grid.
DICOM-SE
SRMv2
gridftp
I/O
DICOM
trigger
HydraKeyStore
HydraKeyStore
HydraKeyStore
AMGAmetadata
image
patient data
file ACL
keys
![Page 29: Data Management & Information Systems](https://reader033.vdocuments.us/reader033/viewer/2022051401/56814548550346895db215f1/html5/thumbnails/29.jpg)
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
OGF - EGEE II User Forum - Manchester - 9 May 2007 32EGEE-II INFSO-RI-031688
Accessing Images
• image ID is located by AMGA• key is retrieved from the Hydra key servers• file is accessed by SRM (access control in DPM)• data is read and decrypted block-by-block
in memory only (GFAL and hydra-cli)---> useful for allStill to be solved:• ACL synchronization among SEs
DICOM-SE
SRMv2
gridftp
I/O
DICOM
HydraKeyStore
HydraKeyStore
HydraKeyStore
AMGAmetadata
image
1. patient look-up
3. get TURL
2. keys
4. read
GFAL
![Page 30: Data Management & Information Systems](https://reader033.vdocuments.us/reader033/viewer/2022051401/56814548550346895db215f1/html5/thumbnails/30.jpg)
OGF - EGEE II User Forum - Manchester - 9 May 2007 33
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
OGF - EGEE II User Forum - Manchester - 9 May 2007 33EGEE-II INFSO-RI-031688
Information Systems
• R-GMA
• BDII (ldap based information system)
![Page 31: Data Management & Information Systems](https://reader033.vdocuments.us/reader033/viewer/2022051401/56814548550346895db215f1/html5/thumbnails/31.jpg)
OGF - EGEE II User Forum - Manchester - 9 May 2007 34
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
OGF - EGEE II User Forum - Manchester - 9 May 2007 34EGEE-II INFSO-RI-031688
Relational Grid Monitoring Architecture
R-GMA
• For users R-GMA appears similar to a single relational database.• Implementation of GGF’s Grid Monitoring Architecture (GMA)• Rich set of APIs (WebBrowsers, Java, C/C++, Python)• Backbone of EGEE monitoring (almost every activity leaves traces)
– See Dashboard, Realtime Monitor ++++++ about 20 tools
• Used by EGEE accounting as transport
ProducerService
RegistryService
ConsumerService
AP
IA
PI
SchemaService
Consumerapplication
Producerapplication
Publish Tuples
Send Query
Receive Tuples
Register
LocateQ
uery
Tuples
SQL “CREATE TABLE”
SQL “INSERT”
SQL “SELECT”
![Page 32: Data Management & Information Systems](https://reader033.vdocuments.us/reader033/viewer/2022051401/56814548550346895db215f1/html5/thumbnails/32.jpg)
OGF - EGEE II User Forum - Manchester - 9 May 2007 35
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
OGF - EGEE II User Forum - Manchester - 9 May 2007 35EGEE-II INFSO-RI-031688
Service discovery
• SD provides simple methods for locating services– hides underlying information system (simplified use)– plug-ins for R-GMA, BDII and XML files – API available for Java, C/C++ and command line tools
![Page 33: Data Management & Information Systems](https://reader033.vdocuments.us/reader033/viewer/2022051401/56814548550346895db215f1/html5/thumbnails/33.jpg)
OGF - EGEE II User Forum - Manchester - 9 May 2007 36
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
OGF - EGEE II User Forum - Manchester - 9 May 2007 36EGEE-II INFSO-RI-031688
The Information System
BDII
top-level
BDII
site-level
BDII
resource
MDS
GRIS
provider provider
WMS
WN
UI
FTS
FCR
Queries
(15HZ)
Site
•Based on ldap
•Standardized information provider (GIP)
•GLUE-1.3 schema
•Used with 230+ sites
•Roughly 60 instances in EGEE
•Top level BDII at CERN 15HZ query rate
•>20MByte of data
VO specific filter, based on live status
•2 minutes
Berkeley Data Base Information Index
![Page 34: Data Management & Information Systems](https://reader033.vdocuments.us/reader033/viewer/2022051401/56814548550346895db215f1/html5/thumbnails/34.jpg)
OGF - EGEE II User Forum - Manchester - 9 May 2007 38
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
OGF - EGEE II User Forum - Manchester - 9 May 2007 38EGEE-II INFSO-RI-031688
Inside A BDII
2171LDAP
2172LDAP
2173LDAP
2170Port Fwd
Update DB&
Modify DB
2170Port Fwd
Swap DBs
Write to cache Write to cache
Write to cache Write to cache
Write to cache ldapsearch
FCR
![Page 35: Data Management & Information Systems](https://reader033.vdocuments.us/reader033/viewer/2022051401/56814548550346895db215f1/html5/thumbnails/35.jpg)
OGF - EGEE II User Forum - Manchester - 9 May 2007 39
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
OGF - EGEE II User Forum - Manchester - 9 May 2007 39EGEE-II INFSO-RI-031688
Load Balanced BDII
BDII2170
BDII2170
BDII2170
BDII2170
BDII2170
BDII2170
DNS Round
Robin Alias
Queries
![Page 36: Data Management & Information Systems](https://reader033.vdocuments.us/reader033/viewer/2022051401/56814548550346895db215f1/html5/thumbnails/36.jpg)
OGF - EGEE II User Forum - Manchester - 9 May 2007 43
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
OGF - EGEE II User Forum - Manchester - 9 May 2007 43EGEE-II INFSO-RI-031688
GIN BDII
Generic Information Provider
Pro
vide
r E
GE
E
Pro
vide
r O
SG
Pro
vide
r N
DG
F
GIN
BDIIARC
BDII
Pro
vide
r N
areg
i
Pro
vide
r T
erag
rid
Pro
vide
r P
ragm
a
EGEE
Site
OSG
Site
NDGF
Site
Naregi
Grid
Teragrid
Grid
Pragma
Grid
•Used by the GIN group
![Page 37: Data Management & Information Systems](https://reader033.vdocuments.us/reader033/viewer/2022051401/56814548550346895db215f1/html5/thumbnails/37.jpg)
OGF - EGEE II User Forum - Manchester - 9 May 2007 44
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
OGF - EGEE II User Forum - Manchester - 9 May 2007 44EGEE-II INFSO-RI-031688
Information Systems
• Current problems• SLAPD demons on loaded systems can starve
– CEs drop out of the system– Move info provider from CE
• SiteBDIIs co-hosted on busy systems time out– Loss of an entire site in the info system – Move on large sites to low load node
• Improve reliability by fail back top level BDIIs– Needs work in the clients
• Scalability tests indicate limits (1-2 years time)– Cache static data more aggressively– Smarter schema (OGF- GLUE)– Change underlying technology
Simple insulation API needed ------> Standardization (OGF SAGA?)