Download - CESGA Status Report
EGEE-II INFSO-RI-031688
Enabling Grids for E-sciencE
www.eu-egee.org
EGEE and gLite are registered trademarks
CESGA Status Report
Javier Lopez, Alvaro Simon, Esteban Freire/ CESGASA3 All Hands Meeting Barcelona
EGEE User Forum 2
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
Outline
• Main Achievements since November• Work on Deliverables/Milestones• Issues/Problems• Next Steps
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
Main achievements since November
EGEE User Forum 4
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
Infrastructure
• New infrastructure based on virtual machines• This is the new common infrastructure for SA3, PPS
and Production testbeds @CESGA• Fronted:
– 1x Dell Poweredge 2950: 1TB RAID5 storage (golden images of all the services)
• Virtual Machines:– 4x10 Dell Poweredge 1955: quad-core processors
• For SA3 services we use HVM machines: they allow us to use kernel 2.4 without modification to the OS
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
Infrastructure• Advantages
– We can increase or decrease the capacity on demand– Easy to test new releases in a clean environment– Possible to roll-back failing upgrades using LVM snapshoot
capability
• We have produced a document explaining our infrastructure:– https://swe-wiki.egee.cesga.es/cgi-bin/moin.cgi/XEN3_Virtual_Ma
chines_-_CESGA-EGEE– More detailed documents also available on request
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
SGE• NOTE: This task is a joined effort between IC, LIP and
CESGA
• Integration in LCG CE ready– RPM packages tested– Yaim scripts developed– Documentation updated– Ready for certification
• Re-Distribution of Grid Engine: – Reviewed license and sent to SA3 list for second review– Re-distribution allowed
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
Testing SGE• Based on the Torque/maui tests developed @GRNET• Adapting the scripts • Preliminary results available
– Very slow submission
• Optimizing SGE configuration
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
SGE on gLite CE• IP ready (same as in LCG)• Meeting with BLAH developer (David Rebatto) to
understand the work required• Required scripts are being developed @IC• Testing will be done @CESGA• APEL ready (Dave Kant)
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
Assigned Tasks• Task #4759: Testing SGE
– In progress
• Task #4600: Provide updated RPMs for SGE jobmanager and installation guide– Ready for certification
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
Issues/Problems• Job submission tests:
– Preliminary results show that optimization of default SGE configuration required
• Improve SGE configuration to send back to CE stdout and stderror files
• Modifications required to run SGE on para-virtual machines
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
Next Steps• SGE is working on a lcg-CE. SGE is working on a lcg-CE. Next step: CERTIFICATION
– Add RPMs to SA3 repository– Integrate SGE yaim scripts
• Tests for SGE lcg-CE (later they will be reused for glite-CE)
• SGE on glite-CE: Started on integrating support for BLAHBLAH– Other local middleware elements (GIIS, YAIM) basically remain
unchanged for this glite-CE flavour.– APEL ready
• Support for external SGE_QMASTER (IC and CESGA use this type of configuration in production)
• GridICE sensors for SGE
EGEE User Forum 12
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
References• Xen Virtualization @CESGA
– https://swe-wiki.egee.cesga.es/cgi-bin/moin.cgi/XEN3_Virtual_Machines_-_CESGA-EGEE
• SGE Wiki Page– https://twiki.cern.ch/twiki/bin/view/LCG/ImplementationOfSGE