grid computing
TRANSCRIPT
1
Grid ComputingGrid Computing
Presented ByPresented By
Keshab NathKeshab Nath
2
OutlineOutline• Conception
• Why Grid?
• Grid Topologies
• Grid Architecture
• Key Components
• Globus Toolkit
• Applications
3
What Grid Computing isWhat Grid Computing is
Allows sharing and coordinated use of diverse resources in dynamic, distributed “virtual organizations”.
Grid computing is a service for sharing computer power and data storage capacity over the Internet.
Web– Uniform naming access to documents
Grid
- Uniform, high performance access to computational resources
Colleges/R&D Labs
Software Catalogs
http://
http://
Web vs. GRID
Why Grids?Why Grids?
• Large-scale science and engineering are done through the interaction of people, heterogeneous computing resources, information systems, and instruments, all of which are geographically and organizationally dispersed.
• The overall motivation for “Grids” is to facilitate the interactions of these resources in order to support large-scale science and Engineering.
• Highly User friendly and mature web tech: No training needed
• Provide access via “The grid “ to remote resources of millions of Pc’s from any where to any body.
LocalCluster
2100
2100 2100 2100 2100
2100 2100 2100 2100
Personal Device
SuperCompu
ters
GlobalGrid
EnterpriseCluster/Grid
Scalable ComputingScalable Computing
PERFORMANCE
+
Q
o
S
6
Grid TopologiesGrid Topologies • Intragrid - Local grid within an organization: Single organizations
-A single cluster - Trust based on personal contracts
7
8
• Extragrid – Multiple organizations
– Resources of a organizations connected through a (Virtual) Private Network – Multiple clusters – Trust based on Business to Business contracts
9
• Intergrid – Global sharing of resources through the internet – Many Multiple clusters – Trust based on certification
10
Grid ArchitectureGrid Architecture
Application
Collective
Resource
Connectivity
Fabric
Application
Transport
Internet
Link
GRID
Internet
11
Grid ArchitectureGrid Architecture Fabric layer: Provides the resources to which shared access is controlled
by Grid protocols
The resources normally include physical and logical entities. .
Connectivity layer: Defines the core communication and authentication
protocols required for grid-specific network functions.
These protocols enable the exchange of data between fabric layer
resources.
Resource layer: Defines protocols for secure negotiations, initiation,
monitoring control operation on individual resources.
Information & Management protocols
12
Collective Layer : The collective services layer deals with the scheduling
services, data replications services, and monitoring services.
These services are not associated with any one specific resource but focus
on interactions across resources
Application Layer: These are user applications that operate within VO
environment.
which includes applications in science, engineering, business, finance and more,
as well as portals and development toolkits to support the applications. This is
the layer that grid users "see" and interact with
13
How Does Grid Work
.- If there is a scope for parallelism ,it can also decompose
Your work into convenient execution units based on the available resources
You Submit Your Work
And The Grid-Find convenient places to your data
-Deals with Authentication to the different sites that you will be using-Organizes efficient access to data for running your job
-migration, replication
-Run your jobs, monitor progress, recover from problems, tells you when work is complete
15
Key ComponentsKey Components
• Portal/user interface
16
Key ComponentsKey Components
• Security– Grid Security Infrastructure (GSI)
Key ComponentsKey Components• Broker
– Monitoring and Discovery Service (MDS)
It provides information about available resources on the grid and working status of these resources
18
Key ComponentsKey Components
• Scheduler
If jobs were to become lost due to system or network outages, scheduler would automatically resubmit the job elsewhere.
Key ComponentsKey Components
• Data management – Grid Access to Secondary Storage (GASS)
It is responsible for moving files and data to various nodes within the grid
20
Key ComponentsKey ComponentsJob and resource management
– Grid Resource Allocation Manager (GRAM)
Providing the services to launch a job on a particular resource checking the job’s status, retrieve the results when the job is complete
21
Globus Toolkit 4Globus Toolkit 4
• The Globus® Toolkit is an open source software toolkit used
for building grids. provided by the Globus Alliance.
• Globus Toolkit 4 provides components in the following five
categories:
– Common runtime components
– Security
– Data management
– Information services
– Execution management
22
Globus Toolkit 4 Globus Toolkit 4
Globus Toolkit Globus Toolkit • Common Runtime:
The Common Runtime components provide a set of fundamental libraries and tools which are needed to build both WS and non-WS services.
• Information services:
The Information Services, more commonly referred to as the Monitoring and Discovery Services (MDS), includes a set of components to discover and monitor resources in a virtual organization.
• Execution management
Execution Management components deal with the initiation, monitoring, management, scheduling and coordination of executable programs, usually called jobs, in a Grid.
23
Globus Toolkit Globus Toolkit • Security Using the Security components, based on the Grid Security
Infrastructure (GSI), we can make sure that our communications are secure.
• Data management
These components will allow us to manage large sets of data in our virtual organization.
Modules in this section enable distributed data in the grid to locate , move (GridFTP,RFT) and to manage (OGSA).
24
25
Applications• The Southern California Earthquake Center uses Globus
software to visualize earthquake simulation data.
26
Applications• Scientists in the Earth System Grid (ESG) are producing, archiving, and
providing access to climate data that advances our understanding of global
climate change. ESG uses Globus software for security, data movement,
and system monitoring.
27
Applications• Computational scientists at Brown University are using the
Globus Toolkit and MPICH-G2 to simulate the flow of blood through human arteries.
28
Grid ProjectsGrid ProjectsInternational Grid Projects• Open Middleware Infrastructure Institute Europe (OMII-Europe) - May 2006 -> May 2008• Enabling Grids for E-sciencE (EGEE) - March 2004 -> March 2006• Enabling Grids for E-sciencE II (EGEE II) - April 2006 -> April 2008• BREIN — September 2007 → August 2009• Data TAG - January 2001 -> January 2003• European Data Grid (EDG) - March 2001 -> March 2004• Baltic Grid - November 2005 -> April 2008
National Grid Projects• D-Grid (German)• GARUDA (Indian)• grid computing project at VECC (Calcutta, India)• China Grid Project• INFN Grid (Italian)• Knowledge Grid Malaysia• NAREGI Project• Singapore National Grid Project• Thai National Grid Project• BELNET Grid, Belgium
GARUDAGARUDA
Ended on March 2008, accomplished its deliverables by connecting 17 cities across 45 academic and research institutes country wide along with the required softwares for managing grid computing applications.
29
GARUDA is India's first national grid initiative bringing together academic, scientific and research communities for developing their data and compute intensive applications with guaranteed QoS.
The Department of Information Technology (DIT), Government of India, has funded Centre for Development of Advanced Computing (C-DAC) to deploy the nation-wide computational grid “GARUDA” spanning across 17 cities and 45 institutions with an aim to bring Distributed/Grid networked infrastructure to academic labs, research labs and industries in India
Some current Grids in development and deployment
European Data Grid The European Data Grid is a European Union funded project which
aims to create a huge Grid system for computation and data-sharing. It is aimed at projects in high energy physics, biology and medical image processing, and astronomy.
The National Fusion Collaboratory• The National Fusion Collaboratory project exists to help research
magnetic fusion. Magnetic fusion experiments operate on pulses of plasmas which are produced approximately every 15 minutes. The data generated from each measurement must be analysed within the 15 minutes so that changes can be made to the set up in time for the next pulse
30
31
ReferenceReference• “The Anatomy of the Grid(Enabling Scalable Virtual Organizations)” ---by Ian Foster, Carl Kesselman, Steven Tuecke• “Physiology of the Grid (An Open Grid Services Architecture for
Distributed Systems Integration )” ---by Ian Foster, Carl Kesselman, Jeffrey M. Nick, Steven Tuecke
• “Grid Computing:Past,Present, and Future”--- by Elias Kourpas, June 2006
• “Introduction to Grid Computing” ---IBM Redbook,2005
• IBM Grid Computing:www-03.ibm.com/grid/index.shtml
• Globus website: www.globus.org
32
Thank You!Thank You!