basic grid registry configuration – there is not any backup data grid registry configuration where...

1
0,0 0,2 0,4 0,6 0,8 1,0 1,2 1,4 1,6 1,8 1 2 3 4 5 6 7 8 9 10 11 12 summary response time [s] ping (GR 2.0) ping (GR 1.0) search service (GR 2.0) search service (GR 1.0) num berofnodes ` 0,0 0,2 0,4 0,6 0,8 1,0 1,2 1,4 1,6 1,8 0 1 2 3 4 5 6 7 8 9 10 11 summary response time [s] ping search serivce num berofcrashed nodes 0 1 2 3 4 5 6 7 0 1 2 3 4 summary response time [s] num beroferrors 0,0 0,5 1,0 1,5 2,0 2,5 3,0 3,5 0 10 20 30 40 50 60 70 80 90 100 num berofitem s synchronization time [s] synchronization ofdatabases synchronization ofrouting tables U ser node A1 node A2 node A12 ... XM L DB XML DB XM L DB U ser node A1 node A 2 node A12 ... XML DB XM L DB XM L DB node A1 node A 2 node A12 ... XML DB XM L DB XM L DB node A node AB node AA currentprovider Mathematics: A lgebra Mathematics: Discrete Mathematics Mathematics echo timeout XM L DB XM L DB node AA Mathematics: A lgebra U ser XM L DB 2 echo 3 7 4 5 6 1 node A node AB node AA Mathematics : A lge bra Mathematics: Discrete Mathematics Mathematics echo tim eo ut node AA currentprovider Mathematics : A lgebra Adm inistrator 3 echo 3 4 1 2 node AAA new node Mathematics : A lgebra: Line ar A lgebra node A node AB node AA unupdated Mathematics: Algebra Mathematics: Discrete Mathematics Mathe matics e cho echo 2 node AA currentprovider Mathematics: A lgebra echo 1 1 3 node A AA Mathematics: Algebra: Linear Algebra 2 echo echo node A node A B node AA Mathematics: Algebra Mathematics : Discrete Mathematics Mathematics echo echo XM L DB XM L DB node AA backup Mathematics: Algebra Adm inistrator XM L DB echo Basic Grid Registry configuration – there is not any backup data Grid Registry configuration where every domain has duplicated information Mathem atics U ser Graph Theory Discrete Mathem atics Algebra 1 XM L DB 2 9 3 8 7 4 6 5 10 Find all services solving TSP problem Marian Bubak 1,2 , Cezary Górka 1 , Marek Kasztelnik 1 , Maciej Malawski 1,2 , Tomasz Gubała 2 1 Institute of Computer Science AGH, Mickiewicza 30, 30-059 Kraków, Poland 2 Academic Computer Centre CYFRONET, Nawojki 11, 30-950 Kraków, Poland [email protected], [email protected], [email protected], [email protected], [email protected] Building applications using Web or Grid services has become increasingly popular A user connects services into the workflow to perform needed computation There has to be a registry storing information about Web or Grid services (Grid Registry) Need of a fault-tolerant version of the Grid Registry For fault tolerance data stored in registry has to be redundant If data are duplicated, a synchronization mechanism is needed M. Bubak, T. Gubala, M. Kapałka, M. Malawski, K. Rycerz, Workflow composer and service registry for grid applications, Future Generation Computer Systems, vol. 21, no. 1, 2005, pp. 79-86. • stores information about Web and Grid services (syntactic, semantic and human- readable description) • distributed, scalable • Grid-enabled The system is composed of single nodes Initial registry configuration. User can ask registry about information from Mathematics, Mathematics : Algebra and Mathematics : Discrete Mathematics domains Information stored in domain Mathematics : Algebra is duplicated Echo messages are sent to ancestors by all the nodes. It provides knowledge about current registry configuration Using Echo mechanism the registry detects that the node AA from domain Mathematics : Algebra crashed Registry reacts to this information – changes in Local Routing Table Query is redirected to backup node (1 – 7) One of the nodes from domain Mathematics : Algebra is still unreachable Administrator still can modify registry configuration (1 - 4) All information stored in registry is available for the user When broken node is repaired, it synchronizes information with the most up to date node from domain (2) All changed entries in Local Routing Table are updated in repaired node (2) If necessary, new connections are established (3) The test shows a comparison between response time depending on number of hops that message has to pass while reaching its destination in prototype and fault-tolerant version of the Grid Registry Before Grid Registry reacts to unreachable nodes, user can send query. Then it can be redirected to broken node. In this case error message is generated and then user's query is redirected to backup node. This test presents such a situation. Response time depends on number of generated error messages Grid Registry can modify its Local Routing Table, so query will not be redirected to broken nodes. Graph presents response time depending on number of broken nodes When broken node becomes reachable, Local Routing Table and XML database have to be synchronized. Test shows synchronization time depending on number of items that have to be synchronized Functionality solving this problem is available in new version of the Grid Registry • Every node is a single point of failure this problem could be solved by adding data redundancy Desynchronization of data Overloaded nodes

Upload: gwenda-mccarthy

Post on 03-Jan-2016

220 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Basic Grid Registry configuration – there is not any backup data Grid Registry configuration where every domain has duplicated information Find all services

0,0

0,2

0,4

0,6

0,8

1,0

1,2

1,4

1,6

1,8

1 2 3 4 5 6 7 8 9 10 11 12

sum

ma

ry r

esp

on

se ti

me

[s]

ping (GR 2.0)

ping (GR 1.0)

search service (GR 2.0)

search service (GR 1.0)

number of nodes

`

0,0

0,2

0,4

0,6

0,8

1,0

1,2

1,4

1,6

1,8

0 1 2 3 4 5 6 7 8 9 10 11

sum

mar

y re

spon

se ti

me

[s]

ping search serivce

number of crashed nodes

0

1

2

3

4

5

6

7

0 1 2 3 4

sum

mar

y re

spon

se ti

me

[s]

number of errors

0,0

0,5

1,0

1,5

2,0

2,5

3,0

3,5

0 10 20 30 40 50 60 70 80 90 100

number of items

sync

hron

izat

ion

time

[s]

synchronization of databases

synchronization of routing tables

User

node A1 node A2 node A12...

XML DB

XML DB

XML DB

User

node A1 node A2 node A12...

XML DB

XML DB

XML DB

node A1 node A2 node A12...

XML DB

XML DB

XML DB

node A

node ABnode AAcurrent provider

Mathematics: AlgebraMathematics:

Discrete Mathematics

Mathematics

echotim

eout

XML DB

XML DB

node AA

Mathematics: Algebra

User

XML DB

2

echo 3

7

45

6

1

node A

node ABnode AA

Mathematics: AlgebraMathematics:

Discrete Mathematics

Mathematics

echotim

eout

node AAcurrent provider

Mathematics: Algebra

Administrator

3

ech

o 3

41

2

node AAAnew node

Mathematics: Algebra: Linear Algebra

node A

node ABnode AAunupdated

Mathematics: AlgebraMathematics:

Discrete Mathematics

Mathematics

echoech

o 2

node AAcurrent provider

Mathematics: Algebra

echo 1

1

3

node AAA

Mathematics: Algebra: Linear Algebra

2

echo

echo

node A

node ABnode AA

Mathematics: AlgebraMathematics:

Discrete Mathematics

Mathematics

echoech

o

XML DB

XML DB

node AAbackup

Mathematics: Algebra

Administrator

XML DB

ech

o

Basic Grid Registry configuration – there is not any backup data

Grid Registry configuration where every domain has duplicated information

Mathematics

User

Graph Theory

Discrete Mathematics

Algebra

1

XML DB

2

9

3

8

7 4

6

5

10Find all services

solving TSP problem

Marian Bubak1,2, Cezary Górka1, Marek Kasztelnik1, Maciej Malawski1,2, Tomasz Gubała2

1Institute of Computer Science AGH, Mickiewicza 30, 30-059 Kraków, Poland2Academic Computer Centre CYFRONET, Nawojki 11, 30-950 Kraków, Poland

[email protected], [email protected], [email protected], [email protected], [email protected]

• Building applications using Web or Grid services has become increasingly popular

• A user connects services into the workflow to perform needed computation

• There has to be a registry storing information about Web or Grid services (Grid Registry)

• Need of a fault-tolerant version of the Grid Registry• For fault tolerance data stored in registry has to be

redundant• If data are duplicated, a synchronization mechanism

is needed

M. Bubak, T. Gubala, M. Kapałka, M. Malawski, K. Rycerz, Workflow composer and service registry for grid applications, Future Generation Computer Systems, vol. 21, no. 1, 2005, pp. 79-86.

• stores information about Web and Grid services (syntactic, semantic and human-readable description)

• distributed, scalable• Grid-enabled

• The system is composed of single nodes

• Initial registry configuration.

• User can ask registry about information from Mathematics, Mathematics : Algebra and Mathematics : Discrete Mathematics domains

• Information stored in domain Mathematics : Algebra is duplicated

• Echo messages are sent to ancestors by all the nodes. It provides knowledge about current registry configuration

• Using Echo mechanism the registry detects that the node AA from domain Mathematics : Algebra crashed

• Registry reacts to this information – changes in Local Routing Table

• Query is redirected to backup node (1 – 7)

• One of the nodes from domain Mathematics : Algebra is still unreachable

• Administrator still can modify registry configuration (1 - 4)

• All information stored in registry is available for the user

• When broken node is repaired, it synchronizes information with the most up to date node from domain (2)

• All changed entries in Local Routing Table are updated in repaired node (2)

• If necessary, new connections are established (3)

The test shows a comparison between response time depending on number of hops that message has to pass while reaching its destination in prototype and fault-tolerant version of the Grid Registry

Before Grid Registry reacts to unreachable nodes, user can send query. Then it can be redirected to broken node. In this case error message is generated and then user's query is redirected to backup node. This test presents such a situation. Response time depends on number of generated error messages

Grid Registry can modify its Local Routing Table, so query will not be redirected to broken nodes. Graph presents response time depending on number of broken nodes

When broken node becomes reachable, Local Routing Table and XML database have to be synchronized. Test shows synchronization time depending on number of items that have to be synchronized

Functionality solving this problem is available in new version of the Grid Registry

• Every node is a single point of failure this problem could be solved by adding data redundancy• Desynchronization of data• Overloaded nodes