ajou university, south korea icsoc 2003 “disconnected operation service in mobile grid...

28
Ajou University, South Korea ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing” Disconnected Operation Disconnected Operation Service in Mobile Grid Service in Mobile Grid Computing Computing Sang-Min Park , Young-Bae Ko, and Jai-Hoon Kim Ajou University South Korea

Upload: marcia-freeman

Post on 25-Dec-2015

221 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Ajou University, South Korea ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing” Disconnected Operation Service in Mobile Grid Computing

Ajou University, South Korea

ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing”

Disconnected Operation Service in Disconnected Operation Service in Mobile Grid ComputingMobile Grid Computing

Sang-Min Park , Young-Bae Ko,

and Jai-Hoon Kim

Ajou University

South Korea

Page 2: Ajou University, South Korea ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing” Disconnected Operation Service in Mobile Grid Computing

Ajou University, South Korea2

ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing”

ContentsContents

Introduction to Grid

Wireless Mobile Environment

Mobile Grid – Two Possible ScenariosMobile Interface to Grid

Mobile Resources for Grid

Proposed Job Scheduling Algorithm for Mobile Grid System

Performance Evaluations

Conclusions and Future Works

Page 3: Ajou University, South Korea ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing” Disconnected Operation Service in Mobile Grid Computing

Ajou University, South Korea3

ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing”

Introduction to GridIntroduction to Grid

What is Grid?Large scale distributed computing on InternetDesigned for effectively solving complicated problemsVarious kinds of resources are autonomously managed at distributed sites

Effects of GridTo increase job throughput (performance)To increase utilization rate of resourcesEnabling cooperative work

Grid SoftwareGrid middleware (Ex: Globus Toolkit, Unicore)Standardization efforts (OGSA)Application toolkits (AppLes, Nimrod-G, Ninf-G, etc)Parallel computing (MPICH-G)

Page 4: Ajou University, South Korea ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing” Disconnected Operation Service in Mobile Grid Computing

Ajou University, South Korea4

ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing”

Wireless Mobile EnvironmentWireless Mobile Environment

Characteristics

Users can freely move with heterogeneous mobile devices

Wireless communication is mainly used to access information, not to compute the information

Main Constraints

Scarce host resources : computational capability, small amount of memory, limited battery life

Scarce network resources : narrow bandwidth, increased latency, small coverage

Weak security over wireless communication

Intermittent connectivity and frequent node failures

Heterogeneous systems (Hardware, O/S, Applications)

Page 5: Ajou University, South Korea ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing” Disconnected Operation Service in Mobile Grid Computing

Ajou University, South Korea5

ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing”

Two Possible Scenarios in Two Possible Scenarios in Mobile GridMobile GridMobile Interface to Grid

Mobile Resources for Grid

Page 6: Ajou University, South Korea ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing” Disconnected Operation Service in Mobile Grid Computing

Ajou University, South Korea6

ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing”

Mobile device as grid interfaceMobile device as grid interface

Job submission, monitoring, result acquisition through mobile devices

Grid users can manage their jobs anytime, anywhere, conveniently

Grid promises the performance and reliability

Page 7: Ajou University, South Korea ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing” Disconnected Operation Service in Mobile Grid Computing

Ajou University, South Korea7

ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing”

IssuesIssues

Adapting to various interfaces of mobile devicesLaptops operated by Microsoft windows

PDAs having very small display and no keyboards

But, we need to sustain same functionalities

Reliable job managementMobile devices are exposed to frequent failures

The resources are scarce

How can we manage the job reliably?

Agent-based Job ManagementAn Agent on a static system performs job management on behalf of the mobile user

Job initiation, interaction, monitoring, and completion process could be performed by this Agent

The process of this Agent would be dependent on the condition of mobile user

Page 8: Ajou University, South Korea ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing” Disconnected Operation Service in Mobile Grid Computing

Ajou University, South Korea8

ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing”

Mobile Device as Grid ResourceMobile Device as Grid Resource

ScenariosMobile devices serve their computing capability, memory space, network resource for the grid jobsA group of mobile devices may form an actual grid siteA grid job submitted from static grid site run on the mobile devices

Why we should consider these thin devices as grid resources?

A performances of mobile devices are increasing, significantlyLaptop has similar computing performance, memory size, storage amountA PDA can have a 400Mhz CPU, 128 MB memory, wireless Internet capability

Wireless networking technologies are rapidly developingThe number of mobile users grow fastA boundary of grid resource will be broadening

Page 9: Ajou University, South Korea ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing” Disconnected Operation Service in Mobile Grid Computing

Ajou University, South Korea9

ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing”

Proposed ArchitectureProposed Architecture

Mobile QueuingServer (MQS)

Job4 Job3 Job2 Job1

Grid Middleware

M-Grid Gateway

Internet

WirelessLAN AP

WirelessLAN AP

Job decompose anddistribute

LAN

Job

Page 10: Ajou University, South Korea ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing” Disconnected Operation Service in Mobile Grid Computing

Ajou University, South Korea10

ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing”

IssuesIssues

Fault toleranceFrequent disconnections, exhausted battery…Even though a single mobile device is prone to failures, a collection of resources should be reliable and powerful

Dealing with Platform HeterogeneityA large portion of grid applications is based on the Unix-like OSMicrosoft Windows, Pocket PC, Palm OS are dominating mobile OS

Job DecompositionThe normal grid job is too heavy to be executed on a single mobile device

Job SchedulingHow can we distribute jobs to mobile resources? What principle should be applied for?

Page 11: Ajou University, South Korea ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing” Disconnected Operation Service in Mobile Grid Computing

Ajou University, South Korea11

ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing”

Job Scheduling Algorithm in Job Scheduling Algorithm in Mobile GridMobile Grid

Why we should deal with this scheduling problem?Scheduling algorithms in static grid: FCFS (based on Machine Speed)

Disconnections and failures are common in wireless networks

The number of mobile nodes participating in computing is NOT necessarily proportional to the performance of Grid.

Our proposed job scheduling algorithmTo consider disconnection and reconnection rates of mobile resources, in addition to the job execution speed

To figure out the number of mobile nodes which are expected to produce the best performance

Page 12: Ajou University, South Korea ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing” Disconnected Operation Service in Mobile Grid Computing

Ajou University, South Korea12

ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing”

Job Scheduling Algorithm in Job Scheduling Algorithm in Mobile GridMobile Grid

Job Execution Process in Mobile Grid System

Mobile QueuingServer (MQS)

Job4 Job3 Job2 Job1

Grid Middleware

M-Grid Gateway

Internet

WirelessLAN AP

WirelessLAN AP

Job decompose anddistribute

LAN

Job

Job Executioninputtransfer

outputtransfer

Page 13: Ajou University, South Korea ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing” Disconnected Operation Service in Mobile Grid Computing

Ajou University, South Korea13

ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing”

Job Scheduling Algorithm in Job Scheduling Algorithm in Mobile GridMobile Grid

Three steps to construct the algorithmCalculate the expected time to transfer input and output data

Determine the job completion time of every mobile node (Job execution time + input and output transfer time)

Find out the optimal number of mobile nodes for the best job performance

number of participating nodes

(ordered by increasing response time)

time

decreasing computingtime of jobs

increasing waiting timeof data (input, output)

job execution time(computing time+dataexchange time

optimal number of

nodes

Page 14: Ajou University, South Korea ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing” Disconnected Operation Service in Mobile Grid Computing

Ajou University, South Korea14

ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing”

Job Scheduling Algorithm in Job Scheduling Algorithm in Mobile GridMobile Grid

First step: Calculate the expected input and output data transfer time (to and from the mobile node)

)(tfc

,

)(tfc: Time required to transfer ‘: Time required to transfer ‘t’ t’ time unit of data when the node is time unit of data when the node is initially initially in connection statein connection state

)(tfd : Time required to transfer ‘: Time required to transfer ‘t’ t’ time unit of data when the node is time unit of data when the node is initially initially in disconnection statein disconnection state

Then, how to determine the values of and ?Then, how to determine the values of and ?)(tfd)(tfc

)()()( tfPtfPtf ddcc

Page 15: Ajou University, South Korea ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing” Disconnected Operation Service in Mobile Grid Computing

Ajou University, South Korea15

ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing”

First Step (Cont’d)First Step (Cont’d)

C D

CP

,

DP

)()(),( tftftC cc Time required to perform ‘Time required to perform ‘ ’ time ’ time

units of transferunits of transfer

Probability of Probability of connection and connection and

disconnection statedisconnection state

• No disconnection occurs during No disconnection occurs during data transfer:data transfer:

e

• Disconnection occurs during Disconnection occurs during data transfer:data transfer:

e1

Probability : (Poisson Probability : (Poisson process)process)Transfer time : Transfer time :

Probability : (Poisson Probability : (Poisson process)process)

Transfer time :Transfer time :1

1)1(

)()(),(

ee

tftftC cc

Detailed Procedures

Page 16: Ajou University, South Korea ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing” Disconnected Operation Service in Mobile Grid Computing

Ajou University, South Korea16

ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing”

First Step (Cont’d)First Step (Cont’d)

)(tfc

1

1)1(

lim

)()(lim

)(

0

0

ee

tftf

t

tf==

t

1

t

tftf cd

11

)(1

)(

Detailed Procedures

Page 17: Ajou University, South Korea ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing” Disconnected Operation Service in Mobile Grid Computing

Ajou University, South Korea17

ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing”

First Step (Cont’d)First Step (Cont’d)

Detailed Procedures

tt

tfPtfPtf dc Dc

11

1

)()()(

1

1),,(

ttf

Here, we can get the expected time to transfer the data Here, we can get the expected time to transfer the data for which needs for which needs t t timetime unit of connectionunit of connection

Page 18: Ajou University, South Korea ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing” Disconnected Operation Service in Mobile Grid Computing

Ajou University, South Korea18

ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing”

Job Scheduling Algorithm in Job Scheduling Algorithm in Mobile GridMobile Grid

Second step: Determine job completion time in every mobile nodes and sort them

)()(),,( outijobinioutjobini tfttftttg Job completion time in a mobile nodeJob completion time in a mobile node

jobt depends on the number of mobile node participating depends on the number of mobile node participating in the computingin the computing

Page 19: Ajou University, South Korea ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing” Disconnected Operation Service in Mobile Grid Computing

Ajou University, South Korea19

ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing”

Job Scheduling Algorithm in Job Scheduling Algorithm in Mobile GridMobile Grid

Third step: Find the optimal number of participating nodes which makes the best job performance

When we add a node in the computing

++ : the amount of job that a single mobile node perform decreases : the amount of job that a single mobile node perform decreases

- - : we may need to wait so long for the reconnection of the newly : we may need to wait so long for the reconnection of the newly added mobile node, especially if the node remains disconnected longadded mobile node, especially if the node remains disconnected long

So, we determine whether adding a node to the computing will So, we determine whether adding a node to the computing will result in better job execution time or not, in the sorted listresult in better job execution time or not, in the sorted list

number of participating nodes

(ordered by increasing response time)

time

decreasing computingtime of jobs

increasing waiting timeof data (input, output)

job execution time(computing time+dataexchange time

optimal number of

nodes

Page 20: Ajou University, South Korea ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing” Disconnected Operation Service in Mobile Grid Computing

Ajou University, South Korea20

ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing”

Performance EvaluationsPerformance Evaluations

MethodsMathematical analysis, simulations, and implementations

Implementation MethodSimple prototype of MQS (Mobile Queuing Server)

Performs mobile node management, job distribution, monitoring, and result gathering process

Disconnection and reconnection is regarded as a normal activity

Considers the connection state of the node when executing the job

Page 21: Ajou University, South Korea ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing” Disconnected Operation Service in Mobile Grid Computing

Ajou University, South Korea21

ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing”

Performance EvaluationsPerformance Evaluations

Input Variables for assumed mobile environments

Mobile environments Max Min Max Min

Stable 0.003 0.001 0.003 0.001

Highly disconnected 0.027 0.009 0.003 0.001

Unstable 0.027 0.009 0.027 0.009

Highly connected 0.003 0.001 0.027 0.009

: disconnection rate of mobile node

: reconnection rate of mobile node

Page 22: Ajou University, South Korea ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing” Disconnected Operation Service in Mobile Grid Computing

Ajou University, South Korea22

ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing”

Performance EvaluationsPerformance Evaluations

Mobile Node Specification

(10 Machines)

CPU P4 1.6 GHzMemory 256 MB

Network Connection

802.11b wireless LAN

ApplicationBlast (Bioinformatics)

 

Workload amountNumber of protein sequences needed for

1000 seconds of processing 

Network disconnection & reconnection

Exponentially random value with respect to the and of mobile node.

Manually configured the disconnection and reconnection of mobile node

DataInput

Decomposed protein sequences(5 KB/number of nodes)

OutputInformation achieved from protein

(200 KB/number of nodes)

Experimental environments

Page 23: Ajou University, South Korea ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing” Disconnected Operation Service in Mobile Grid Computing

Ajou University, South Korea23

ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing”

Performance EvaluationsPerformance Evaluations

0

200

400

600

800

1000

1200

1400

1600

1800

2000

1 2 3 4 5 6 7 8 9 10number of mobile nodes

time

(s

ec

)highly disconnectivestableunstablehigly connective

Analytical results in four difference environmentsAnalytical results in four difference environments

Page 24: Ajou University, South Korea ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing” Disconnected Operation Service in Mobile Grid Computing

Ajou University, South Korea24

ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing”

Performance EvaluationsPerformance Evaluations

0

200

400

600

800

1000

1200

1400

1 2 3 4 5 6 7 8 9 10number of mobile nodes

time

(s

ec

)

AnalysisSimulationEx periment

Optimal at

3 nodes

Job completion time in a Job completion time in a stablestable environmentsenvironments

0

200

400

600

800

1000

1200

1400

1600

1800

1 2 3 4 5 6 7 8 9 10number of mobile nodes

time

(s

ec

)

AnalysisSimulationEx periment

Optimal at

5 nodes

Job completion time in a Job completion time in a highly highly disconnecteddisconnected environments environments

Page 25: Ajou University, South Korea ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing” Disconnected Operation Service in Mobile Grid Computing

Ajou University, South Korea25

ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing”

Performance EvaluationsPerformance Evaluations

0

100

200

300

400

500

600

700

800

900

1000

1100

1200

1 2 3 4 5 6 7 8 9 10number of mobile nodes

time

(s

ec

)

AnalysisSimulationEx periment

Optimal at

10 nodes

Job completion time in an Job completion time in an unstable unstable environmentsenvironments

0

100

200

300

400

500

600

700

800

900

1000

1100

1200

1 2 3 4 5 6 7 8 9 10number of mobile nodes

tim

e (

se

c)

AnalysisSimulationEx periment

Optimal at

10 nodes

Job completion time in a Job completion time in a highly highly connectedconnected environments environments

Page 26: Ajou University, South Korea ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing” Disconnected Operation Service in Mobile Grid Computing

Ajou University, South Korea26

ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing”

Performance EvaluationsPerformance Evaluations

What we learn from these results?

The proposed job scheduling algorithm is suitable to be adapted to mobile environments, especially when the mobile nodes tend to remain in disconnection state long

Page 27: Ajou University, South Korea ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing” Disconnected Operation Service in Mobile Grid Computing

Ajou University, South Korea27

ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing”

ConclusionsConclusions

Discussed on the issue of integrating wireless mobile environments into Grid

Proposed gateway-based architecture where the mobile nodes play two roles: grid interfaces and resource providers

Presented several technical issues to be solved for the future Mobile Grid system

Proposed job scheduling algorithm which take accounts of the connection/disconnection rate and the execution performance of mobile node

The experimental results show that the proposed scheduling algorithm could be viable approach to deal with performance degradation of Mobile Grid system

Page 28: Ajou University, South Korea ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing” Disconnected Operation Service in Mobile Grid Computing

Ajou University, South Korea28

ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing”

Thank you!Thank you!