marin silic, goran delac and sinisa srbljic prediction of atomic web services reliability based on...

25
Marin Silic, Goran Delac and Sinisa Srbljic Prediction of Atomic Web Services Reliability Based on K-means Clustering Consumer Computing Laboratory Faculty of Electrical Engineering and Computing University of Zagreb, Croatia http://ccl.fer.h r/ http://www.fer.h r/ http://www.unizg .hr/ ESEC/FSE, Saint Petersburg, Russia, 2013.

Upload: jacob-moody

Post on 20-Jan-2016

212 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Marin Silic, Goran Delac and Sinisa Srbljic Prediction of Atomic Web Services Reliability Based on K-means Clustering Consumer Computing Laboratory Faculty

Marin Silic, Goran Delac and Sinisa Srbljic

Prediction of Atomic Web Services Reliability Based on K-means Clustering

Consumer Computing Laboratory

Faculty of Electrical

Engineering and Computing

University of Zagreb, Croatia

http://ccl.fer.hr/

http://www.fer.hr/

http://www.unizg.hr/

ESEC/FSE, Saint Petersburg, Russia, 2013.

Page 2: Marin Silic, Goran Delac and Sinisa Srbljic Prediction of Atomic Web Services Reliability Based on K-means Clustering Consumer Computing Laboratory Faculty

Outline

Motivation

Reliability in SOA

State-of-the-art

CLUS Approach

Evaluation

Conclusion

ESEC/FSE, Saint Petersburg, Russia, 2013.

Page 3: Marin Silic, Goran Delac and Sinisa Srbljic Prediction of Atomic Web Services Reliability Based on K-means Clustering Consumer Computing Laboratory Faculty

Motivation

Contemporary web applications - SOA

ESEC/FSE, Saint Petersburg, Russia, 2013.

A 2

A 1A 3

A 4

A 5

Web Application

Page 4: Marin Silic, Goran Delac and Sinisa Srbljic Prediction of Atomic Web Services Reliability Based on K-means Clustering Consumer Computing Laboratory Faculty

A 2

A 1

A 3

A 4

Composite service

Process of candidates selection

ESEC/FSE, Saint Petersburg, Russia, 2013.

A 2A 1

A 3A 4

A 5 A 6

Functional properties Nonfunctional properties

Ensure the desired functionality Reliability Availability…

Impact Qos & QoE

Repository

Page 5: Marin Silic, Goran Delac and Sinisa Srbljic Prediction of Atomic Web Services Reliability Based on K-means Clustering Consumer Computing Laboratory Faculty

A 2

A 1

A 3

A 4

A 5

A 6

Service Oriented System

“Reliability on demand” definition

ESEC/FSE, Saint Petersburg, Russia, 2013.

REQ

RES

𝑡𝑅𝐸𝑄𝑡𝑅𝐸𝑆

𝑡𝑅𝐸𝑆−𝑡𝑅𝐸𝑄<∆ 𝑡

𝑝𝑟=𝑁 𝑆

𝑁𝑇

The ratio of successful against

total number of invocations

Application

PastInvocation

Sample

Page 6: Marin Silic, Goran Delac and Sinisa Srbljic Prediction of Atomic Web Services Reliability Based on K-means Clustering Consumer Computing Laboratory Faculty

Drawbacks/Obstacles

Client’s vs. provider’s perspective Service invocation context Depends on the quality of the sample Acquiring a sample proves to be a difficult task

ESEC/FSE, Saint Petersburg, Russia, 2013.

A 2A 1 A 3

A 4A 5 A 6

QoS QoS QoS

QoS QoS QoS

A 1

QoS1

QoS2

QoS QoS1 QoS2≠≠

Service Provider

Client

Client

Page 7: Marin Silic, Goran Delac and Sinisa Srbljic Prediction of Atomic Web Services Reliability Based on K-means Clustering Consumer Computing Laboratory Faculty

Insight to the Solution

To overcome the drawbacks and obstacles

Collect partial, but relevant past invocation sample

Utilize prediction methods to estimate the reliability for the missing records

ESEC/FSE, Saint Petersburg, Russia, 2013.

Page 8: Marin Silic, Goran Delac and Sinisa Srbljic Prediction of Atomic Web Services Reliability Based on K-means Clustering Consumer Computing Laboratory Faculty

State-of-the-art

Collaborative filtering

ESEC/FSE, Saint Petersburg, Russia, 2013.

p1n?…p11 ? … p1i ??…? p22 … ? ………… … … … pun?…pu1 ? … pui ………… … … … pmn?…pm1 ? … pmi

m users

n services ??…?…?

ui matrix m,n >>matrix is extremely sparsenumber of values to predict

Page 9: Marin Silic, Goran Delac and Sinisa Srbljic Prediction of Atomic Web Services Reliability Based on K-means Clustering Consumer Computing Laboratory Faculty

Collaborative filtering

Computes the similarity using PCC Matrix can be employed in two different ways

ESEC/FSE, Saint Petersburg, Russia, 2013.

p1n?…p11 ? … p1i ??…? p22 … ? ………… … … … pun?…pu1 ? … pui ………… … … … pmn?…pm1 ? … pmi

UPCC approach

IPCC approachHybrid approach

Page 10: Marin Silic, Goran Delac and Sinisa Srbljic Prediction of Atomic Web Services Reliability Based on K-means Clustering Consumer Computing Laboratory Faculty

Disadvantages of Collaborative Filtering

Scalability

Having millions of users and services – these approaches do not scale

Accuracy in dynamic environments

Internet is a highly dynamic systemDo not consider environment conditions

ESEC/FSE, Saint Petersburg, Russia, 2013.

Page 11: Marin Silic, Goran Delac and Sinisa Srbljic Prediction of Atomic Web Services Reliability Based on K-means Clustering Consumer Computing Laboratory Faculty

CLUStering

To address scalabilityApplies the principle of aggregationReduces the redundant data by clustering users

and services using K-means

To improve the accuracy Introduces environment-specific parametersDisperses the collected data across the

additional dimension

ESEC/FSE, Saint Petersburg, Russia, 2013.

Page 12: Marin Silic, Goran Delac and Sinisa Srbljic Prediction of Atomic Web Services Reliability Based on K-means Clustering Consumer Computing Laboratory Faculty

CLUS Overview

ESEC/FSE, Saint Petersburg, Russia, 2013.

(1c)

(2c)

(5c)

Data Clustering Phase

r(u, s, t) p(r)

RawData

ClusteredData

Environment Clustering

UsersClustering

ServicesClustering

Creationof D

(3c)

(4c)

Prediction

Prediction Phase

Page 13: Marin Silic, Goran Delac and Sinisa Srbljic Prediction of Atomic Web Services Reliability Based on K-means Clustering Consumer Computing Laboratory Faculty

Environment-specific Clustering

Set of environment conditions

ESEC/FSE, Saint Petersburg, Russia, 2013.

𝐸={𝑒1 ,𝑒2 ,…,𝑒𝑖 ,…,𝑒𝑛}

t0 tct1 ti-1 ti tc-1w1 w2 wi wc

e1 e2 ei …… en

… …𝑝𝑤 𝑖

=1

¿𝑊 𝑖∨¿ ∑𝑟 ∈𝑊 𝑖

𝑝𝑟 ¿

𝑝𝑤1𝑝𝑤2

𝑝𝑤 𝑖𝑝𝑤𝑐

K-meansclusteringA day

Page 14: Marin Silic, Goran Delac and Sinisa Srbljic Prediction of Atomic Web Services Reliability Based on K-means Clustering Consumer Computing Laboratory Faculty

User-specific Clustering

Set of users clusters

ESEC/FSE, Saint Petersburg, Russia, 2013.

𝑈={𝑢1 ,𝑢2 ,…,𝑢𝑖 ,…,𝑢𝑚}

u1 u2 ui …… um

……

𝑝𝑟={𝑝𝑒1 ,𝑝𝑒2 ,…,𝑝𝑒𝑖 ,…,𝑝𝑒𝑛 }

e1 e2 ei …… en𝑝𝑒1𝑝𝑒2

𝑝𝑒𝑖𝑝𝑒𝑛

𝑝𝑟 𝑝𝑟 𝑝𝑟𝑝𝑟

𝑝𝑟 𝑝𝑟 𝑝𝑟 𝑝𝑟 𝑝𝑟

K-meansclustering

Page 15: Marin Silic, Goran Delac and Sinisa Srbljic Prediction of Atomic Web Services Reliability Based on K-means Clustering Consumer Computing Laboratory Faculty

Service-specific Clustering

Set of services clusters

ESEC/FSE, Saint Petersburg, Russia, 2013.

𝑆={𝑠1 ,𝑠2 ,…, 𝑠𝑖 ,…,𝑠𝑙 }

s1 s2 si …… sl

……e1 e2 ei …… en𝑝𝑒1𝑝𝑒2

𝑝𝑒𝑖𝑝𝑒𝑛 K-meansclustering

𝑝𝑟 𝑝𝑟 𝑝𝑟𝑝𝑟

𝑝𝑟 𝑝𝑟 𝑝𝑟 𝑝𝑟 𝑝𝑟

𝑝𝑟={𝑝𝑒1 ,𝑝𝑒2 ,…,𝑝𝑒𝑖 ,…,𝑝𝑒𝑛 }

Page 16: Marin Silic, Goran Delac and Sinisa Srbljic Prediction of Atomic Web Services Reliability Based on K-means Clustering Consumer Computing Laboratory Faculty

Creation of Space D Each record, r(u, s, t), is associated to the

belonging clusters uk , sj , ei Each entry in D is computed as follows:

R contains all the records that belong to clusters uk , sj , ei

ESEC/FSE, Saint Petersburg, Russia, 2013.

𝐷 [𝑢𝑘 ,𝑠 𝑗 ,𝑒𝑖 ]=1

¿𝑅∨¿ ∑𝑟 ∈𝑅

𝑝𝑟 ¿

Page 17: Marin Silic, Goran Delac and Sinisa Srbljic Prediction of Atomic Web Services Reliability Based on K-means Clustering Consumer Computing Laboratory Faculty

Prediction

Assuming an ongoing rc=(uc, sc, tc) First, it checks the collected sample:

If H is not empty

Otherwise,

ESEC/FSE, Saint Petersburg, Russia, 2013.

𝐻={𝑟 h∨𝑢𝑐=𝑢h⋀ 𝑠𝑐=𝑠h⋀ 𝑡𝑐 , 𝑡h∈𝑤𝑖 }

𝑝𝑐=1

¿𝐻∨¿ ∑𝑟 ∈𝐻

𝑝𝑟 ¿

𝑝𝑐=𝐷 [𝑢𝑘 , 𝑠 𝑗 ,𝑒𝑖 ] ,𝑢𝑐∈𝑢𝑘∧𝑠𝑐∈𝑠 𝑗∧𝑡𝑐∈𝑒𝑖

Page 18: Marin Silic, Goran Delac and Sinisa Srbljic Prediction of Atomic Web Services Reliability Based on K-means Clustering Consumer Computing Laboratory Faculty

Evaluation

Comparison with the state-of-the-artUPCC IPCCHybrid

Evaluation measuresPrediction accuracy

oMAE, RMSEPrediction performance

oAggregated prediction time

ESEC/FSE, Saint Petersburg, Russia, 2013.

Page 19: Marin Silic, Goran Delac and Sinisa Srbljic Prediction of Atomic Web Services Reliability Based on K-means Clustering Consumer Computing Laboratory Faculty

Evaluation

Experiment setupAmazon EC2 Cloud

ESEC/FSE, Saint Petersburg, Russia, 2013.

Data

Page 20: Marin Silic, Goran Delac and Sinisa Srbljic Prediction of Atomic Web Services Reliability Based on K-means Clustering Consumer Computing Laboratory Faculty

Evaluation

Results – Impact of data densityPrediction accuracy – with load intensity

ESEC/FSE, Saint Petersburg, Russia, 2013.

Page 21: Marin Silic, Goran Delac and Sinisa Srbljic Prediction of Atomic Web Services Reliability Based on K-means Clustering Consumer Computing Laboratory Faculty

Evaluation

Results – Impact of data densityPrediction performance – with load intensity

ESEC/FSE, Saint Petersburg, Russia, 2013.

Page 22: Marin Silic, Goran Delac and Sinisa Srbljic Prediction of Atomic Web Services Reliability Based on K-means Clustering Consumer Computing Laboratory Faculty

Evaluation

Results – Impact of number of clustersPrediction accuracy, Data density = 20%

ESEC/FSE, Saint Petersburg, Russia, 2013.

Page 23: Marin Silic, Goran Delac and Sinisa Srbljic Prediction of Atomic Web Services Reliability Based on K-means Clustering Consumer Computing Laboratory Faculty

Evaluation

Results – Impact of number of clustersPrediction performance, Data density = 20%

ESEC/FSE, Saint Petersburg, Russia, 2013.

Page 24: Marin Silic, Goran Delac and Sinisa Srbljic Prediction of Atomic Web Services Reliability Based on K-means Clustering Consumer Computing Laboratory Faculty

Conclusion

Proposed a CLUS approach Improved the prediction accuracy

By introducing environment-specific parameters At least 56% lower RMSE value than the state-of-the-art

Improved the prediction performance By applying principle of aggregation Execution time reduced for two orders of magnitude when

compared to the state-of-the-art Flexibility of approach

Trade-off between accuracy and scalability Can be applied in different environments

ESEC/FSE, Saint Petersburg, Russia, 2013.

Page 25: Marin Silic, Goran Delac and Sinisa Srbljic Prediction of Atomic Web Services Reliability Based on K-means Clustering Consumer Computing Laboratory Faculty

Q&A

Thanks the audience for listening.

ESEC/FSE, Saint Petersburg, Russia, 2013.