opodis'04 a protocol for recording provenance in service-oriented grids paul groth, michael...

24
OPODIS'04 A protocol for recording provenance in service-oriented Grids Paul Groth, Michael Luck, Luc Moreau University of Southampton

Upload: shanna-booker

Post on 13-Dec-2015

215 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: OPODIS'04 A protocol for recording provenance in service-oriented Grids Paul Groth, Michael Luck, Luc Moreau University of Southampton

OPODIS'04

A protocol for recording provenance in service-oriented Grids

Paul Groth, Michael Luck, Luc Moreau

University of Southampton

Page 2: OPODIS'04 A protocol for recording provenance in service-oriented Grids Paul Groth, Michael Luck, Luc Moreau University of Southampton

OPODIS'04

Or…How to show your work.

In a Grid

Page 3: OPODIS'04 A protocol for recording provenance in service-oriented Grids Paul Groth, Michael Luck, Luc Moreau University of Southampton

OPODIS'04

Contents

1. What is Provenance and why you should care

2. The Grid and Provenance

3. An Architectural Vision

4. PReP

6. What’s next

7. Conclusion

Page 4: OPODIS'04 A protocol for recording provenance in service-oriented Grids Paul Groth, Michael Luck, Luc Moreau University of Southampton

OPODIS'04

A Definition

Main Entry: prov·e·nance Pronunciation: 'präv-n&n(t)s, 'prä-v&-"nän(t)sFunction: nounEtymology: French, from provenir to come forth, originate, from Latin provenire, from pro- forth + venire to come Date: 17851 : ORIGIN, SOURCE2 : the history of ownership of a valued object or work of art or literature

The provenance of something is the

documentation of process that led to that something

i.e. showing your work

Page 5: OPODIS'04 A protocol for recording provenance in service-oriented Grids Paul Groth, Michael Luck, Luc Moreau University of Southampton

OPODIS'04

The importance of provenance

Process is IMPORTANT

Art Wine Drug Discovery Financial Auditing Aerospace …

Page 6: OPODIS'04 A protocol for recording provenance in service-oriented Grids Paul Groth, Michael Luck, Luc Moreau University of Southampton

OPODIS'04

The Grid

The Grid problem is defined as coordinated resource sharing and problem solving in dynamic, multi-institutional virtual organisations [Foster, Kesselman, Tuecke].

Page 7: OPODIS'04 A protocol for recording provenance in service-oriented Grids Paul Groth, Michael Luck, Luc Moreau University of Southampton

OPODIS'04

… and the Provenance Problem

Given a set of services in an open grid environment that decide to form a virtual organisation with the aim to produce a given result;

How can we determine the process that generated the result, especially after the virtual organisation has been disbanded?

Page 8: OPODIS'04 A protocol for recording provenance in service-oriented Grids Paul Groth, Michael Luck, Luc Moreau University of Southampton

OPODIS'04

Provenance Problem cont.

Provenance recording should be part of the infrastructure

Currently, the Web Services protocol stack and the Open Grid Services Architecture do not provide any support for recording provenance.

Methods are generally adhoc and do not interoperate.

Page 9: OPODIS'04 A protocol for recording provenance in service-oriented Grids Paul Groth, Michael Luck, Luc Moreau University of Southampton

OPODIS'04

More Definitions

Provenance The provenance of some data is the documentation of

process that led to the data.

Interaction Provenance For some data, interaction provenance is the

documentation of interactions between actors that led to the data.

Actor Provenance For some data, actor provenance is documentation

that can only be provided by a particular actor pertaining to the process that led to the data.

Page 10: OPODIS'04 A protocol for recording provenance in service-oriented Grids Paul Groth, Michael Luck, Luc Moreau University of Southampton

OPODIS'04

An Architecture

Page 11: OPODIS'04 A protocol for recording provenance in service-oriented Grids Paul Groth, Michael Luck, Luc Moreau University of Southampton

OPODIS'04

Motivating Requirements

Verifiability Accountability Reproducibility

Repeat Reproduce

Preservation Scalability Generality Customisability

Page 12: OPODIS'04 A protocol for recording provenance in service-oriented Grids Paul Groth, Michael Luck, Luc Moreau University of Southampton

OPODIS'04

An Architecture with Provenance Support

Page 13: OPODIS'04 A protocol for recording provenance in service-oriented Grids Paul Groth, Michael Luck, Luc Moreau University of Southampton

OPODIS'04

PReP- Provenance Recording Protocol

client serviceinvocation

result

recordinvocationand result

recordinvocationand result

negotiate

Why record 2 views?

ProvenanceStore

ProvenanceStore

ProvenanceStore

Page 14: OPODIS'04 A protocol for recording provenance in service-oriented Grids Paul Groth, Michael Luck, Luc Moreau University of Southampton

OPODIS'04

Multiple Provenance Stores

client serviceinvocationresult

ProvenanceStore

invocationand result record

invocationand result record

client serviceinvocationresult

ProvenanceStoreinvocation

and result recordinvocation

and result record

client serviceinvocationresult

ProvenanceStore

invocationand result record

invocationand result record

Provenance stores may be shared or different

Page 15: OPODIS'04 A protocol for recording provenance in service-oriented Grids Paul Groth, Michael Luck, Luc Moreau University of Southampton

OPODIS'04

Linking Stores

client serviceinvocationresult

ProvenanceStore

invocationand result record

invocationand result record

client serviceinvocationresult

ProvenanceStoreinvocation

and result recordinvocation

and result record

client serviceinvocationresult

ProvenanceStore

invocationand result record

invocationand result record

Link

Page 16: OPODIS'04 A protocol for recording provenance in service-oriented Grids Paul Groth, Michael Luck, Luc Moreau University of Southampton

OPODIS'04

PReP in detail

Model PReP using asynchronous message passing. Maps well to any implementation Helpful for scalability

Four Phase Protocol Negotiation Invocation Provenance Recording Termination

Page 17: OPODIS'04 A protocol for recording provenance in service-oriented Grids Paul Groth, Michael Luck, Luc Moreau University of Southampton

OPODIS'04

PReP’s messages

ProposeReplyInvoke ResultRecord NegotiationRecord InvocationRecord ResultSubmission FinishedAdditional Provenance

Record Negotiation AckRecord Invocation AckRecord Result AckSubmission Finished AckAdditional Provenance Ack

Page 18: OPODIS'04 A protocol for recording provenance in service-oriented Grids Paul Groth, Michael Luck, Luc Moreau University of Southampton

OPODIS'04

PReP’s messages

ProposeReplyInvoke ResultRecord NegotiationRecord InvocationRecord ResultSubmission FinishedAdditional Provenance

Record Negotiation AckRecord Invocation AckRecord Result AckSubmission Finished AckAdditional Provenance Ack

Used for connecting provenance stores and for recording provenance about actors.

Page 19: OPODIS'04 A protocol for recording provenance in service-oriented Grids Paul Groth, Michael Luck, Luc Moreau University of Southampton

OPODIS'04

Provenance Store – An abstract state machine

Formalise the protocol by formalising the individual entities in the protocol

Know exactly how the Provenance Service responds to receipt of messages

Use to show a liveness property Something good will eventually

happen

Page 20: OPODIS'04 A protocol for recording provenance in service-oriented Grids Paul Groth, Michael Luck, Luc Moreau University of Southampton

OPODIS'04

Client and Service

State transition diagram Cannot formalise internals, only the

response to PReP Show Termination Property

Page 21: OPODIS'04 A protocol for recording provenance in service-oriented Grids Paul Groth, Michael Luck, Luc Moreau University of Southampton

OPODIS'04

Page 22: OPODIS'04 A protocol for recording provenance in service-oriented Grids Paul Groth, Michael Luck, Luc Moreau University of Southampton

OPODIS'04

What’s next

PReP Version 2Greater Flexibility

• In configuration and recorded infoSecurity

• non-repudiation• mutual authentication

Fault Tolerant Specification What if you have a lot of data?

Look at scalability

Page 23: OPODIS'04 A protocol for recording provenance in service-oriented Grids Paul Groth, Michael Luck, Luc Moreau University of Southampton

OPODIS'04

Conclusion

Provenance is important Provenance recording must be part of

the infrastructure. Standards. PReP provides is a first step. An implementation (PReServ) is

available.

Page 24: OPODIS'04 A protocol for recording provenance in service-oriented Grids Paul Groth, Michael Luck, Luc Moreau University of Southampton

OPODIS'04

http://www.pasoa.org

Where you can download [email protected]

The End