traceability, reproducibility, and scalability in integrated ecosystem assessments: july 2013 eco-op...
TRANSCRIPT
![Page 1: Traceability, reproducibility, and scalability in Integrated Ecosystem Assessments: July 2013 ECO-OP is supported by NSF Grant #0955649 PIs: Peter Fox](https://reader036.vdocuments.us/reader036/viewer/2022062322/56649edb5503460f94beb089/html5/thumbnails/1.jpg)
Traceability, reproducibility, and scalability in Integrated Ecosystem Assessments:
July 2013
ECO-OP is supported by NSF Grant #0955649PIs: Peter Fox (RPI) and Andrew Maffei (WHOI)
NEFSC Collaborators: Jon Hare and Mike Fogarty
Software programmer: Massimo Di StefanoInformatics and metadata: Stace Beaulieu
![Page 2: Traceability, reproducibility, and scalability in Integrated Ecosystem Assessments: July 2013 ECO-OP is supported by NSF Grant #0955649 PIs: Peter Fox](https://reader036.vdocuments.us/reader036/viewer/2022062322/56649edb5503460f94beb089/html5/thumbnails/2.jpg)
Traceability, reproducibility, and scalability in Integrated Ecosystem Assessments:
Adopting a provenance modelfor a collaborative report
July 2013
ECO-OP is supported by NSF Grant #0955649PIs: Peter Fox (RPI) and Andrew Maffei (WHOI)
NEFSC Collaborators: Jon Hare and Mike Fogarty
Software programmer: Massimo Di StefanoInformatics and metadata: Stace Beaulieu
![Page 3: Traceability, reproducibility, and scalability in Integrated Ecosystem Assessments: July 2013 ECO-OP is supported by NSF Grant #0955649 PIs: Peter Fox](https://reader036.vdocuments.us/reader036/viewer/2022062322/56649edb5503460f94beb089/html5/thumbnails/3.jpg)
Traceability, reproducibility, and scalability in Integrated Ecosystem Assessments:
Adopting a provenance modelfor a collaborative report
July 2013
![Page 4: Traceability, reproducibility, and scalability in Integrated Ecosystem Assessments: July 2013 ECO-OP is supported by NSF Grant #0955649 PIs: Peter Fox](https://reader036.vdocuments.us/reader036/viewer/2022062322/56649edb5503460f94beb089/html5/thumbnails/4.jpg)
Traceability, reproducibility, and scalability in Integrated Ecosystem Assessments:
Adopting a provenance modelfor a collaborative report
July 2013
Metadata for data and workflow provenance(i.e., the marine ecosystem indicators and the collaborative report)
![Page 5: Traceability, reproducibility, and scalability in Integrated Ecosystem Assessments: July 2013 ECO-OP is supported by NSF Grant #0955649 PIs: Peter Fox](https://reader036.vdocuments.us/reader036/viewer/2022062322/56649edb5503460f94beb089/html5/thumbnails/5.jpg)
Use Case:Northeast Shelf Large Marine Ecosystem
Ecosystem Status Report
“traceability, repeatability, explanation, verification, and validation” for ecosystem data and information products in the NEFSC Ecosystem Status Report (ESR)
Goal:
![Page 6: Traceability, reproducibility, and scalability in Integrated Ecosystem Assessments: July 2013 ECO-OP is supported by NSF Grant #0955649 PIs: Peter Fox](https://reader036.vdocuments.us/reader036/viewer/2022062322/56649edb5503460f94beb089/html5/thumbnails/6.jpg)
Page from 2009 ESR
Section on Climate Forcing
Figures available for download as PDF or image files –
but without access to data or metadata
![Page 7: Traceability, reproducibility, and scalability in Integrated Ecosystem Assessments: July 2013 ECO-OP is supported by NSF Grant #0955649 PIs: Peter Fox](https://reader036.vdocuments.us/reader036/viewer/2022062322/56649edb5503460f94beb089/html5/thumbnails/7.jpg)
Page from 2009 ESR
Section on Climate Forcing
Figures available for download as PDF or image files –
but without access to data or metadata
Note: NOAA directive forISO 19115 metadata, butthese are not sufficient to describe time-series indicators
![Page 8: Traceability, reproducibility, and scalability in Integrated Ecosystem Assessments: July 2013 ECO-OP is supported by NSF Grant #0955649 PIs: Peter Fox](https://reader036.vdocuments.us/reader036/viewer/2022062322/56649edb5503460f94beb089/html5/thumbnails/8.jpg)
![Page 9: Traceability, reproducibility, and scalability in Integrated Ecosystem Assessments: July 2013 ECO-OP is supported by NSF Grant #0955649 PIs: Peter Fox](https://reader036.vdocuments.us/reader036/viewer/2022062322/56649edb5503460f94beb089/html5/thumbnails/9.jpg)
Software design to track provenance
M. Di Stefano
![Page 10: Traceability, reproducibility, and scalability in Integrated Ecosystem Assessments: July 2013 ECO-OP is supported by NSF Grant #0955649 PIs: Peter Fox](https://reader036.vdocuments.us/reader036/viewer/2022062322/56649edb5503460f94beb089/html5/thumbnails/10.jpg)
Software design to track provenance
M. Di Stefano
![Page 11: Traceability, reproducibility, and scalability in Integrated Ecosystem Assessments: July 2013 ECO-OP is supported by NSF Grant #0955649 PIs: Peter Fox](https://reader036.vdocuments.us/reader036/viewer/2022062322/56649edb5503460f94beb089/html5/thumbnails/11.jpg)
PROV Data Modelhttp://www.w3.org/TR/prov-dm/W3C Recommendation 30 April 2013
Core Structures (types and relations)
![Page 12: Traceability, reproducibility, and scalability in Integrated Ecosystem Assessments: July 2013 ECO-OP is supported by NSF Grant #0955649 PIs: Peter Fox](https://reader036.vdocuments.us/reader036/viewer/2022062322/56649edb5503460f94beb089/html5/thumbnails/12.jpg)
PROV Data Modelhttp://www.w3.org/TR/prov-dm/W3C Recommendation 30 April 2013
Core Structures (types and relations)
Entity may be a single data product, or a chapter containing several data products
![Page 13: Traceability, reproducibility, and scalability in Integrated Ecosystem Assessments: July 2013 ECO-OP is supported by NSF Grant #0955649 PIs: Peter Fox](https://reader036.vdocuments.us/reader036/viewer/2022062322/56649edb5503460f94beb089/html5/thumbnails/13.jpg)
PROV-O: The PROV Ontology (expresses PROV-DM using OWL2)http://www.w3.org/TR/prov-o/
PROV Data Modelhttp://www.w3.org/TR/prov-dm/W3C Recommendation 30 April 2013
Core Structures (types and relations)
Entity may be a single data product, or a chapter containing several data products
![Page 14: Traceability, reproducibility, and scalability in Integrated Ecosystem Assessments: July 2013 ECO-OP is supported by NSF Grant #0955649 PIs: Peter Fox](https://reader036.vdocuments.us/reader036/viewer/2022062322/56649edb5503460f94beb089/html5/thumbnails/14.jpg)
http://ipython.org/Screenshot of IPython Notebook used to track both data and workflow provenance
![Page 15: Traceability, reproducibility, and scalability in Integrated Ecosystem Assessments: July 2013 ECO-OP is supported by NSF Grant #0955649 PIs: Peter Fox](https://reader036.vdocuments.us/reader036/viewer/2022062322/56649edb5503460f94beb089/html5/thumbnails/15.jpg)
http://ipython.org/Screenshot of IPython Notebook used to track both data and workflow provenance
Code inPython,Matlab,R, other
![Page 16: Traceability, reproducibility, and scalability in Integrated Ecosystem Assessments: July 2013 ECO-OP is supported by NSF Grant #0955649 PIs: Peter Fox](https://reader036.vdocuments.us/reader036/viewer/2022062322/56649edb5503460f94beb089/html5/thumbnails/16.jpg)
http://ipython.org/Screenshot of IPython Notebook used to track both data and workflow provenance
Code inPython,Matlab,R, other
![Page 17: Traceability, reproducibility, and scalability in Integrated Ecosystem Assessments: July 2013 ECO-OP is supported by NSF Grant #0955649 PIs: Peter Fox](https://reader036.vdocuments.us/reader036/viewer/2022062322/56649edb5503460f94beb089/html5/thumbnails/17.jpg)
http://ipython.org/Screenshot of IPython Notebook used to track both data and workflow provenance
Notebook can be shared, or output as script, HTML, PDF,other
![Page 18: Traceability, reproducibility, and scalability in Integrated Ecosystem Assessments: July 2013 ECO-OP is supported by NSF Grant #0955649 PIs: Peter Fox](https://reader036.vdocuments.us/reader036/viewer/2022062322/56649edb5503460f94beb089/html5/thumbnails/18.jpg)
PDF output of IPython Notebook with clickable links to data and code
![Page 19: Traceability, reproducibility, and scalability in Integrated Ecosystem Assessments: July 2013 ECO-OP is supported by NSF Grant #0955649 PIs: Peter Fox](https://reader036.vdocuments.us/reader036/viewer/2022062322/56649edb5503460f94beb089/html5/thumbnails/19.jpg)
PDF output of IPython Notebook with clickable links to data and code
![Page 20: Traceability, reproducibility, and scalability in Integrated Ecosystem Assessments: July 2013 ECO-OP is supported by NSF Grant #0955649 PIs: Peter Fox](https://reader036.vdocuments.us/reader036/viewer/2022062322/56649edb5503460f94beb089/html5/thumbnails/20.jpg)
Screenshot of csv file at GitHub
![Page 21: Traceability, reproducibility, and scalability in Integrated Ecosystem Assessments: July 2013 ECO-OP is supported by NSF Grant #0955649 PIs: Peter Fox](https://reader036.vdocuments.us/reader036/viewer/2022062322/56649edb5503460f94beb089/html5/thumbnails/21.jpg)
Screenshot of csv file at GitHub
Having access not only to the data that are plotted, but also to provenance metadata increases the (re-) usability of the data