rolling deck to repository (r2r) motivation may 4, 2009 rolling deck to repository a direct pipeline...
TRANSCRIPT
May 4, 2009
“Rolling Deck to Repository”A direct pipeline from ship to archive for
oceanographic research
Suzanne Carbotte, Robert Arko, Vicki Ferrini (LDEO)
Stephen Miller, Karen Stocks (SIO)
Cynthia Chandler, Andrew Maffei (WHOI)
Introduction
May 4, 2009
UNOLS fleet - Primary Resource for Ocean Research
UNOLS fleet - Primary Resource for Ocean Research
• Collect observational data for a broad range of science applications, e.g.– impacts of global change on ocean chemistry and ecosystems– ocean circulation– volcanism and earthquake processes at plate boundaries
• >30 yr history supporting ocean science- time series of ocean obs. and global exploration
• Great diversity of data• 22 ships in UNOLS fleet+ 5 USAP&USCG
– global/regional/coastal
• Collect observational data for a broad range of science applications, e.g.– impacts of global change on ocean chemistry and ecosystems– ocean circulation– volcanism and earthquake processes at plate boundaries
• >30 yr history supporting ocean science- time series of ocean obs. and global exploration
• Great diversity of data• 22 ships in UNOLS fleet+ 5 USAP&USCG
– global/regional/coastal
May 4, 2009
Spectrum of Data
Research expeditions involve a variety of data collection activities:
• National facility (eg NDSF, Seismic, OBSIP)
• Science party instrumentation (eg current meter, sediment traps, chemical/physical probes etc)
• Operational/Underway (eg bathymetry, MET, ADCP, subbottom etc)
May 4, 2009
Operations data streams are of high value
Operations data streams are of high value
Case studies• 1. PhysO measurements (eg flourescence,
salinity) provide crucial ground truth for satellite observations for global change
• 2. Bathymetry- many uses but coverage is sparse -all data contributes usefully to building global synthesis
• 3. MET data -verification and air-sea interactions
Case studies• 1. PhysO measurements (eg flourescence,
salinity) provide crucial ground truth for satellite observations for global change
• 2. Bathymetry- many uses but coverage is sparse -all data contributes usefully to building global synthesis
• 3. MET data -verification and air-sea interactions
May 4, 2009
Case study: Physical Oceanography
date latitude longitude temp
19950109 22.4827 61.1847 24.406
19950109 22.4832 61.1832 24.598
19950109 22.4832 61.1832 24.595
19950109 21.8330 62.4000 24.740
19950109 21.8328 62.3998 24.667
19950110 21.1830 63.5502 24.914
19950110 21.1830 63.5502 24.921
19950110 21.1827 63.5502 24.908
19950110 20.5665 64.6667 24.857
19950110 20.5673 64.6662 24.858
19950111 19.8830 65.8835 24.884
19950111 19.8833 65.8832 24.959
19950111 19.1665 67.1655 24.696
19950112 19.1427 67.1757 24.678
Satellite Calibration and Validation ~ Sea Surface TemperatureNOAA / NASA AVHRR SST in situ SST from Thomas Thompson
date lat lon temp
19950109 22.48 61.18 24.41
19950109 22.48 61.18 24.60
19950109 22.48 61.18 24.60
19950109 21.83 62.40 24.74
19950109 21.83 62.40 24.67
19950110 21.18 63.55 24.91
19950110 21.18 63.55 24.92
19950110 21.18 63.55 24.91
19950110 20.57 64.67 24.86
19950110 20.57 64.67 24.86
19950111 19.88 65.88 24.88
19950111 19.86 65.88 24.96
19950111 19.17 67.17 24.70
19950111 19.50 67.18 24.68
May 4, 2009
Case study: Ocean colorSatellite Calibration and Validation ~ Ocean color
SeaWiFS chlorophyll In situ chlorophyll (R/V Revelle)
date lon lat Chl-a
20020119 -178.8 -50.7 0.94
20020119 -177.9 -51.5 0.97
20020121 -177.4 -51.9 2.22
20020121 -175.5 -53.3 4.26
20020122 -175.0 -53.7 2.55
20020122 -173.5 -54.5 2.98
20020122 -171.6 -55.5 2.50
20020122 -171.5 -55.6 1.11
20020122 -171.4 -55.7 0.98
20020122 -171.4 -55.7 1.13
20020122 -171.2 -55.7 0.75
20020122 -171.0 -55.6 1.24
20020122 -171.1 -55.6 0.86
20020122 -171.3 -55.5 0.85
20020122 -171.4 -55.5 0.71
May 4, 2009
Case Study: Seafloor Bathymetry
Tsunami modelling
Ocean mixing/bottom currents
Maritime Navigation/Cables
Benthic habitats
May 4, 2009
Case Study: MET data and Air-Sea FluxesCase Study: MET data and Air-Sea FluxesAvailable flux products can differ by ~50Wm-2, particularly in core of the ACC -> largest source of error in closing the upper ocean heat budget for the Southern Ocean (e.g. Dong et al., 2007)
Year-round underway shipboard meteorological observations (e.g. the LM Gould Drake Passage transect at right) give opportunity to improve flux fields and resolve small-scales.
Provided by Janet Sprintall
May 4, 2009
The global oceans are vast, ocean exploration is expensive and slow, and after 50 years of academic investigation much remains unexplored
- All data of high value for preservation
The Challenge and the ObjectiveThe Challenge and the Objective
May 4, 2009
The Challenge and the ObjectiveThe Challenge and the Objective21st century research cruises are staffed by
multiple investigators and yield heterogeneous data sets
21st century research cruises are staffed by multiple investigators and yield heterogeneous data sets
VERTIGO project KM0414 ALOHA cruise sampling activities
R/V Kilo Moana (University of Hawaii Marine
Center)
May 4, 2009
The Challenge and the ObjectiveThe Challenge and the Objective
To make data publicly available to support data reuse and the published literature and enable informed policy decision making.
May 4, 2009
NSF data policies require data preservation
May 4, 2009
VISION
“Our strategic vision is a digital scientific data universe in which data creation, collection, documentation, analysis, preservation, and dissemination can be appropriately, reliably, and readily managed thereby enhancing the return on our nation’s research and development investment by ensuring that digital data realize their full potential as catalysts for progress in our global information society”
May 4, 2009
NOAA National Data Centers
NOAA National Data Centers
National Facility Ops Data Centers eg
Seismic (ASP), USAP,OBSIP (IRIS)
National Facility Ops Data Centers eg
Seismic (ASP), USAP,OBSIP (IRIS)
Program and Disciplinary DC (eg MARGINS,
BCODMO others)
Program and Disciplinary DC (eg MARGINS,
BCODMO others)
Operator/Institutional Archives
Operator/Institutional Archives
Ships and Facility Ops,
(e.g. OBSIP, NDSFSeismic)
Ships and Facility Ops,
(e.g. OBSIP, NDSFSeismic)
ScienceParty
ScienceParty
No comprehensive data management
WEAK LINKS
May 4, 2009
More Challenges
• No systematic archiving of data collected• Obtaining data for later re-use difficult to
impossible• Heterogeneous data collection across fleet. • Data documentation minimal and difficult• Little QA/QC- conducted on operator-by-
operator basis and has varied historically depending on operator priorities/capability
May 4, 2009
Migrate all operational shipboard data to long term repositories through R2R “gateway”
Assess quality, provide feedback
Rolling Deck to Repository
May 4, 2009
NOAA National DCProgram/Disciplinary
DC
NOAA National DCProgram/Disciplinary
DCShipsShips R2RR2R
R2R Gateway for Operations/Underway Data
NOAA National DCProgram/Disciplinary
DC
NOAA National DCProgram/Disciplinary
DCShipsShips Science PartyScience Party ??
May 4, 2009
R2R Guiding Principles
• NSF supported research vessels produce an enormous volume and diversity of scientific data each year. R2R proposes a fundamental transformation of data management in the fleet, providing a “direct pipeline” for the routine underway data and documentation from each cruise to flow from the operating institution to a central shoreside repository.
• Working directly with the ship scheduling offices and technicians will ensure more complete and consistent data acquisition, quality control, archival, and dissemination.
May 4, 2009
What will R2R do?
Operating institutions upload data
R2R Team:
1. Harvest data and metadata
2. Assess quality
3. Migrate standard data to national data centers and disciplinary data assembly centers (eg SAMOS, JCADCP, GMRT)
4. Provide long term R2R repository forCruise metadataNavigation“Orphan” data products without national data center home
May 4, 2009
Who could participate in R2R
UNOLS 61 member institutions
NOAA?Coast Guard?Polar Programs?
Archive in National Data CentersNOAA NGDC, NODC
May 4, 2009
Benefits to Stakeholders: Operators, Scientists, NSF
• Facilitate data preservation and dissemination (and free operators from archival duty)
• Standardize data collection and improve data documentation
• Assess data quality and provide feedback to improve data collection
• Standardization will enable integration of data sets across ships and time - syntheses and global compilations
• Facilitate cruise planning, permits, clearances
May 4, 2009
Research Fleet
•Mobile platforms •Short term obs•Global range•Ocean exploration
Data preservation is needed across infrastructure for ocean science
OOI•~Stationary platforms •Long-term obs•Regional/Local range
IODP•Sampling•Deep time