implementing marine xml for noaa observing data

17
Implementing Marine XML for NOAA Observing Data Nazila Merati and Eugene Burger NOAA/Pacific Marine Environmental Laboratory Seattle, WA

Upload: tory

Post on 19-Jan-2016

30 views

Category:

Documents


0 download

DESCRIPTION

Implementing Marine XML for NOAA Observing Data. Nazila Merati and Eugene Burger NOAA/Pacific Marine Environmental Laboratory Seattle, WA. Ocean observation systems (OOS) and data transformations. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Implementing Marine XML for NOAA Observing Data

Implementing Marine XML for NOAA Observing Data

Nazila Merati and Eugene Burger

NOAA/Pacific Marine Environmental Laboratory

Seattle, WA

Page 2: Implementing Marine XML for NOAA Observing Data

Ocean observation systems (OOS)and data transformations

• PMEL and its partners carry out several large scale ocean monitoring programs (TAO, FOCI, Tsunami, Argo) with hundreds of instruments and thousands of observations.

Page 3: Implementing Marine XML for NOAA Observing Data

Data transformations and data sharing

• Scientists spend a great majority of their time formatting data, transferring data and developing transfer protocols before starting data analysis and sharing.

Page 4: Implementing Marine XML for NOAA Observing Data

History of the project• In 2003, we went to the ESRI User Conference and heard that XML

would play an important role in the new ArcGIS architecture.• Also in 2003, NOAA IT and web developers began to discuss the

need for standards when using XML.• We found that others had started looking at XML standards and

defining XML standards for observational Marine data – XBTs and meteorological observations

• We identified a project at PMEL and NODC that we could use to test XML – Argo Profiling Floats.

Page 5: Implementing Marine XML for NOAA Observing Data

Test case– Argo Profiling Float Data• The Argo program is broad-scale

global ocean observing system of temperature and salinity

• Floats are designed to drift at a fixed pressure for a fixed period and then move to a profiling pressure. As they rise, they collect profiles of pressure, temperature and salinity on their way to the surface.

• Each float can produce up to 150 profiles during its life time

Page 6: Implementing Marine XML for NOAA Observing Data

Current Argo Profiling Float deployments

Page 7: Implementing Marine XML for NOAA Observing Data

Why use Argo Profiling Floats?

• Argo Profiling Floats were selected because they are a good example of a profiling data type, have only a few attributes associated with each profile and are integral to the ocean observing system architecture

• NODC’s data manager is interested in using XML as a method of data sharing between Argo Data Centers

• Integration of Argo Profile data into GIS will introduce the data to a new group of users that are more GIS savvy

Page 8: Implementing Marine XML for NOAA Observing Data

Project objectives• To work with NODC to identify key data sets and define

which file formats to work with • Using data specifications from example libraries, to

define optimal parameters for a MarineXML dictionary • To build schema that works the best with Floats• To test the schema, assess bandwidth versus file size

issues, and test feasibility of web transfers• To load data into ArcGIS 9.x and personal geodatabase • To test the metadata creator in ArcGIS for compatibility

with MarineXML schema

Page 9: Implementing Marine XML for NOAA Observing Data

Why use XML?

• Platform independent tool that makes data exchange and communication between organizations easier – its not just for science

• Extensible

• Data management - allows preservation of data, metadata, quality flags and edited data

• It can serve as the “basis of a data management framework”

Page 10: Implementing Marine XML for NOAA Observing Data

Different types of XML• Marine XML – different flavors – strong community component, been in

use since 2001• ESML – Earth Sciences Markup Language – works with netCDF, HDF,

good community backing• Borehole XML - good as it has a z-value, may be better as a descriptor of

actual location rather than data – boreholes are also more shallow that the float data, not quite sure what borehole data are stored as, mostly text files

• NcML – good open source standard, data already are stored as netCDF, it’s the marine standard for data storage, most translators are for gridded data

• Sensor ML – still written within the OGC specs, again more for descriptor, but not necessarily for data transfer

Page 11: Implementing Marine XML for NOAA Observing Data

How does this fit in with GIS?• Currently, PMEL has

ways of getting data into GIS that are great, but this still requires data formatting and the use of several different packages.

• The output is a shapefile to be used by ESRI products.

Page 12: Implementing Marine XML for NOAA Observing Data

ESRI and the geodatabase• Changes are happening in

the way GIS data are managed. We are moving from shapefiles and coverages to geodatabases for data management and data storage.

• Geodatabases can be useful if you are building data models for a specific industry.

Page 13: Implementing Marine XML for NOAA Observing Data

Geodatabases and data transformations• ArcGIS 9 has the ability to

take XML encapsulated data and transfer the data into the geodatabase.

• The Geodatabase XML format will allow you to import and export items and data to/from the geodatabase. Transfers include domains, rules and topologies and adding behaviors

Page 14: Implementing Marine XML for NOAA Observing Data

Proposed solution

Page 15: Implementing Marine XML for NOAA Observing Data

Potential problems

• netCDF to ncML or to MarineXML – no need to do a double jump

• Bandwidth and compression, XML may supersize data

• What if the geodatabase goes away?• Is MarineXML better than ESML?• Can it be applied to legacy data?

Page 16: Implementing Marine XML for NOAA Observing Data

Next steps

• Look at MarineXML specifications and meet with AODC folks next week. Identify key data sets to test and talk to ESRI about netCDF translators.

• Determine if the existing EPIC in-situ XML DTD and schemas can be modified to work with Argo Profiling Floats and can handle data.

• Test the new XSLT tools from ESRI and test transfer times – is this just as clunky as before?

Page 17: Implementing Marine XML for NOAA Observing Data

Questions and suggestions?