A Services Oriented Architecture for Water Resources Data

David R. Maidment, Center for Research in Water Resources, University of Texas at Austin. EPA Storet Conference, Austin, TX, Nov 28, 2007.

Page 1: A Services Oriented Architecture for Water Resources Data

A Services Oriented Architecture for Water Resources Data

David R. Maidment

Center for Research in Water Resources

University of Texas at Austin

EPA Storet Conference, Austin, TX, Nov 28, 2007

Page 2: A Services Oriented Architecture for Water Resources Data

Collaborators

• University of Texas at Austin: Tim Whiteaker, Stephanie Johnson

• San Diego Supercomputer Center: Ilya Zaslavsky, David Valentine, Tom Whitenack

• Utah State University: David Tarboton, Jeff Horsburgh, Kim Schreuders

• Drexel University: Michael Piasecki, Bora Beran, Yoori Choi

• University of South Carolina: Jon Goodall

Page 3: A Services Oriented Architecture for Water Resources Data

A Services Oriented Architecture for Water Resources Data

• CUAHSI and WATERS
• CUAHSI water web services
• WaterML and WQX
• Web Services for TMDL

Page 4: A Services Oriented Architecture for Water Resources Data

A Services Oriented Architecture for Water Resources Data

• CUAHSI and WATERS
• CUAHSI water web services
• WaterML and WQX
• Web Services for TMDL

Page 5: A Services Oriented Architecture for Water Resources Data

What is CUAHSI?

• CUAHSI: Consortium of Universities for the Advancement of Hydrologic Science, Inc.

• Formed in 2001 as a legal entity

• Program office in Washington (5 staff)

• NSF supports CUAHSI to develop infrastructure and services to advance hydrologic science in US universities

[Diagram: National Science Foundation Geosciences Directorate (Ocean Sciences, Earth Sciences, Atmospheric Sciences); UCAR and Unidata; CUAHSI and HIS]

Page 6: A Services Oriented Architecture for Water Resources Data

CUAHSI Member Institutions

115 US Universities as of November 2007

Page 7: A Services Oriented Architecture for Water Resources Data

HIS Team and its Cyberinfrastructure Partners

[Diagram centered on CUAHSI HIS]

• HIS Team: Texas, SDSC, Utah, Drexel, Duke

• Supercomputer Centers: NCSA, TACC

• Domain Sciences: Unidata, NCAR, LTER, CZEN, GEON

• Government: USGS, EPA, NCDC

• Industry: ESRI, Kisters, Microsoft

Page 8: A Services Oriented Architecture for Water Resources Data

WATERS Network Testbed Sites

Page 9: A Services Oriented Architecture for Water Resources Data

WATERS Network Information System

[Diagram labels: HIS Team, WATERS Testbed]

NSF has funded work at 11 testbed sites, each with its own science agenda. HIS supplies the common information system.

Page 10: A Services Oriented Architecture for Water Resources Data

Hydrologic Information Server Deployment

• National Hydrologic Information Server, San Diego Supercomputer Center: metadata for national datasets (NWIS, Storet, Snotel)

• WATERS testbed server

Provides access to 1246 sites in 16 observation networks

Page 11: A Services Oriented Architecture for Water Resources Data

Hydrologic Information Server

• Microsoft SQL Server relational database: observations data, geospatial data

• WaterOneFlow services: GetSites, GetSiteInfo, GetVariables, GetVariableInfo, GetValues

• DASH (data access system for hydrology)

• ArcGIS Server

Page 12: A Services Oriented Architecture for Water Resources Data

Definition

The CUAHSI Hydrologic Information System (HIS) is a geographically distributed network of data sources and functions that are integrated using a web services architecture so that they operate as a connected whole.

Page 13: A Services Oriented Architecture for Water Resources Data

A Services Oriented Architecture for Water Resources Data

• CUAHSI and WATERS
• CUAHSI water web services
• WaterML and WQX
• Web Services for TMDL

Page 14: A Services Oriented Architecture for Water Resources Data

Point Observations Information Model

• Data Source: Utah State Univ
• Network: Little Bear River
• Sites: Little Bear River at Mendon Rd
• Variables: Dissolved Oxygen
• Values {Value, Time, Qualifier, Offset}: 9.78 mg/L, 1 October 2007, 6PM

• A data source operates an observation network
• A network is a set of observation sites
• A site is a point location where one or more variables are measured
• A variable is a property describing the flow or quality of water
• A value is an observation of a variable at a particular time
• A qualifier is a symbol that provides additional information about the value
• An offset allows specification of measurements at various depths in water

http://www.cuahsi.org/his/webservices.html

GetSites

GetSiteInfo

GetVariables

GetVariableInfo

GetValues
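
The information model and the five query functions can be sketched in a few lines of Python. This is only an in-memory illustration under stated assumptions, not the WaterOneFlow implementation: the class names, method signatures, and the site and variable codes (`MENDON`, `DO`) are hypothetical; only the example observation comes from the slide.

```python
from dataclasses import dataclass, field
from datetime import datetime
from typing import Optional


@dataclass
class Value:
    value: float
    time: datetime
    qualifier: Optional[str] = None  # symbol giving extra information about the value
    offset: Optional[float] = None   # e.g. depth at which the measurement was taken


@dataclass
class Variable:
    code: str
    name: str
    units: str


@dataclass
class Site:
    code: str
    name: str
    series: dict = field(default_factory=dict)  # variable code -> list of Value


class Network:
    """In-memory stand-in for one observation network (hypothetical API)."""

    def __init__(self, source, name):
        self.source = source
        self.name = name
        self.sites = {}
        self.variables = {}

    def add_observation(self, site, variable, value):
        stored = self.sites.setdefault(site.code, site)
        self.variables.setdefault(variable.code, variable)
        stored.series.setdefault(variable.code, []).append(value)

    def get_sites(self):
        return list(self.sites.values())

    def get_site_info(self, site_code):
        site = self.sites[site_code]
        return {"site": site.name, "variables": sorted(site.series)}

    def get_variables(self):
        return list(self.variables.values())

    def get_variable_info(self, variable_code):
        return self.variables[variable_code]

    def get_values(self, site_code, variable_code, start, end):
        values = self.sites[site_code].series.get(variable_code, [])
        return [v for v in values if start <= v.time <= end]


network = Network("Utah State Univ", "Little Bear River")
mendon = Site("MENDON", "Little Bear River at Mendon Rd")  # hypothetical site code
oxygen = Variable("DO", "Dissolved Oxygen", "mg/L")        # hypothetical variable code
network.add_observation(mendon, oxygen, Value(9.78, datetime(2007, 10, 1, 18, 0)))

october_first = network.get_values(
    "MENDON", "DO", datetime(2007, 10, 1), datetime(2007, 10, 2)
)
```

Here `october_first` holds the single dissolved oxygen observation from the slide.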

Page 15: A Services Oriented Architecture for Water Resources Data

CUAHSI Observations Data Model

http://www.cuahsi.org/his/odm.html

Page 16: A Services Oriented Architecture for Water Resources Data

WaterML and WaterOneFlow

[Diagram: a client sends locations, variable codes, and date ranges to the WaterOneFlow web service (GetSiteInfo, GetVariableInfo, GetValues); data from the data repositories (STORET, NAM, NWIS) is extracted, transformed, and loaded, and is returned to the client as WaterML]

• WaterML is an XML language for communicating water data
• WaterOneFlow is a set of web services based on WaterML

Page 17: A Services Oriented Architecture for Water Resources Data

WaterOneFlow

• Set of query functions
• Returns data in WaterML

Ilya Zaslavsky and David Valentine, SDSC
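
To show what "returns data in WaterML" means for a client, here is a minimal sketch of reading a time series from an XML response. The sample document is a simplified, WaterML-like illustration: its element names are assumptions made for this example, and the actual schema uses XML namespaces and carries much more metadata.

```python
import xml.etree.ElementTree as ET

# Simplified, WaterML-like sample (element names are illustrative assumptions).
SAMPLE = """\
<timeSeriesResponse>
  <timeSeries>
    <sourceInfo><siteName>Little Bear River at Mendon Rd</siteName></sourceInfo>
    <variable>
      <variableName>Dissolved Oxygen</variableName>
      <units>mg/L</units>
    </variable>
    <values>
      <value dateTime="2007-10-01T18:00:00">9.78</value>
    </values>
  </timeSeries>
</timeSeriesResponse>
"""


def parse_time_series(xml_text):
    """Extract site, variable, units, and (time, value) pairs from the sample format."""
    root = ET.fromstring(xml_text)
    series = root.find("timeSeries")
    return {
        "site": series.findtext("sourceInfo/siteName"),
        "variable": series.findtext("variable/variableName"),
        "units": series.findtext("variable/units"),
        "values": [
            (v.get("dateTime"), float(v.text))
            for v in series.iterfind("values/value")
        ],
    }


parsed = parse_time_series(SAMPLE)
```

Because every source answers in the same format, one parser like this serves all of them.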

Page 18: A Services Oriented Architecture for Water Resources Data

Data Heterogeneity

• Syntactic mediation
– Heterogeneity of format
– Use WaterML to get data into the same format

• Semantic mediation
– Heterogeneity of meaning
– Each water data source uses its own vocabulary
– Match these up with a common controlled vocabulary
– Make standard scientific data queries and have these automatically translated into specific queries on each data source
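
Semantic mediation can be sketched as a lookup from a shared concept to each source's own term, followed by translation of one generic query into source-specific queries. The vocabulary table, the site codes, and the query dictionary layout below are assumptions for illustration only; they are not the CUAHSI controlled vocabulary.

```python
# Illustrative mapping from a shared concept to each source's own vocabulary.
CONTROLLED_VOCABULARY = {
    "dissolved oxygen": {
        "NWIS": "00300",
        "STORET": "Dissolved oxygen (DO)",
    },
    "streamflow": {
        "NWIS": "00060",
    },
}


def translate_query(concept, site_codes, start, end):
    """Turn one generic query into one source-specific query per data source.

    site_codes maps a source name to that source's code for the site of interest.
    Sources that do not measure the concept, or have no site code, are skipped.
    """
    terms = CONTROLLED_VOCABULARY[concept.lower()]
    queries = []
    for source, variable in sorted(terms.items()):
        if source in site_codes:
            queries.append({
                "source": source,
                "site": site_codes[source],
                "variable": variable,
                "start": start,
                "end": end,
            })
    return queries


# Hypothetical site codes for the same location in two systems.
queries = translate_query(
    "Dissolved Oxygen",
    {"NWIS": "SITE-1", "STORET": "SITE-A"},
    "2007-10-01",
    "2007-10-31",
)
```

The user asks once for "Dissolved Oxygen"; the mediator produces one query per source, each in that source's vocabulary.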

Page 19: A Services Oriented Architecture for Water Resources Data

Objective

• Search multiple heterogeneous data sources simultaneously regardless of semantic or structural differences between them

What we used to do …..

[Diagram: separate request and return exchanges with each data source (NWIS, NARR, NAWQA, NAM-12)]

Michael Piasecki, Drexel University

Page 20: A Services Oriented Architecture for Water Resources Data

Semantic Mediator

What we are doing now …..

[Diagram: a single generic request goes to the semantic mediator, which issues GetValues calls to the data sources (NWIS, NAWQA, NARR, HODM)]

Michael Piasecki, Drexel University

Page 21: A Services Oriented Architecture for Water Resources Data

Hydroseek

http://www.hydroseek.org

Supports search by location and type of data across multiple observation networks including NWIS and Storet

Bora Beran, Drexel

Page 22: A Services Oriented Architecture for Water Resources Data

HydroTagger

Ontology: A hierarchy of concepts

Each Variable in your data is connected to a corresponding Concept
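
A concept hierarchy with tagged variables might look like the sketch below. The concepts, network names, and variable codes are made up for illustration and are not taken from the HydroTagger ontology; the point is that a search on a broad concept also finds variables tagged with narrower ones.

```python
# child concept -> parent concept (a tiny, made-up hierarchy)
PARENT = {
    "Nutrients": "Water Quality",
    "Nitrogen": "Nutrients",
    "Phosphorus": "Nutrients",
    "Dissolved Oxygen": "Water Quality",
}

# (network, variable code) -> concept: the "tags" (hypothetical names)
TAGS = {
    ("NetworkA", "NO3"): "Nitrogen",
    ("NetworkB", "TP"): "Phosphorus",
    ("NetworkB", "DO"): "Dissolved Oxygen",
}


def is_a(concept, ancestor):
    """True if concept equals ancestor or lies below it in the hierarchy."""
    while concept is not None:
        if concept == ancestor:
            return True
        concept = PARENT.get(concept)
    return False


def variables_for(concept):
    """All tagged variables whose concept falls under the given concept."""
    return sorted(var for var, tagged in TAGS.items() if is_a(tagged, concept))


nutrient_variables = variables_for("Nutrients")
```

Searching for "Nutrients" returns both the nitrogen and the phosphorus variable, even though neither network uses that word.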

Page 23: A Services Oriented Architecture for Water Resources Data

HIS to Google Earth

developed by Peter Fitch, CSIRO, Australia

http://www.watersnet.org/wtbs/ODMKMLGatway.html

A web application housed in Canberra, Australia, that operates over the WATERS Network data services

Page 24: A Services Oriented Architecture for Water Resources Data

A Services Oriented Architecture for Water Resources Data

• CUAHSI and WATERS
• CUAHSI water web services
• WaterML and WQX
• Web Services for TMDL

Page 25: A Services Oriented Architecture for Water Resources Data
Page 26: A Services Oriented Architecture for Water Resources Data

The biggest difference is that the ODM concept of a time series data value (orange line) is derived from two sources: field activity and analysis. This design is intended to simplify data management.

Page 27: A Services Oriented Architecture for Water Resources Data

[Diagram labels: GetSites, GetSiteInfo; SiteInfo, Series]

Site and Series are captured in the WQX model. An activity can produce multiple results. This differs from the ODM model, where a series contains the results.

Page 28: A Services Oriented Architecture for Water Resources Data

[Diagram labels: GetValues; DataValues, Method, LabMethod; SiteInfo, Series]

GetValues would access all information (by design). Results captures multiple ODM concepts: DataValues, Method, and LabMethod.

Page 29: A Services Oriented Architecture for Water Resources Data

WaterOneFlow WQX Comparison

• WebService
– WaterML: Single WebService with basic methods. Designed for single calls to return the needed information set.
– WQX: Multiple WebServices each containing multiple methods for querying information.

• XML
– WaterML: Objects/Components consistent across methods.
– WQX: Consistent, but users may need to make additional calls to methods to make an information set.

• Multiple Result
– WaterML: No web service method.
– WQX: Supported.

• Time Series
– WaterML: Basis of design.
– WQX: Not supported. Multiple results are the time series.

• Spatial Query
– WaterML: No spatial query.
– WQX: Spatial queries by bounding box and HUC code.
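
Where a service offers no spatial query, a client can apply a bounding box itself to the site list it gets back. The sketch below assumes sites arrive as dictionaries with `lat` and `lon` keys; that layout, the site codes, and the coordinates are made up for illustration.

```python
def in_bounding_box(site, west, south, east, north):
    """True if the site's longitude/latitude fall inside the box (inclusive)."""
    return west <= site["lon"] <= east and south <= site["lat"] <= north


# Made-up sites and coordinates, for illustration only.
sites = [
    {"code": "A", "lat": 41.7, "lon": -111.9},
    {"code": "B", "lat": 28.3, "lon": -97.3},
    {"code": "C", "lat": 30.3, "lon": -97.7},
]

# Keep only sites inside a box drawn around the second and third site.
selected = [
    s["code"]
    for s in sites
    if in_bounding_box(s, west=-98.0, south=28.0, east=-97.0, north=31.0)
]
```

This simple test assumes the box does not cross the 180° meridian.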

Page 30: A Services Oriented Architecture for Water Resources Data

A Services Oriented Architecture for Water Resources Data

• CUAHSI and WATERS
• CUAHSI water web services
• WaterML and WQX
• Web Services for TMDL

Page 31: A Services Oriented Architecture for Water Resources Data

Data Access System for Hydrology (DASH)

• Mission River (Water Quality Management Segment 2004)

• USGS flow gaging station at Refugio, Texas

• Web service to TCEQ

• Web service to USGS

• DASH map interface to associate gaging station with river segment

Page 32: A Services Oriented Architecture for Water Resources Data

Web Services Spreadsheet for Load Duration Curves

Web service for bacterial data

Web service for flow data

Page 33: A Services Oriented Architecture for Water Resources Data

Flow Duration Curve

Mission River at Refugio, TX, derived from USGS web service

[Plot: flow in cfs]
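
A flow duration curve ranks flows from highest to lowest and plots each against the percentage of time it is equaled or exceeded. The sketch below uses the Weibull plotting position rank / (n + 1), which is one common choice and an assumption here, since the slide does not say how the spreadsheet computes it. The flow values are made up; they are not Mission River data.

```python
def flow_duration_curve(flows):
    """Return (percent of time exceeded, flow) pairs, highest flow first.

    Uses the Weibull plotting position: 100 * rank / (n + 1).
    """
    ranked = sorted(flows, reverse=True)
    n = len(ranked)
    return [(100.0 * rank / (n + 1), q) for rank, q in enumerate(ranked, start=1)]


# Made-up daily flows in cfs, for illustration only.
daily_flows_cfs = [12.0, 3.5, 150.0, 40.0, 8.0, 22.0, 5.0, 75.0, 1.2]
curve = flow_duration_curve(daily_flows_cfs)
```

In practice the input would be the daily flow record returned by the flow web service.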

Page 34: A Services Oriented Architecture for Water Resources Data

Load Duration Curve (Fecal Coliform)

[Plot: load in coliform units per day]

TCEQ staff comment: “This tool produces $20,000 worth of graphs in 5 minutes”
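
A load duration curve multiplies each flow on the flow duration curve by a concentration to give a load per day. The sketch below is an assumption about the general method, not the spreadsheet's actual formulas: it takes flow in cfs and concentration in colony-forming units per 100 mL, and the criterion of 200 per 100 mL and the flow values are hypothetical.

```python
# 1 cubic foot = 28.316846592 L; 86400 s per day; 10 units of 100 mL per litre.
CFS_TO_100ML_PER_DAY = 28.316846592 * 86400 * 10


def observed_load(flow_cfs, concentration_per_100ml):
    """Bacterial load in colony-forming units per day."""
    return flow_cfs * concentration_per_100ml * CFS_TO_100ML_PER_DAY


def load_duration_curve(flows_cfs, criterion_per_100ml):
    """Return (percent of time exceeded, allowable load per day) pairs."""
    ranked = sorted(flows_cfs, reverse=True)
    n = len(ranked)
    return [
        (100.0 * rank / (n + 1), observed_load(q, criterion_per_100ml))
        for rank, q in enumerate(ranked, start=1)
    ]


# Made-up flows (cfs) and a hypothetical criterion of 200 per 100 mL.
allowable = load_duration_curve([12.0, 3.5, 150.0, 40.0], criterion_per_100ml=200.0)
```

Sampled loads, computed with `observed_load` from paired flow and bacteria measurements, can then be plotted against this curve to see under which flow conditions the criterion is exceeded.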

Page 35: A Services Oriented Architecture for Water Resources Data

Conclusion: Web services work!

The CUAHSI Hydrologic Information System (HIS) is a geographically distributed network of hydrologic data sources and functions that are integrated using a web services architecture so that they function as a connected whole.

For more information: http://www.cuahsi.org/his.html