Page 1: Improving long-term preservation EOS data by independently mapping HDF4 data objects

Improving long-term preservation EOS data by independently mapping HDF4 data objects

Mike Folk, Ruth Aydt, Peter Cao, Kent Yang
Ruth Duerr
Christopher Lynnes

Page 2: Improving long-term preservation EOS data by independently mapping HDF4 data objects

Phase 2: Productizing HDF4 Mapping schema and tools for deployment

Page 3: Improving long-term preservation EOS data by independently mapping HDF4 data objects

Annual HDF Briefing to ESDIS 3

Phase 2 tasks

A. Investigate integration of mapping schema with existing standards
B. Determine HDF-EOS2 requirements
C. Redesign the XML schema
D. Implement production quality HDF4 map writer
E. Develop demo HDF4 map reader and optional general reader
F. Develop test suite and validation utility for layouts and datatypes
G. Deploy tools at other NASA data centers.

April 1 2010

Page 4: Improving long-term preservation EOS data by independently mapping HDF4 data objects

HDF4/HDF-EOS2 mapping workflow

[Workflow diagram. HDF4 track: A. Investigate existing standard schema; C. Redesign schema; D. Implement writer; E. Implement demo reader (optional general reader); F. Optional validation utility; G. Deploy. HDF-EOS2 track: B. HDF-EOS2 analysis. Legend: an arrow from A to B means B depends on A.]

Page 5: Improving long-term preservation EOS data by independently mapping HDF4 data objects


Page 6: Improving long-term preservation EOS data by independently mapping HDF4 data objects

Task A: Investigate integration of mapping schema with existing standards

Page 7: Improving long-term preservation EOS data by independently mapping HDF4 data objects

Task A

• Status: complete except for report
• Results
  • Standards investigated included METS, PREMIS, ESML and NcML
  • None really meet our needs.
  • PREMIS is about the file but doesn’t address the contents.
  • MIX approach harmonizes with it.
  • Will try MIX approach, but it doesn’t
  • For contents, it will deviate from PREMIS
  • Elements will align with corresponding subset of PREMIS
  • We have content that is a stretch for PREMIS
• Conclusion: We will not adopt any particular standard, but will try to harmonize with them and leverage familiarity with other things, such as the use of the Common Data Model (CDM).

Page 8: Improving long-term preservation EOS data by independently mapping HDF4 data objects

Task B: Determine HDF-EOS2 requirements


Page 9: Improving long-term preservation EOS data by independently mapping HDF4 data objects

Categorize HDF-EOS2 data products

• Status: Complete
• Results:
  • Create a data pool
    • Data from GES DISC, NSIDC, LAADS, LP-DAAC and LaRC
  • Detailed description of sample data
  • Investigate HDF-EOS2 Swath, Grid and Point
  • Summarize the requirements
    • (next slide)

Page 10: Improving long-term preservation EOS data by independently mapping HDF4 data objects

Requirements for HDF-EOS2 objects

• Results (continued)
  • Report listing all options for adding HDF-EOS2 contents to the mapping file
  • Documents and reports under the wiki page: http://wiki.hdfgroup.org/MappingPhase2_TaskB

Page 11: Improving long-term preservation EOS data by independently mapping HDF4 data objects

Task C: Redesign Schema

Page 12: Improving long-term preservation EOS data by independently mapping HDF4 data objects

Task C: Redesign Schema

• Duration: Begin in January; Finish in June
• Assigned to Ruth
• Activities:
  1) Gather background information and knowledge
  2) Formulate overall schema design and identify corner cases
  3) Detailed Schema design
  4) Documentation

Page 13: Improving long-term preservation EOS data by independently mapping HDF4 data objects

Task D: Implement Writer

Page 14: Improving long-term preservation EOS data by independently mapping HDF4 data objects

Task D: Implement Writer

• Duration: Began in March; Finish in Sept
• Assigned to: Binh-Minh Ribler and Joe Lee
• Activities:
  1. Implement API for creating maps (Binh-Minh)
    a) Develop new writer requirements, based on new XML schema and additional deployment needs
    b) Design APIs
    c) Implement new features
    d) Document
  2. Implement writer (Joe)
    a) Design writer tool
    b) Implement writer
    c) Implement test suite for writer
    d) Document

Page 15: Improving long-term preservation EOS data by independently mapping HDF4 data objects

Status of Task D

1. Implement API for creating maps
  a) Requirements and design nearly complete
  b) Some implementation has been completed
  c) A bit behind schedule, but no threat to the overall schedule
2. Implement writer (Joe)
  a) Design process is underway
  b) On schedule for completion by September

Page 16: Improving long-term preservation EOS data by independently mapping HDF4 data objects

Task E: Implement demo reader

• Duration: Begin in June, complete in Sept
• Assigned to: Ruth Duerr
• Activities:
  1. Develop requirements, based on new schema and identification of additional deployment needs.
  2. Design reader, based on requirements, and from a review of the prototype design.
  3. Implement reader.
  4. Document reader.
  5. Test reader on EOS file “zoo”
  6. Deposit reader, documentation, and tests in open source repository, probably SourceForge.
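The idea behind a map-driven reader can be sketched roughly as follows. This is only an illustration under stated assumptions, not the project's reader or schema: the element and attribute names (`HDF4Map`, `Array`, `byteStream`, `offset`, `nBytes`, `datatype`, `byteOrder`) and the function `read_array` are hypothetical. The sketch assumes the map records where each object's raw bytes sit in the file, so the values can be recovered without the HDF4 library.

```python
import struct
import xml.etree.ElementTree as ET

# Hypothetical map document. The element and attribute names are
# illustrative assumptions, not the project's actual XML schema.
EXAMPLE_MAP = """
<HDF4Map>
  <Array name="temperature" datatype="int16" byteOrder="bigEndian">
    <byteStream offset="4" nBytes="6"/>
  </Array>
</HDF4Map>
"""

# Assumed datatype names mapped to struct format characters.
FORMAT_CHARS = {"int16": "h", "int32": "i", "float32": "f"}


def read_array(map_xml, data_path, name):
    """Return the values of the named array using only the map and raw file bytes."""
    root = ET.fromstring(map_xml)
    for array in root.iter("Array"):
        if array.get("name") != name:
            continue
        fmt_char = FORMAT_CHARS[array.get("datatype")]
        prefix = ">" if array.get("byteOrder") == "bigEndian" else "<"
        raw = b""
        with open(data_path, "rb") as f:
            # An object may be stored in several pieces; concatenate them in order.
            for stream in array.iter("byteStream"):
                f.seek(int(stream.get("offset")))
                raw += f.read(int(stream.get("nBytes")))
        count = len(raw) // struct.calcsize(prefix + fmt_char)
        return list(struct.unpack(f"{prefix}{count}{fmt_char}", raw))
    raise KeyError(name)
```

A call such as `read_array(EXAMPLE_MAP, "granule.hdf", "temperature")` (file name hypothetical) would decode the bytes the map points at. A real reader would also need to handle whatever storage layouts and datatypes the redesigned schema describes.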

Page 17: Improving long-term preservation EOS data by independently mapping HDF4 data objects

Task F: Implement validation utility

• Optional task
• No plans currently to perform

Page 18: Improving long-term preservation EOS data by independently mapping HDF4 data objects

Task G: Deploy

• Duration: Begin in Jan 2010, complete in April
• Assigned to: NSIDC and GES DISC
• Activities:
  • GES DISC
    • Incorporate into the existing archive ingest system
    • Manage the retrofit into existing metadata files
  • NSIDC
    • Implementation effort for the V0 data
    • Support implementation in NSIDC’s ECS system
  • Other ESDCs will be encouraged to join in the effort, but it is assumed that deployment to other centers will mostly occur after the project.