data conservancy and the us nsf datanet initiative 2010 jisc/cni conference july 1, 2010 sayeed...
TRANSCRIPT
![Page 1: Data Conservancy and the US NSF DataNet Initiative 2010 JISC/CNI Conference July 1, 2010 Sayeed Choudhury Johns Hopkins University](https://reader036.vdocuments.us/reader036/viewer/2022070306/5516091a550346d46f8b5f0e/html5/thumbnails/1.jpg)
Data Conservancy and the US NSF DataNet Initiative
2010 JISC/CNI ConferenceJuly 1, 2010
Sayeed ChoudhuryJohns Hopkins University
![Page 2: Data Conservancy and the US NSF DataNet Initiative 2010 JISC/CNI Conference July 1, 2010 Sayeed Choudhury Johns Hopkins University](https://reader036.vdocuments.us/reader036/viewer/2022070306/5516091a550346d46f8b5f0e/html5/thumbnails/2.jpg)
Difficult times…
• Sub-theme for this conference states:
• Policies, strategies, technologies and infrastructure to manage research and teaching data in a fast changing technological and economic environment
![Page 3: Data Conservancy and the US NSF DataNet Initiative 2010 JISC/CNI Conference July 1, 2010 Sayeed Choudhury Johns Hopkins University](https://reader036.vdocuments.us/reader036/viewer/2022070306/5516091a550346d46f8b5f0e/html5/thumbnails/3.jpg)
Difficult times…
• Sub-theme for this conference states:
• Policies, strategies, technologies and infrastructure to manage research and teaching data in a fast changing technological and economic environment
![Page 4: Data Conservancy and the US NSF DataNet Initiative 2010 JISC/CNI Conference July 1, 2010 Sayeed Choudhury Johns Hopkins University](https://reader036.vdocuments.us/reader036/viewer/2022070306/5516091a550346d46f8b5f0e/html5/thumbnails/4.jpg)
…not a rigid road map but principles of navigation. There is no one way to design cyberinfrastructure, but there are tools we can teach the designers to help them appreciate the true size of the solution space – which is often much larger than they may think, if they are tied into technical fixes for all problems.
![Page 5: Data Conservancy and the US NSF DataNet Initiative 2010 JISC/CNI Conference July 1, 2010 Sayeed Choudhury Johns Hopkins University](https://reader036.vdocuments.us/reader036/viewer/2022070306/5516091a550346d46f8b5f0e/html5/thumbnails/5.jpg)
Central points
• Infrastructure development occurs because of fast changing technological and economic environments
• Yet the words we associate typically with infrastructure include reliable, persistent, ubiquitous…stable
![Page 6: Data Conservancy and the US NSF DataNet Initiative 2010 JISC/CNI Conference July 1, 2010 Sayeed Choudhury Johns Hopkins University](https://reader036.vdocuments.us/reader036/viewer/2022070306/5516091a550346d46f8b5f0e/html5/thumbnails/6.jpg)
NSF DataNet
• Science and engineering research and education are increasingly digital and data-intensive
• New methods, management structures and technologies necessary
• NSF DataNet solicitation addresses challenge by creating exemplar data infrastructure organizations
![Page 7: Data Conservancy and the US NSF DataNet Initiative 2010 JISC/CNI Conference July 1, 2010 Sayeed Choudhury Johns Hopkins University](https://reader036.vdocuments.us/reader036/viewer/2022070306/5516091a550346d46f8b5f0e/html5/thumbnails/7.jpg)
NSF recent actions
• Five DataNet partners funded at $20 million each for 5 years – seed funding
• Data Conservancy and DataONE are first two awards – up to three more awards in next round
• Part of broader initiatives at NSF including requirement for data management plans and (separate) Johns Hopkins grant for feasibility study of open access repository
![Page 8: Data Conservancy and the US NSF DataNet Initiative 2010 JISC/CNI Conference July 1, 2010 Sayeed Choudhury Johns Hopkins University](https://reader036.vdocuments.us/reader036/viewer/2022070306/5516091a550346d46f8b5f0e/html5/thumbnails/8.jpg)
Data Curation
The Data Conservancy embraces a shared vision: data curation is a means to collect, organize, validate and preserve data so that scientists can find new ways to address the grand research challenges that face society.
![Page 9: Data Conservancy and the US NSF DataNet Initiative 2010 JISC/CNI Conference July 1, 2010 Sayeed Choudhury Johns Hopkins University](https://reader036.vdocuments.us/reader036/viewer/2022070306/5516091a550346d46f8b5f0e/html5/thumbnails/9.jpg)
Goal
The goal of Data Conservancy is to support new forms of inquiry and learning that address grand research challenges. The Data Conservancy will accomplish this goal through the creation, implementation and sustained management of an integrated and comprehensive data curation strategy.
![Page 10: Data Conservancy and the US NSF DataNet Initiative 2010 JISC/CNI Conference July 1, 2010 Sayeed Choudhury Johns Hopkins University](https://reader036.vdocuments.us/reader036/viewer/2022070306/5516091a550346d46f8b5f0e/html5/thumbnails/10.jpg)
PrinciplesOur strategy focuses on connection of systems into
infrastructure through a program informed by user-centered design and research, sustained through a portfolio of funding streams, and managed through a shared, coordinated governance structure.
Build on existing exemplar scientific projects, communities and virtual organizations that have deep engagement with citizen scientists and extensive experience with large-scale, distributed system development
![Page 11: Data Conservancy and the US NSF DataNet Initiative 2010 JISC/CNI Conference July 1, 2010 Sayeed Choudhury Johns Hopkins University](https://reader036.vdocuments.us/reader036/viewer/2022070306/5516091a550346d46f8b5f0e/html5/thumbnails/11.jpg)
Partner institutions• Johns Hopkins University (Lead institution)• Cornell University• DuraSpace• Marine Biological Laboratory• National Center for Atmospheric Research• National Snow and Ice Data Center• Portico• Tessella, Inc.• University of California Los Angeles• University of Illinois at Urbana-Champaign
![Page 12: Data Conservancy and the US NSF DataNet Initiative 2010 JISC/CNI Conference July 1, 2010 Sayeed Choudhury Johns Hopkins University](https://reader036.vdocuments.us/reader036/viewer/2022070306/5516091a550346d46f8b5f0e/html5/thumbnails/12.jpg)
Objectives
• Infrastructure research and development– Technical requirements
• Information science and computer science research– Scientific or user requirements
• Broader impacts– Educational requirements
• Sustainability– Business requirements
![Page 13: Data Conservancy and the US NSF DataNet Initiative 2010 JISC/CNI Conference July 1, 2010 Sayeed Choudhury Johns Hopkins University](https://reader036.vdocuments.us/reader036/viewer/2022070306/5516091a550346d46f8b5f0e/html5/thumbnails/13.jpg)
Domain coverage/methods• Multi-site user research methods are a blend of:
– Case study & domain comparisons– Depth & breadth– Local & global
Astronomy Earth Sciences Life Sciences Social Sciences
UCAR Task-based design and usability testing Use cases, data requirements, system recommendations
UCAR
UCLA Ethnography, virtual ethnography, oral histories Use cases, data requirements
Interviews, Surveys, Worksheets, Content analysis Curation requirements, taxonomy, metadata/provenance framework
UIUC
![Page 14: Data Conservancy and the US NSF DataNet Initiative 2010 JISC/CNI Conference July 1, 2010 Sayeed Choudhury Johns Hopkins University](https://reader036.vdocuments.us/reader036/viewer/2022070306/5516091a550346d46f8b5f0e/html5/thumbnails/14.jpg)
Data Framework
• Start with a common conceptualization that applies across scientific domains
• Exploit semantic technologies• Leverage existing work• Prototype the framework in target communities
– Iteratively refine, learn from experience– Demonstrate success, measured in terms of new
science
![Page 15: Data Conservancy and the US NSF DataNet Initiative 2010 JISC/CNI Conference July 1, 2010 Sayeed Choudhury Johns Hopkins University](https://reader036.vdocuments.us/reader036/viewer/2022070306/5516091a550346d46f8b5f0e/html5/thumbnails/15.jpg)
Common Conceptualization
Observations are the foundation of all scientific studies, and are the closest approximation to facts.
Wiens, J. A. (1992). Cambridge studies in ecology: The ecology of bird communities. Foundations and Patterns, 1; Processes and Variations, 2
![Page 16: Data Conservancy and the US NSF DataNet Initiative 2010 JISC/CNI Conference July 1, 2010 Sayeed Choudhury Johns Hopkins University](https://reader036.vdocuments.us/reader036/viewer/2022070306/5516091a550346d46f8b5f0e/html5/thumbnails/16.jpg)
Emergence
• Emergence: The Connected Lives of Ants, Brains, Cities, and Software by Steven Johnson
• The movement from low-level rules to higher-level sophistication is what we call emergence.
![Page 17: Data Conservancy and the US NSF DataNet Initiative 2010 JISC/CNI Conference July 1, 2010 Sayeed Choudhury Johns Hopkins University](https://reader036.vdocuments.us/reader036/viewer/2022070306/5516091a550346d46f8b5f0e/html5/thumbnails/17.jpg)
Data Model using OAI-ORE
![Page 18: Data Conservancy and the US NSF DataNet Initiative 2010 JISC/CNI Conference July 1, 2010 Sayeed Choudhury Johns Hopkins University](https://reader036.vdocuments.us/reader036/viewer/2022070306/5516091a550346d46f8b5f0e/html5/thumbnails/18.jpg)
Concerning Infrastructure
• Infrastructure is not about system building, but rather the rich, comprehensive set of human and technology interactions within the Data Conservancy
• Embrace the diversity of cultures
• Embrace the chaos before imposing order
![Page 19: Data Conservancy and the US NSF DataNet Initiative 2010 JISC/CNI Conference July 1, 2010 Sayeed Choudhury Johns Hopkins University](https://reader036.vdocuments.us/reader036/viewer/2022070306/5516091a550346d46f8b5f0e/html5/thumbnails/19.jpg)
Acknowledgements
• Carole Palmer (information science slides)
• Carl Lagoze (Data Framework slides)
• Tim DiLauro (OAI-ORE)
Office of Cyberinfrastructure DataNet Award #0830976
Office of Cyberinfrastructure EAGER Award #0948134