michener-institutional and subject-specific data repositories-nfdp13
DESCRIPTION
Presentation by Bill Michener asking whether Institutional and Subject-Specific Data Repositories can Co-Exist given as a 'provocation' in the final panel session at the Now and Future of Data Publishing Symposium, 22 May 2013, Oxford, UKTRANSCRIPT
Can Institutional and Subject-Specific Data Repositories Co-Exist?
William MichenerUniversity LibrariesUniversity of New Mexico
22 May 2013
2
The Long Tail of Orphan Data
Volu
me
Rank frequency of datatype
Well-curated/-preserved
Orphan data
(B. Heidorn)
2
CharacteristicsBig ScienceLarge VolumeAutomated sensorsWell describedWell curatedEasily Discovered
• Small Science• Small Volume• Poorly described• Rarely Indexed• Invisible to scientists• Rarely Used• Dark Data
• High spatial resolution• Process based• Theory Development• Model Development• Benchmarking
Characteristics
3
The Long Tail of Orphan DataVolu
me
Rank frequency of datatype
Subject repositories
Institutional repositories
(B. Heidorn)
3
No repositories
4
5
DataONE: Federating Data
Providing universal access to data about life on earth and the environment that sustains it
1. Building community2. Developing sustainable data discovery and interoperability solutions
3. Enabling science through tools and services
Plan
Collect
Assure
Describe
Preserve
Discover
Integrate
Analyze
6
Metadata Interoperability
KNB
LTER
ORNL DAAC InternalMetadataIndex
CDL
Coordinating Nodes
Met
adat
a Ex
trac
tion
• Virtual Portals
• Numerous search capabilities
• Metadata has link to data, which reside atMember Nodes
USGS CSAS
D-Space, I-Rods …
EML, ISO FGDC
FGDC, ISO
EML
FGDC
Dublin Core
Darwin Core
…
FGDC, ISO
Member Nodes
*Others