GLOBAL BIODIVERSITY INFORMATION FACILITY
Dr Vishwas Chavan
Senior Programme Officer for [email protected]
WWW.GBIF.ORG
Towards Data Publishing Framework for primary biodiversity data
Building the Biodiversity Informatics Commons
DataCite Summer Meeting, 7-8 June 2010, Hannover
GBIF: an intergovernmental initiative to share biodiversity information
Currently 54 countries; 44 International Organisations…
GBIF’s Mandate: “To facilitate free and open access to biodiversity data worldwide, via the Internet, to underpin scientific research, conservation and sustainable development.”
GBIF is government-initiated and government-funded, in response to government agency needs in biodiversity information access and management.
GBIF is in service to science, as a global ‘public good’
Data shared online via GBIF
(>201 million biodiversity records mapped to a 1 x 1 degree grid)
Data Publishers: 316
Data Resources: 9900
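The 1 x 1 degree mapping mentioned on this slide can be illustrated with a short sketch. It is not GBIF's actual implementation: the function names `grid_cell` and `count_by_cell`, the sample coordinates, and the choice to identify each cell by its south-west corner are assumptions made for illustration only.

```python
import math
from collections import Counter


def grid_cell(lat, lon):
    """Return the south-west corner (lat, lon) of the 1 x 1 degree cell
    containing the point. Cell naming by corner is an assumption."""
    if not (-90 <= lat <= 90 and -180 <= lon <= 180):
        raise ValueError("coordinates out of range")
    # floor() keeps negative coordinates in the right cell (-0.5 -> -1).
    cell_lat = min(math.floor(lat), 89)    # fold lat == 90 into the top row
    cell_lon = min(math.floor(lon), 179)   # fold lon == 180 into the last column
    return (cell_lat, cell_lon)


def count_by_cell(records):
    """Count occurrence records per grid cell."""
    return Counter(grid_cell(lat, lon) for lat, lon in records)


# Hypothetical occurrence coordinates (latitude, longitude).
records = [(18.52, 73.85), (18.90, 73.10), (-0.5, 36.8), (55.68, 12.57)]
counts = count_by_cell(records)
```

Each key of `counts` is one grid cell, so a map like the one on the slide would shade a cell according to its record count.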
GBIF facilitates access/exchange of data
GBIF-mediated data on ‘India’
GBIF, Global Information Infrastructure for Biodiversity
Global Infrastructure
Tools, Standards, and Processes
Strategies and Policy Framework
Outreach and Capacity Building
Global Biodiversity Research Infrastructure
In summary… GBIF’s Informatics
Improved access to Names, Metadata and Primary Biodiversity Data
Distributed GBIF informatics architecture
Faster and easier publishing of data
Primary biodiversity data and information effectively available
Data and information that have been produced but are not easy to find, access, and use (i.e. not effectively available!): the gigantic task of mobilising billions of data records still lies ahead, as does integrating new data.
Biological collections
Scientific publications
Observations
Reports
Gray literature
Databases
Geography
Improving mobilisation and Cultural Changes
Broadening Data Types
Data Resources Discovery
Innovative Approaches to Data Mobilisation
Data Mobilisation Strategy Discussions
Data Publishing Framework
What is there for me?
Recognition
Opportunities
Investment
Why should I publish data?
Data Publishing Framework
Cultural change towards ‘free and open access’ to biodiversity data
Addresses social, technical, and policy concerns
Answers ‘What is there for me?’ for ALL
Chavan and Ingwersen (2009), BMC Bioinformatics, 10 (Suppl. 14): S2
DPF: Core Technical Components
Chavan and Ingwersen (2009), BMC Bioinformatics, 10 (Suppl. 14): S2
Occurrence Data
KML file
Data Publication together with scholarly publication: ZooKeys experience
Penev et al. (2009). ZooKeys, 11: 1-8.
Data Paper: Recognising Data Discovery
[Figure: data paper workflow. Labels: Metadata Authors; GBIF Metadata Repository; Distributed Metadata Catalogues; auto conversion to manuscript; Journal System (Submission, Peer Review, Revision, Acceptance, Publication); Persistent Identifiers; Registry; GBRDS; DOI; Current Biology; PhytoKeys; Indian J. Mar. Sci.]
Data Citation Mechanism & Service
• Deep data citation mechanism
– Recognise ALL with their roles
– Multilayer citation – producer, publisher, aggregator
– Citations within citations
• Data Citation Service
– Resolve citation any time
– Discover the underlying data
Under development
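One way to picture "citations within citations" is as a nested record in which each layer names a contributor and its role. The sketch below is only an illustration under stated assumptions: the `Citation` class, its fields, and the example names are hypothetical and are not part of any GBIF service, which the slide describes as still under development.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Citation:
    """One layer of a multilayer citation (hypothetical structure)."""
    name: str
    role: str      # e.g. "producer", "publisher", "aggregator"
    year: int
    title: str
    sources: List["Citation"] = field(default_factory=list)

    def render(self, depth=0):
        """Return the citation and its nested sources as indented lines."""
        lines = ["  " * depth + f"{self.name} ({self.year}). {self.title} [{self.role}]"]
        for source in self.sources:
            lines.extend(source.render(depth + 1))
        return lines

    def contributors(self):
        """Flatten all layers so every contributor is listed with its role."""
        found = [(self.name, self.role)]
        for source in self.sources:
            found.extend(source.contributors())
        return found


# Hypothetical example: aggregator cites publisher, which cites producer.
producer = Citation("Example Survey Team", "producer", 2008, "Field observations")
publisher = Citation("Example Museum", "publisher", 2009,
                     "Museum specimen dataset", sources=[producer])
aggregator = Citation("Hypothetical Aggregator", "aggregator", 2010,
                      "Aggregated occurrence records", sources=[publisher])

rendered = aggregator.render()
roles = aggregator.contributors()
```

Calling `"\n".join(rendered)` gives an indented citation with one line per layer, while `roles` lists everyone to be recognised regardless of how deeply they are nested.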
Data Usage Index (DUI): Why?
• To demonstrate to data publishers that their biodiversity efforts do have impact
• To encourage …
– Increase of high quality data discovery and mobilisation
– Further usage of biodiversity data and information in scientific work
– Formal citation of datasets in research papers
– Standardisation of dataset information
Data Usage Index (DUI): What is it?
A set of indicators operating on data concerned with:
• Unique Visits
• Loyal Visits (repeated visits by same IP address)
• Download of datasets & dataset records
• Volume and (rank) distributions of dataset records
These are measured per visit, visitor, dataset provider (institution, country, region, world, theme) and period.
Indicators to be normalised (by records or MB), relative (to world, theme) and weighted (according to provider profile of species/taxa/themes)
Chavan, June 2009
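To make the indicator list concrete, here is a minimal sketch of how a few of these indicators could be computed from a visit log. The slide only names the indicators, so the exact definitions used here are assumptions: the `Visit` record, the function `usage_indicators`, counting "loyal visits" as the number of IP addresses with more than one visit, and the sample log are all hypothetical.

```python
from collections import Counter, namedtuple

# Hypothetical log entry: who visited which dataset and how many
# records they downloaded during that visit.
Visit = namedtuple("Visit", "ip dataset records_downloaded")


def usage_indicators(visits, dataset, dataset_size):
    """Compute illustrative usage indicators for one dataset."""
    own = [v for v in visits if v.dataset == dataset]
    visits_per_ip = Counter(v.ip for v in own)
    records = sum(v.records_downloaded for v in own)
    world_records = sum(v.records_downloaded for v in visits)
    return {
        # distinct IP addresses that visited the dataset
        "unique_visits": len(visits_per_ip),
        # IP addresses that came back at least once (assumed definition)
        "loyal_visits": sum(1 for n in visits_per_ip.values() if n > 1),
        # visits in which at least one record was downloaded
        "download_events": sum(1 for v in own if v.records_downloaded > 0),
        "records_downloaded": records,
        "records_per_visit": records / len(own) if own else 0.0,
        # normalised by the number of records the dataset holds
        "normalised_by_records": records / dataset_size if dataset_size else 0.0,
        # share of all downloaded records ("relative to world")
        "relative_to_world": records / world_records if world_records else 0.0,
    }


log = [
    Visit("10.0.0.1", "birds", 100),
    Visit("10.0.0.1", "birds", 50),
    Visit("10.0.0.2", "birds", 0),
    Visit("10.0.0.3", "fishes", 150),
]
dui = usage_indicators(log, "birds", dataset_size=1000)
```

Weighting by provider profile and rank distributions are left out of this sketch, since the slide does not say how they would be defined.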
[Figure: DUI hierarchy. Data flow type: Digitisation; Bottom – Top; Top – Bottom. Levels: Universal DUI; Global DUIs (GDUI); National, Regional, Thematic DUIs (NDUI, RDUI, TDUI); Local DUIs (LDUI). Other labels: Mirror; Aggregator; Publishing Toolkit.]
Implementation of DUI
[Figure: Data Usage Index (DUI) implementation. Labels: Phase I, Phase II, Phase III; Access, Use, Management; Data Life Cycle.]
Improving the relevance of Data Usage Index
DPF: Challenges
Chavan and Ingwersen (2009), BMC Bioinformatics, 10 (Suppl. 14): S2
Policy & Political Uptake
Cultural & Social Acceptance
Individual Researcher
Scientific and Academic Institutions
Funding and Donor Agencies
Traditional Publishing Industry
[Figure: data publishing cycles. Legend: Existing cycle; Complementary Expected cycle. Nodes: Funding Agencies; Project; Data Creation, Collection; Analysis, Interpretation; Scholarly Publishing; Data Management & Archival; Data Publishing; Metadata; Data Discovery; Increased Data Usage; Knowledge Dissemination; Impact Factor; Data Usage Index; Incentivisation through Data Paper. Edge labels: support; results in; inspires another; requires; provide feedback on gaps and strategies for; leads to; facilitate; improves data quality and fitness; encourages.]
Source: BMC Bioinformatics 2009, 10(Suppl 14):S2, doi:10.1186/1471-2105-10-S14-S2
Impact of Data Publishing Framework
Primary Data and Scholarly Publications
Seamless, embedded interconnections between data & paper
• Unconventional use of data
• Improving reliability & credibility
Email: [email protected]
Data Publishing together with Scholarly Publishing!