strategies for open science and research data · • keystone is to establish an open data platform...

45
Strategies for Open Science and Research Data Dr Simon Hodson Executive Director, CODATA www.codata.org CODAT A CODAT A I I S S U U Conferência: Dados de investigação e ciência abierta Porto, Portugal 22 September 2016

Upload: others

Post on 21-May-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

StrategiesforOpenScienceandResearchData

DrSimonHodsonExecutive Director,CODATA

www.codata.org

CODATACODATAIISSUU

Conferência:Dadosdeinvestigaçãoe ciência abiertaPorto,Portugal

22September2016

Page 2: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

CODATACODATAIISSUU

ResearchdataandOpenScience:Towardsanationalstrategy

§ Identify…§ themainstrategicareastobuildnationalandinstitutional roadmapsforRDMservices§ themain/priority requirements tobeaddressed

§ Agreathonour tobeinvitedtosharemyexperience.§ UKnationalprogramme todevelopRDMcapacityininstitutions;§ CODATAworkswithnationalmembersofopensciencestrategies(processofco-

design)

Page 3: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

CODATACODATAIISSUU

CODATA:CommitteeonDataoftheInternationalCouncilforScience

§ EstablishedbytheInternationalCouncilofSciencetoaddressissuesofdataavailabilityandquality.

§ Remithasbroadenedovertheyears.

§ NewExecutiveCommittee:includesmembersfromKenyaandSouth Africa,willco-optamemberfromLatinAmerica.

§ IncreasedorientationtowardsplayingacoordinatingroleonnationalandregionalOpenSciencestrategies.

§ CODATAPresident,GeoffreyBoulton,wasleadauthorandchairofRoyalSocietyReport:ScienceasanOpenEnterprise.

§ Identifieschallengesandopportunities forsciencesystems,technicalandhuman.

§ Fundamentalmethodologicalissuesforreproducibilityandtransparency.

§ PublicationsanddatashouldbeIntelligentlyOpenandavailableconcurrently.

§ Reportwithverysignificantimpact:G8,H2020

CODATAPresidentGeoffreyBoulton,FRS

RoyalSocietyReport:ScienceasanOpen

Enterprise

Page 4: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

CODATACODATAIISSUU

CODATAPrinciples,PoliciesandPractice

CapacityBuilding

FrontiersofDataScience

IDW2016,11-17Sept,Denver,CO.

Data Science Journal

Page 5: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

CODATACODATAIISSUU

ResearchdataandOpenScience:Towardsanationalstrategy

§ Essentialtobeawareofinternational,nationalandinstitutionaldimensions§ Wemustaddressthehumandimensions (andweneglectthematourperil)

§ ProposedCODATAcollaborationonOpenSciencestrategyinPolandaddressesstakeholder responsibilities andenablingpractices.

§ CurrentICSU– CODATAOpenDataPlatforminitiativeinAfricaaddresses:§ Co-developmentofdatapolicies§ Incentivesandculture§ Trainingandskills§ Roadmapforresearchdatainfrastructure

Page 6: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

The Open Data Iceberg

The Technical Challenge

The Ecosystem Challenge

The Funding Challenge

The Support Challenge

The Skills Challenge

The Incentives Challenge

The Mindset Challenge

Technology

Processes &Organisation

People

motivationandethos.

Developedfrom:Deetjen,U.,E.T.MeyerandR.Schroeder(2015).OECDDigitalEconomyPapers,No.246,OECDPublishing.

Page 7: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

CODATACODATAIISSUU

Whereshouldresearchdatago?

• Earthobservationdata;• Geneticdata;• Socialsciencesurveydata…

Homogenousdatacollectionsessentialforresearch

• Significantdataoutputs fromfundedprojects;

• Rawandanalysedexperimentaldata…

Significantdataoutputsof

publiclyfundedresearch

• Rawandanalysed dataforreproducibility (evidence);

• Databehind thegraph…

Dataunderpinning

researchpublications

Nationalandinternationaldata

archives

Nationalorinstitutionaldataarchives;data

papers

Dedicateddataarchives(e.g.

Dryad)

Page 8: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

CODATACODATAIISSUU

TheCaseforOpenDatainaBigDataWorld

• ScienceInternationalAccordonOpenDatainaBigDataWorld:http://www.science-international.org/

• Presentsapowerfulcasethattheprofoundtransformationsmeanthatdatashouldbe:• Openbydefault• Intelligentlyopen

• Supported byfourmajorinternationalscienceorganisations.

• Laysoutaframeworkofprinciples,responsibilities andenablingpractices forhowthevisionofOpenDatainaBigDataWorldcanbeachieved.

• Campaignforendorsements:over100organisations sofar.PleaseconsiderendorsingtheAccord.

• Translations:Chinese,Russian,Polish,Spanish,French.• IUCr PositionPaperinresponse:

http://www.iucr.org/iucr/open-data

Page 9: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

CODATACODATAIISSUU

AnOpenResearchDataStrategyforPoland

§ Collaborationonanationalworkshop todevelopanationalopen researchdata/opensciencestrategyforPoland.

§ CODATAleadsmetearlierthisyearwithrepresentativesfromMinistryofScienceandEducationandwithOpenScienceCentretoplanaworkshop forFeb/March2017.

§ Drawsstronglyontheapproachoftheaccord.§ StakeholdersandResponsibilities:governments/funders, universitiesandresearch

institutions, institutional libraries,nationalacademiesandlearnedsocieties,nationalandinternationalresearchanddatainfrastructures,publishers andjournaleditorialboards.

§ WorkingGroupsonEnablingPractices: boundaries ofopen,normativevalues(sharing, timeliness),non-restrictivereuseandTDM,incentives,interoperability,sustainabilityofdatainfrastructure,dataliteracy.

Page 10: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

CODATACODATAIISSUU

AfricanOpenDataPlatformInitiativeICSU-CODATA

• Proposals forOpenDataPlatforminitiatives,AfricaandLatinAmericaandCaribbean.

• Holistic‘sciencesystems’approach:policies,procedures, incentives,datainfrastructure,scholarlycommunications, skillsandtraining.

• KeystoneistoestablishanOpenDataPlatformwithacoordinatingrole.

• PilotinitiativefundedbyDepartmentofScienceandTechnology inSouthAfrica:nearly500Keurosoverthreeyears.

• ImplementedbystafffromSouthAfricanAcademyofSciences,underdirection fromICSU-CODATA.

• Currentlyundertakingpreparatorystudytoidentifypartners.

Page 11: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

CODATACODATAIISSUU

BuildingtheInitiative

Establish African OpenDataForum/Platform

Funded Research DataInfrastructureInitiatives

Funded,co-designed transdisciplinary researchprojects

Co-design African OpenDataPolicies

Develop Incentives Frameworks

Develop Research DataScienceTraining

African Research DataInfrastructureRoadmap

Activities requirelow funding forcoordination,secondment,

contributions inkind andevaluation.

Activities requirehigher investmentforcoordination,

co-designimplemenatationandevaluation.

Page 12: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

CODATACODATAIISSUU

CODATAinKenya

§ Internationalworkshoponopendataforscienceindevelopingcountries,UNESCO,Nairobi,August2014.

§ StrongendorsementfortheworkshopfromKenyanCabinetSecretaryandfromlocaluniversitiesandresearchinstitutes.

§ CabinetSecretaryDr.FredMatiang’i:calledonCODATAandother internationalorganisations to'becomemorevisibleineducationandcapacity-building,bydevelopingscienceandeducationalprogramsandactivitiesthatfocusondataandinformation’indevelopingcountries.

§ Announced datacentretobeestablishedatJomoKenyattaUniversityofAgricultureandTechnology.

§ ‘JKUAThasnowestablishedanICTCentreofExcellenceandOpenData(iCEOD)thatwaspartoftheNairobi-CODATAconferencerecommendation’

§ WorkingwithCODATAondatamanagementpoliciesanddevelopmentofiCEOD:http://www.codata.org/membership/national-members/kenya

Page 13: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

CODATACODATAIISSUU

ChallengesanddevelopmentsJKUAT/Kenya

1. Lackofnationallegal/policyframeworkforopendata:e.g.FOIAct…stillBill• JKUATenactedJORDPolicy…aspartofimplementationofCODATAstrategy• Undertakingresearchinvariousdomains• UtilizingCollaborationseg CODATA,CASetc

2. BigData/OpenDatastillanewconcept…datareuseandsharingminimal• DevelopedcoursesforPhDIT- BusinessAnalyticsandreviewed

undergraduatecourses• Supply(motivation)anddemandbalancing

3. StillbuildingandIntegratinginfrastructure• BuildingiCEOD OpenDataPlatformtosupportdatareuse,preservation,

innovation4. IPissues…notenoughlegalandinstitutionalinstrumentstoencouragemoreopen

approaches5. Culturalpractices…dataprivatebydefault6. Capacitybuilding…organizingshortcourses&newcurriculumforDataScience.

Page 14: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

CODATACODATAIISSUU

Resources:CurrentBestPracticeforResearchDataManagementPolicies

§ ExpertreportcommissionedbyCODATAmember.

§ Provides comprehensive summaryofbestpracticeinfunderdatapolicies.

§ Identifieskeyelementstobeaddressed:1. Summaryofpolicydrivers

2. Intelligentopenness

3. Limitsofopenness

4. Definitionofresearchdata

5. Definedatainscope

6. Criteriafor selection

7. Summaryofresponsibilities

8. Infrastructureandcosts

9. DMPrequirements

10. Enablingdiscoveryandreuse

11. Recognitionandreward

12. Reportingrequirements,compliancemonitoring

§ Zenodo:http://dx.doi.org/10.5281/zenodo.27872

§ SeealsoRECODEReport,AnnexonPolicyDevelopment:http://recodeproject.eu/

Page 15: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

CODATACODATAIISSUU

Developments:JournalDataPolicies

§ DryadJointDataArchivingPolicy,Feb2010:http://datadryad.org/jdap§ Thisjournalrequires,asaconditionforpublication,thatdatasupportingtheresultsinthepapershould

bearchivedinanappropriatepublicarchive,suchasGenBank,TreeBASE,Dryad,ortheKnowledgeNetworkforBiocomplexity.

§ PLOSDataAvailabilityPolicy,revisedFeb2014:http://www.plosone.org/static/policies.action#sharing§ PLOSjournalsrequireauthorstomakealldataunderlyingthefindingsdescribedintheirmanuscriptfully

availablewithoutrestriction,withrareexceptions.

§ Jisc worktodevelopregistryofjournaldatapolicies;BioSharing https://biosharing.org/§ LikelynewinitiativethroughRDAtoencouragedevelopmentandadoptionofjournaldatapolicies.§ CODATAworkingwithICSUtoencourageISUs toaddressdatapolicyfromdisciplinaryperspective.

Page 16: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

CODATACODATAIISSUU

Barriers toDataAvailability /Publication

Researchersconcerns:§ Concernthatdatamaybemisusedormisunderstood.§ Concernthatwilllosescientificedgeifsharingbefore

fullyexploited.§ Desiretoretaincontrolofaprofessionalasset.§ Concernthatwillnotbecredited.§ Lackofcareerrewardsfordatapublication.§ SeeODEreport,usingParse.Insight findings:http://www.alliancepermanentaccess.org/wp-

content/uploads/downloads/2011/11/ODE-ReportOnIntegrationOfDataAndPublications-1_1.pdf

§ Cultureinparticularresearchdisciplines;availabilityofinfrastructure.

§ Fundamentally,researchersarereluctanttoexpendeffortsharingdatabecausetheydonotfeelthatdataisadequatelyexposedorcredited. Naturespecial issueondatasharing:

http://www.nature.com/news/specials/datasharing/index.html

Page 17: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

Piwowar andVision(2013),PeerJ DOI:10.7717/peerj.175

CitationadvantageofhavingarchivedGeneExpressionOmnibusdata

Examined10,555 studiesthatcreatedgeneexpressionmicroarraydata,comparingthosethatmadedataavailableandthosethatdidn’t.

Studies thatmadedataavailableinapublicrepository received9%morecitationsthansimilarstudiesforwhichthedatawasnotmadeavailable.Increasedcitationof30%forthosepublished 2004-5.

Page 18: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

CODATACODATAIISSUU

DataPolicies:DataCitation

Out of Cite, Out of Mind

http://bit.ly/out_of_citeJointDeclarationofDataCitation

Principles:https://www.force11.org/datacitation

BackgroundandDevelopments:http://bit.ly/data_citation_principles

Task GrouponDataCitationPrinciples andPracticesIfpublicationsarethestarsand

planetsofthescientificuniverse,dataarethe‘darkmatter’–influentialbutlargelyunobservedinourmappingprocess

Page 19: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

CODATACODATAIISSUU

DataCitationasaRecognisedScientific Responsibility

§ ICSU,InternationalCouncil forScience,Statementon‘Openaccesstoscientificdataandliteratureandtheassessmentofresearchbymetrics’,Sept2014http://bit.ly/icsu-OA-statement

§ EndorsestheOECDPrinciplesandGuidelinesonAccesstoDatafromPubliclyFundedResearch(2007)

§ Recommendation4:‘Sciencepublishersandchiefeditorsofscientificpublicationsshouldrequireauthorstoprovideexplicitreferencestothedatasetsunderlyingpublishedpapers,usinguniquepersistentidentifiers.Theyalsoshouldrequireclearassurancesthatthesedatasetsaredepositedandavailableintrustedandsustainabledigitalrepositories.Citingdatasetsinreferencelistsusinganacceptedstandardformatshouldbeconsideredthenorm.’

§ AccordonOpenDatainaBigDataWorld,Principleix;Paras 54-57onCitationandProvenance:‘When,inscholarlypublications,researchersusedatacreatedbyothers,thosedatashouldbecitedwithreferencetotheiroriginator,totheirprovenanceandtoapermanentdigitalidentifier.’

Page 20: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

CODATACODATAIISSUU

DataCitation:FromPrinciplestoPractice

§ CODATATaskGrouponDataCitation‘DataCitation:FromPrinciplestoPractice,AFocusontheResearchPolicyandFundingCommunity’:http://www.codata.org/task-groups/data-citation-standards-and-practices

§ Organising aninternationalseriesofimplementationandadoptionworkshops.

§ Promotethe implementationofdatacitationprinciplesintheresearchpolicyandfundingcommunitiesthroughout theworld.

§ Stakeholdersinclude:government,funders,researchperforminginstitutions, researchadministrators,researchlibrarians,researchers,learnedsocieties,publishers,dataarchives,journaleditors…§ Whatisthepolicyenvironmentfordatacitation?

§ Whatarecurrentattitudestodatacitation?§ Whatinfrastructurecurrentlyexiststosupport datacitation?§ Whatspecificplansforimplementation wereidentified?

Page 21: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

WearetakingDataCitationworkshopsonaworldtour!

2015:China,Australia,Japan,India andSouth Africa.2016:USA, Israel,Russia +Finland (Nov)andTaiwan(Dec).2017:France,Korea, Indonesia,Brazil…

Synthesis Reportoffirst8workshopstobe published insoon!

Page 22: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

CODATACODATAIISSUU

CODATA-RDASchoolofResearchDataScience

• Contemporary research– particularlywhenaddressing themostsignificant,transdisciplinary researchchallenges–increasinglydepends onarangeofskillsrelatingtodata.TheseskillsincludetheprinciplesandpracticeofOpenScienceandresearchdatamanagementandcuration,thedevelopmentofarangeofdataplatformsandinfrastructures,thetechniquesoflargescaleanalysis,statistics,visualisation andmodellingtechniques,softwaredevelopmentanddataannotation. Theensembleoftheseskills,relatingtodatainresearch,canusefullybecalled‘ResearchDataScience’.

Page 23: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

CODATACODATAIISSUU

FoundationalResearchDataScienceCurriculum

Sevencomponents:openscience,datamanagementandcuration;softwarecarpentry;datacarpentry;datainfrastructures;statisticsandmachinelearning; visualisation.

Buildsonmuchexistingcoursestocreatesomethingmorethanthesumofitsparts:§ OpenScience– reflectiononethosandrequirementsofsharing/openness§ OpenResearchData– Basicsofdatamanagement,DMPs,RDMlife-cycle,data

publishing, metadataandannotation§ SoftwareCarpentry– Introduction toprogramming inR,theUnixshellandGit (sharing

softwareanddata)§ DataCarpentry– Introduction toSQLdatabases§ Visualisation – Tools,CriticalAnalysisofVisualisation§ Analysis– StatisticsandMachineLearning(Clustering, supervisedandunsupervised

learning)§ ComputationalInfrastructures– Introduction tocloudcomputing, launching aVirtual

MachineonanIaaS cloud

Page 24: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

CODATACODATAIISSUU

CODATA-RDASchoolofResearchDataScience

§ FirstSchoolofResearchDataScience,1-12August2016,ICTP,Trieste

§ Funding forstudentsandtutorsprovidedbyICTP,TWAS,CODATA,ACU,RDAEurope,GEOandGODAN.

§ Attendedby70studentsfromallaround theworld.

Page 25: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

#DataTrieste

Page 26: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

CODATACODATAIISSUU

#DataFoo…

§ Programme for#datatriestehttp://bit.ly/School_of_Research_Data_Science-Programme

§ SchoolwillrepeatatTriestein2017and2018,at least…

§ PossiblywithadditionofoneweekmoreadvancedonBigData.

§ WillrunfoundationaltwoweekcourseatICTPINESPinSaoPaolo,Brazil,December2017.

§ Schoolscanberunwithagreaterorlesserdegreeofsupportandcoordinationfromtheinternationalconvenors.

§ Keentoencourageanetworkofschools,butalsolocalschoolswithlowercentralinput.

§ DiscussionswithpossiblepartnersinSouth AfricaandIndia.§ KeentoexploreopportunitieswithCODATANationaland

UnionMembers.

Page 27: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

CODATACODATAIISSUU

ResearchDataInfrastructureRoadmaps

§ DataIntensiveResearchInfrastructureSA(DIRISA)andSAResearchInfrastructureRoadmap(SARIR)identifykey issuesindevelopmentofresearchinfrastructure.§ SARIRidentifies17keyresearchinfrastructuresforSA,basedonanESFRI-typemethodology.

§ ESFRIapproachandthedevelopmentoftheERICs /researchinfrastructuresimportant.§ Whatisthedatalandscapeandecosystem?Whatisprovidedbynationalandinternational

infrastructuresandwhatbyresearchinstitutions?§ ComparableexercisewillbeperformedwithpartnersofDataScienceCapacityBuildingInitiative.

§ ImportanttoensurethatresearchinfrastructureadequatelyaddressesOpenSciencerequirements>developroadmapResearchDataInfrastructures

§ WhataretheRDIrequirementsataregionallevelandforAfricannations?§ Whatistheroleofdisciplinaryinfrastructuresandofresearchinstitutions?§ Importanceofafull-lifecycleapproach.

Page 28: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

Plan

Create

Use

AppraisePublish

Discover

Reuse

Store

Annotate

Select

DiscardDescribe

Identify HandOver?

Access

SupportingtheResearchDataLifecycle

Page 29: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

CODATACODATAIISSUU

Whereshouldresearchdatago?

• Earthobservationdata;• Geneticdata;• Socialsciencesurveydata…

Homogenousdatacollectionsessentialforresearch

• Significantdataoutputs fromfundedprojects;

• Rawandanalysedexperimentaldata…

Significantdataoutputsof

publiclyfundedresearch

• Rawandanalysed dataforreproducibility (evidence);

• Databehind thegraph…

Dataunderpinning

researchpublications

Nationalandinternationaldata

archives

Nationalorinstitutionaldataarchives;data

papers

Dedicateddataarchives(e.g.

Dryad)

Page 30: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

CODATACODATAIISSUU

ResearchDataInfrastructureRoadmaps

§ Researchprioritiesandgapanalysis.§ Ecosystem:whatistheprovisionofRDIs forparticular

disciplinesthroughnationalandinternationalinitiatives?

§ RoleofResearchInstitutions: Islifecyclesupportandlongtailbeingsupported ininstitutions.

§ RDIs arenotjusthardware,but ‘partofaresearchecosystem’,somustaddress:governance;training,personnelandcareerstructures’sustainablefunding;accessandoutreachtonational,publicandcommercialpartners.

§ RoadmapforDataInfrastructure§ Co-designtomeetnationalneedsandpriorities.§ Researchpriorities,opportunities forshared

infrastructures,examplesofgoodgovernanceandsustainablefundingmodels.

§ SustainableBusinessModelsforRDIs

Page 31: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

CODATACODATAIISSUU

TheChallenge:SustainableBusinessModelsforDataRepositories

§ Researchfunder policies– quiterightly– mandatedatastewardship.§ OECDPrinciplesandGuidelines, 2007§ G8ScienceMinistersStatement,2013§ Majorfunders inUS,UK,ECHorizon2020datapolicyetc.

§ Increasingneedfordatarepositoriesanddatastewardship.§ Increasingvolumepresentsachallenge.§ Requirementsforstewardshippresentagreaterchallenge.

§ Sustainingdigitaldatainfrastructureisamajorissueforsciencepolicy!§ Genuineconcernthatcurrentfundingmodelswillproveinelasticandnotmeetthegrowing

requirements– concernonthepartofrepositoriesandfunders.§ Witnessing Innovation

§ Changesinfunding /businessmodels(ADS,TAIR;DANS,ICPSR)§ Innovativebusinessmodels (Dryad,FigShare)

Page 32: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

CODATACODATAIISSUU

OECDGlobalScienceForumProject:SustainableBusinessModelsforDataRepositories

§ Questions toaddress:1. Howaredatarepositoriescurrentlyfunded?2. Whatinnovativeincomestreamsareavailable?3. Whatmeansofrestrainingcostsareavailable?4. Howdoincomestreamsmatchwillingness/ability topayofvariousstakeholders?5. Howdoincomestreams/willingness topayfittogetherintoasustainablebusiness

model?§ BuildsonpreviousworkofRDA-WDSInterestGroup:

http://dx.doi.org/10.5281/zenodo.46693§ Broaderlandscapesurveyofcurrentfundingmodels,May-Sept2016.§ Focusgrouponinnovativeincomestreamsandoncostrestraint,workshopNov2016.§ Microandmacroeconomicanalysisofbusinessmodels,Nov2016-Mar2017.§ Testbusinessmodelswithstakeholdergroups,workshopApril2017.§ Policyrecommendationsbasedonconcretebusinessmodeloptions, April-June2017.

Page 33: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

CODATACODATAIISSUU

iCEOD ValueChain:DataandSocietyAgricultureandNutrition

Page 34: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

CODATACODATAIISSUU

DataRevolution:howcanweimprove…withopendata?

§ GODAN-ODIReport:improvingagriculture,foodandnutritionwithopendata.

§ ‘Althoughtheamountofdataopenlyavailableisconstantlyincreasing,therearestillchallengesrelatedtodatamanagement,licensing, interoperabilityandexploitation.Thereisaneedtoevolvepolicies,practicesandethicsaround closed,shared,and opendata.’

§ Enablingmoreefficientandeffectivedecisionmaking >lowerscostofaccessinginformationandunderpinstoolsthatfarmersthemselvescanuse.

§ Fosteringinnovationtobenefiteveryone>anopportunitythatmustnotbemissedforcreatingnewbusinessesandjobs in‘newdata-poweredinnovationecosystems’.

§ Drivingorganisational andsectorchangethroughtransparency>opendataisessentialtounderstandingcomplexsystems,interventions,targets,change.

§ Availabilityisnotenough>essentialthatthedatabeinteroperableandmachine-readable.

§ Problemorientedandsolution-baseddatastrategies.

§ Developinfrastructureandhumancapacity.

Page 35: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

CODATACODATAIISSUU

TheValueofOpenDataSharing

§ ReportbyCODATAforGEO,theGrouponEarthObservation.

§ Providesaconcise,accessible,highlevelsynthesisofkeyargumentsandevidenceofthebenefitsandvalueofopendatasharing.

§ Particular,butnotexclusive,referencetoEarthObservationdata.

§ Benefitsintheareasof:

§ EconomicBenefits§ SocialWelfareBenefits§ ResearchandInnovationOpportunities§ Education§ Governance

§ Availableathttp://dx.doi.org/10.5281/zenodo.33830§ GEODSWGisbuildingonthisworkwithfurther

examples:wouldbevaluabletoworkwiththiscommunity.

Page 36: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

CODATACODATAIISSUU

Thank you foryour attention!

Credits forslides:inc.GeoffreyBoulton, JosephMuliaro WafulaCredit forphotos: Andjani Gatzweiler

SimonHodsonExecutive Director CODATA

www.codata.orghttp://lists.codata.org/mailman/listinfo/codata-international_lists.codata.org

Email:[email protected]:@simonhodson99

Tel(Office):+33145250496|Tel(Cell):+33686304259

CODATA (ICSU Committee on Data for Science and Technology), 5 rue Auguste Vacquerie, 75016 Paris, FRANCE

Page 37: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

CODATACODATAIISSUU

ExtraSlide

§ XXXX§ XXXX

Page 38: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

CODATACODATAIISSUU

MotivationsandDrivers

§ Rigour andreproducibility§ Researchbenefitsofdatareuse

Page 39: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

80%ofecologydatairretrievableafter20years(516studies)

VinesTH etal.(2013)Current Biology DOI:10.1016/j.cub.2013.11.014

Page 40: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

CODATACODATAIISSUU

DataRevolution:AWorldthatCounts!

§ Creatingaworldthatcounts:Mobilising theDataRevolutionforSustainableDevelopment.

§ Tomeetthenewsustainablity goals‘thereisanurgentneedtomobilise thedatarevolution forallpeople andthewhole planetinordertomonitorprogress,holdgovernmentsaccountableandfostersustainable development.’

§ Withoutimmediateaction,gapsbetweendevelopedanddeveloping countries,betweeninformation-rich andinformation-poorpeople,andbetweentheprivateandpublic sectorswillwiden, andrisksofharmandabuses ofhuman rightswillgrow.

§ Dataqualityandintegrity§ Datadisaggregation(no-oneshouldbeinvisible)§ Datatimeliness§ Datatransparencyandopenness

§ Datausabilityandcuration§ Dataprotectionandprivacy§ Datagovernanceandindependence§ Dataresourcesandcapacity§ Datarights

Page 41: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

How can we improve agriculture, food and nutrition with open data? | Open Data Institute 2015 17

Use cases

Improving crop varieties with open data on breeding trials: AgTrials Cultivar testing is an important means of improving crop varieties. A wide range of trials are taking place on sites all over the world, addressing issues such as drought tolerance, heat stress, and soil management. However, almost all of the data generated OHZ�ILLU�PUHJJLZZPISL�[V�V[OLY�YLZLHYJOLYZ�¶�ÄSLK�H^H`�VU�SHIVYH[VY`�OHYK�KYP]LZ��VY�sometimes lost completely due to bad data management.

By compiling data from agronomic and plant breeding trials and making it open, the Global Agricultural Trial Repository (AgTrials)38 hosted by a CGIAR Research Programme VU�*SPTH[L�*OHUNL��(NYPJ\S[\YL�HUK�-VVK�:LJ\YP[`��**(-:���VɈLYZ�H�YPJO�RUV^SLKNL�base to inform ongoing, collaborative research, while eliminating unnecessary and JVZ[S`�K\WSPJH[PVU�VM�LɈVY[Z��

:JPLU[PZ[Z�\ZLK�����VWLU�(N;YPHSZ�KH[HZL[Z�[V�I\PSK�JYVW�TVKLSZ�ZWLJPÄJ�[V�[OL�>LZ[�Africa region. The models are used to project the local impacts of climate change, and KLÄUL�IYLLKPUN�WYVNYHTTLZ�MVY�HKHW[H[PVU�39

)YPUNPUN�HNYPJ\S[\YHS�YLZLHYJO�[V�[OL�THZZLZ!�-(6�(.90:�WVY[HS(.90:40 is an international network of research institutions and information nodes making agricultural research information globally available. It collects and disseminates bibliographic information on diverse food and agricultural publications, from over 150 KH[H�WYV]PKLYZ�PU����KPɈLYLU[�JV\U[YPLZ�

AGRIS uses bibliographic data as an aggregator for locating related content online and organises it via an open data repository (of over 8 million records). An application combines records with other open data repositories and links to other quality sources of data such as the World Bank, Nature, and the Chinese Germplasm Database.

38 AgTrials website: www.agtrials.org, accessed 03/05/15

39 CGIAR CCAFS (2015), AgTrials helps repurpose data for adaptation research, http://ccafs.cgiar.org/AgTrials, accessed 15/05/15

40 FAO AGRIS Portal: http://agris.fao.org, accessed 03/05/15

Page 42: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

CODATACODATAIISSUU

BenefitsofOpenData:someexamplesfromGEO

§ BarbaraRyan,DirectorofSecretariatGEO,TED-XTalkBarcelona

§ In2008USGovernmentwasconvincedtomakeLandsat Dataopenlyavailable,forfree.

§ Undercharging,thehighestnumberofdownloadswas53scenesperday.

§ Nowover5700scenesperdayaredownloaded.

§ Spanishdeforestationresearch:underthechargingregimedataaccessalonewouldhavecost€260M

§ CODATAproducedaWhitePaperontheValueofDataSharingfortheGEO-XIIPlenary:http://dx.doi.org/10.5281/zenodo.33830

https://www.youtube.com/watch?v=9umWTFgFIVs

Page 43: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

CODATACODATAIISSUU

EconomicBenefitsofDataSharing:LandSat

§ 2006StudyestimatedthelossincaseofadatagapasequivalenttoUS$935M.

§ 2011Studyestimatedbenefitsoflandsat-sourcedinformation foragricultureasUS$858MjustforthestateofIowa.

§ 2015StudyestimatedworldwideeconomicbenefitofUS$2.19BN.

§ Estimatedbenefit inUSofUS$1.8BN.§ ValuingGeospatialInformation:UsingtheContingent

ValuationMethod toEstimatetheEconomicBenefitsofLandsat SatelliteImagery:http://dx.doi.org/10.14358/PERS.81.8.647 (Paywall…Irony…)

§ Opendataandopendatainfrastructurehasasignificanteconomicbenefit.

Page 44: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science
Page 45: Strategies for Open Science and Research Data · • Keystone is to establish an Open Data Platform with a coordinating role. • Pilot initiative funded by Department of Science

CODATACODATAIISSUU

EconomicBenefitsofDataSharing

§ ‘Manystudiesandreportshavedocumentedthepositivevalueofopenness forEOdata,specifically,andforvariousothertypesofdataandinformation,moregenerally.’

§ Weiss2002:quantifiedconsiderableeconomicbenefitsofmakingmeteorologicaldataopen($400-700Mingrossreceipts;businessesandemployment).

§ Houghton2011:apartfromeconomicbenefits,grosssavingforAustralianBureauofStatisticsofAU$3.5Mbyeliminatingchargingandmanagementstructure.

§ Houghton2014:Estimateunrealised benefitsofresearchdataofAU$1.4-4.9BNsetagainstestimatedAU$130-200Mcostofdatainfrastructure.

§ Interestedtoknowwhatstudiesofthebenefitsofdataavailabilityhavebeenconductedinthisareaofresearch?