strategies for open science and research data · • keystone is to establish an open data platform...
TRANSCRIPT
StrategiesforOpenScienceandResearchData
DrSimonHodsonExecutive Director,CODATA
www.codata.org
CODATACODATAIISSUU
Conferência:Dadosdeinvestigaçãoe ciência abiertaPorto,Portugal
22September2016
CODATACODATAIISSUU
ResearchdataandOpenScience:Towardsanationalstrategy
§ Identify…§ themainstrategicareastobuildnationalandinstitutional roadmapsforRDMservices§ themain/priority requirements tobeaddressed
§ Agreathonour tobeinvitedtosharemyexperience.§ UKnationalprogramme todevelopRDMcapacityininstitutions;§ CODATAworkswithnationalmembersofopensciencestrategies(processofco-
design)
CODATACODATAIISSUU
CODATA:CommitteeonDataoftheInternationalCouncilforScience
§ EstablishedbytheInternationalCouncilofSciencetoaddressissuesofdataavailabilityandquality.
§ Remithasbroadenedovertheyears.
§ NewExecutiveCommittee:includesmembersfromKenyaandSouth Africa,willco-optamemberfromLatinAmerica.
§ IncreasedorientationtowardsplayingacoordinatingroleonnationalandregionalOpenSciencestrategies.
§ CODATAPresident,GeoffreyBoulton,wasleadauthorandchairofRoyalSocietyReport:ScienceasanOpenEnterprise.
§ Identifieschallengesandopportunities forsciencesystems,technicalandhuman.
§ Fundamentalmethodologicalissuesforreproducibilityandtransparency.
§ PublicationsanddatashouldbeIntelligentlyOpenandavailableconcurrently.
§ Reportwithverysignificantimpact:G8,H2020
CODATAPresidentGeoffreyBoulton,FRS
RoyalSocietyReport:ScienceasanOpen
Enterprise
CODATACODATAIISSUU
CODATAPrinciples,PoliciesandPractice
CapacityBuilding
FrontiersofDataScience
IDW2016,11-17Sept,Denver,CO.
Data Science Journal
CODATACODATAIISSUU
ResearchdataandOpenScience:Towardsanationalstrategy
§ Essentialtobeawareofinternational,nationalandinstitutionaldimensions§ Wemustaddressthehumandimensions (andweneglectthematourperil)
§ ProposedCODATAcollaborationonOpenSciencestrategyinPolandaddressesstakeholder responsibilities andenablingpractices.
§ CurrentICSU– CODATAOpenDataPlatforminitiativeinAfricaaddresses:§ Co-developmentofdatapolicies§ Incentivesandculture§ Trainingandskills§ Roadmapforresearchdatainfrastructure
The Open Data Iceberg
The Technical Challenge
The Ecosystem Challenge
The Funding Challenge
The Support Challenge
The Skills Challenge
The Incentives Challenge
The Mindset Challenge
Technology
Processes &Organisation
People
motivationandethos.
Developedfrom:Deetjen,U.,E.T.MeyerandR.Schroeder(2015).OECDDigitalEconomyPapers,No.246,OECDPublishing.
CODATACODATAIISSUU
Whereshouldresearchdatago?
• Earthobservationdata;• Geneticdata;• Socialsciencesurveydata…
Homogenousdatacollectionsessentialforresearch
• Significantdataoutputs fromfundedprojects;
• Rawandanalysedexperimentaldata…
Significantdataoutputsof
publiclyfundedresearch
• Rawandanalysed dataforreproducibility (evidence);
• Databehind thegraph…
Dataunderpinning
researchpublications
Nationalandinternationaldata
archives
Nationalorinstitutionaldataarchives;data
papers
Dedicateddataarchives(e.g.
Dryad)
CODATACODATAIISSUU
TheCaseforOpenDatainaBigDataWorld
• ScienceInternationalAccordonOpenDatainaBigDataWorld:http://www.science-international.org/
• Presentsapowerfulcasethattheprofoundtransformationsmeanthatdatashouldbe:• Openbydefault• Intelligentlyopen
• Supported byfourmajorinternationalscienceorganisations.
• Laysoutaframeworkofprinciples,responsibilities andenablingpractices forhowthevisionofOpenDatainaBigDataWorldcanbeachieved.
• Campaignforendorsements:over100organisations sofar.PleaseconsiderendorsingtheAccord.
• Translations:Chinese,Russian,Polish,Spanish,French.• IUCr PositionPaperinresponse:
http://www.iucr.org/iucr/open-data
CODATACODATAIISSUU
AnOpenResearchDataStrategyforPoland
§ Collaborationonanationalworkshop todevelopanationalopen researchdata/opensciencestrategyforPoland.
§ CODATAleadsmetearlierthisyearwithrepresentativesfromMinistryofScienceandEducationandwithOpenScienceCentretoplanaworkshop forFeb/March2017.
§ Drawsstronglyontheapproachoftheaccord.§ StakeholdersandResponsibilities:governments/funders, universitiesandresearch
institutions, institutional libraries,nationalacademiesandlearnedsocieties,nationalandinternationalresearchanddatainfrastructures,publishers andjournaleditorialboards.
§ WorkingGroupsonEnablingPractices: boundaries ofopen,normativevalues(sharing, timeliness),non-restrictivereuseandTDM,incentives,interoperability,sustainabilityofdatainfrastructure,dataliteracy.
CODATACODATAIISSUU
AfricanOpenDataPlatformInitiativeICSU-CODATA
• Proposals forOpenDataPlatforminitiatives,AfricaandLatinAmericaandCaribbean.
• Holistic‘sciencesystems’approach:policies,procedures, incentives,datainfrastructure,scholarlycommunications, skillsandtraining.
• KeystoneistoestablishanOpenDataPlatformwithacoordinatingrole.
• PilotinitiativefundedbyDepartmentofScienceandTechnology inSouthAfrica:nearly500Keurosoverthreeyears.
• ImplementedbystafffromSouthAfricanAcademyofSciences,underdirection fromICSU-CODATA.
• Currentlyundertakingpreparatorystudytoidentifypartners.
CODATACODATAIISSUU
BuildingtheInitiative
Establish African OpenDataForum/Platform
Funded Research DataInfrastructureInitiatives
Funded,co-designed transdisciplinary researchprojects
Co-design African OpenDataPolicies
Develop Incentives Frameworks
Develop Research DataScienceTraining
African Research DataInfrastructureRoadmap
Activities requirelow funding forcoordination,secondment,
contributions inkind andevaluation.
Activities requirehigher investmentforcoordination,
co-designimplemenatationandevaluation.
CODATACODATAIISSUU
CODATAinKenya
§ Internationalworkshoponopendataforscienceindevelopingcountries,UNESCO,Nairobi,August2014.
§ StrongendorsementfortheworkshopfromKenyanCabinetSecretaryandfromlocaluniversitiesandresearchinstitutes.
§ CabinetSecretaryDr.FredMatiang’i:calledonCODATAandother internationalorganisations to'becomemorevisibleineducationandcapacity-building,bydevelopingscienceandeducationalprogramsandactivitiesthatfocusondataandinformation’indevelopingcountries.
§ Announced datacentretobeestablishedatJomoKenyattaUniversityofAgricultureandTechnology.
§ ‘JKUAThasnowestablishedanICTCentreofExcellenceandOpenData(iCEOD)thatwaspartoftheNairobi-CODATAconferencerecommendation’
§ WorkingwithCODATAondatamanagementpoliciesanddevelopmentofiCEOD:http://www.codata.org/membership/national-members/kenya
CODATACODATAIISSUU
ChallengesanddevelopmentsJKUAT/Kenya
1. Lackofnationallegal/policyframeworkforopendata:e.g.FOIAct…stillBill• JKUATenactedJORDPolicy…aspartofimplementationofCODATAstrategy• Undertakingresearchinvariousdomains• UtilizingCollaborationseg CODATA,CASetc
2. BigData/OpenDatastillanewconcept…datareuseandsharingminimal• DevelopedcoursesforPhDIT- BusinessAnalyticsandreviewed
undergraduatecourses• Supply(motivation)anddemandbalancing
3. StillbuildingandIntegratinginfrastructure• BuildingiCEOD OpenDataPlatformtosupportdatareuse,preservation,
innovation4. IPissues…notenoughlegalandinstitutionalinstrumentstoencouragemoreopen
approaches5. Culturalpractices…dataprivatebydefault6. Capacitybuilding…organizingshortcourses&newcurriculumforDataScience.
CODATACODATAIISSUU
Resources:CurrentBestPracticeforResearchDataManagementPolicies
§ ExpertreportcommissionedbyCODATAmember.
§ Provides comprehensive summaryofbestpracticeinfunderdatapolicies.
§ Identifieskeyelementstobeaddressed:1. Summaryofpolicydrivers
2. Intelligentopenness
3. Limitsofopenness
4. Definitionofresearchdata
5. Definedatainscope
6. Criteriafor selection
7. Summaryofresponsibilities
8. Infrastructureandcosts
9. DMPrequirements
10. Enablingdiscoveryandreuse
11. Recognitionandreward
12. Reportingrequirements,compliancemonitoring
§ Zenodo:http://dx.doi.org/10.5281/zenodo.27872
§ SeealsoRECODEReport,AnnexonPolicyDevelopment:http://recodeproject.eu/
CODATACODATAIISSUU
Developments:JournalDataPolicies
§ DryadJointDataArchivingPolicy,Feb2010:http://datadryad.org/jdap§ Thisjournalrequires,asaconditionforpublication,thatdatasupportingtheresultsinthepapershould
bearchivedinanappropriatepublicarchive,suchasGenBank,TreeBASE,Dryad,ortheKnowledgeNetworkforBiocomplexity.
§ PLOSDataAvailabilityPolicy,revisedFeb2014:http://www.plosone.org/static/policies.action#sharing§ PLOSjournalsrequireauthorstomakealldataunderlyingthefindingsdescribedintheirmanuscriptfully
availablewithoutrestriction,withrareexceptions.
§ Jisc worktodevelopregistryofjournaldatapolicies;BioSharing https://biosharing.org/§ LikelynewinitiativethroughRDAtoencouragedevelopmentandadoptionofjournaldatapolicies.§ CODATAworkingwithICSUtoencourageISUs toaddressdatapolicyfromdisciplinaryperspective.
CODATACODATAIISSUU
Barriers toDataAvailability /Publication
Researchersconcerns:§ Concernthatdatamaybemisusedormisunderstood.§ Concernthatwilllosescientificedgeifsharingbefore
fullyexploited.§ Desiretoretaincontrolofaprofessionalasset.§ Concernthatwillnotbecredited.§ Lackofcareerrewardsfordatapublication.§ SeeODEreport,usingParse.Insight findings:http://www.alliancepermanentaccess.org/wp-
content/uploads/downloads/2011/11/ODE-ReportOnIntegrationOfDataAndPublications-1_1.pdf
§ Cultureinparticularresearchdisciplines;availabilityofinfrastructure.
§ Fundamentally,researchersarereluctanttoexpendeffortsharingdatabecausetheydonotfeelthatdataisadequatelyexposedorcredited. Naturespecial issueondatasharing:
http://www.nature.com/news/specials/datasharing/index.html
Piwowar andVision(2013),PeerJ DOI:10.7717/peerj.175
CitationadvantageofhavingarchivedGeneExpressionOmnibusdata
Examined10,555 studiesthatcreatedgeneexpressionmicroarraydata,comparingthosethatmadedataavailableandthosethatdidn’t.
Studies thatmadedataavailableinapublicrepository received9%morecitationsthansimilarstudiesforwhichthedatawasnotmadeavailable.Increasedcitationof30%forthosepublished 2004-5.
CODATACODATAIISSUU
DataPolicies:DataCitation
Out of Cite, Out of Mind
http://bit.ly/out_of_citeJointDeclarationofDataCitation
Principles:https://www.force11.org/datacitation
BackgroundandDevelopments:http://bit.ly/data_citation_principles
Task GrouponDataCitationPrinciples andPracticesIfpublicationsarethestarsand
planetsofthescientificuniverse,dataarethe‘darkmatter’–influentialbutlargelyunobservedinourmappingprocess
CODATACODATAIISSUU
DataCitationasaRecognisedScientific Responsibility
§ ICSU,InternationalCouncil forScience,Statementon‘Openaccesstoscientificdataandliteratureandtheassessmentofresearchbymetrics’,Sept2014http://bit.ly/icsu-OA-statement
§ EndorsestheOECDPrinciplesandGuidelinesonAccesstoDatafromPubliclyFundedResearch(2007)
§ Recommendation4:‘Sciencepublishersandchiefeditorsofscientificpublicationsshouldrequireauthorstoprovideexplicitreferencestothedatasetsunderlyingpublishedpapers,usinguniquepersistentidentifiers.Theyalsoshouldrequireclearassurancesthatthesedatasetsaredepositedandavailableintrustedandsustainabledigitalrepositories.Citingdatasetsinreferencelistsusinganacceptedstandardformatshouldbeconsideredthenorm.’
§ AccordonOpenDatainaBigDataWorld,Principleix;Paras 54-57onCitationandProvenance:‘When,inscholarlypublications,researchersusedatacreatedbyothers,thosedatashouldbecitedwithreferencetotheiroriginator,totheirprovenanceandtoapermanentdigitalidentifier.’
CODATACODATAIISSUU
DataCitation:FromPrinciplestoPractice
§ CODATATaskGrouponDataCitation‘DataCitation:FromPrinciplestoPractice,AFocusontheResearchPolicyandFundingCommunity’:http://www.codata.org/task-groups/data-citation-standards-and-practices
§ Organising aninternationalseriesofimplementationandadoptionworkshops.
§ Promotethe implementationofdatacitationprinciplesintheresearchpolicyandfundingcommunitiesthroughout theworld.
§ Stakeholdersinclude:government,funders,researchperforminginstitutions, researchadministrators,researchlibrarians,researchers,learnedsocieties,publishers,dataarchives,journaleditors…§ Whatisthepolicyenvironmentfordatacitation?
§ Whatarecurrentattitudestodatacitation?§ Whatinfrastructurecurrentlyexiststosupport datacitation?§ Whatspecificplansforimplementation wereidentified?
WearetakingDataCitationworkshopsonaworldtour!
2015:China,Australia,Japan,India andSouth Africa.2016:USA, Israel,Russia +Finland (Nov)andTaiwan(Dec).2017:France,Korea, Indonesia,Brazil…
Synthesis Reportoffirst8workshopstobe published insoon!
CODATACODATAIISSUU
CODATA-RDASchoolofResearchDataScience
• Contemporary research– particularlywhenaddressing themostsignificant,transdisciplinary researchchallenges–increasinglydepends onarangeofskillsrelatingtodata.TheseskillsincludetheprinciplesandpracticeofOpenScienceandresearchdatamanagementandcuration,thedevelopmentofarangeofdataplatformsandinfrastructures,thetechniquesoflargescaleanalysis,statistics,visualisation andmodellingtechniques,softwaredevelopmentanddataannotation. Theensembleoftheseskills,relatingtodatainresearch,canusefullybecalled‘ResearchDataScience’.
CODATACODATAIISSUU
FoundationalResearchDataScienceCurriculum
Sevencomponents:openscience,datamanagementandcuration;softwarecarpentry;datacarpentry;datainfrastructures;statisticsandmachinelearning; visualisation.
Buildsonmuchexistingcoursestocreatesomethingmorethanthesumofitsparts:§ OpenScience– reflectiononethosandrequirementsofsharing/openness§ OpenResearchData– Basicsofdatamanagement,DMPs,RDMlife-cycle,data
publishing, metadataandannotation§ SoftwareCarpentry– Introduction toprogramming inR,theUnixshellandGit (sharing
softwareanddata)§ DataCarpentry– Introduction toSQLdatabases§ Visualisation – Tools,CriticalAnalysisofVisualisation§ Analysis– StatisticsandMachineLearning(Clustering, supervisedandunsupervised
learning)§ ComputationalInfrastructures– Introduction tocloudcomputing, launching aVirtual
MachineonanIaaS cloud
CODATACODATAIISSUU
CODATA-RDASchoolofResearchDataScience
§ FirstSchoolofResearchDataScience,1-12August2016,ICTP,Trieste
§ Funding forstudentsandtutorsprovidedbyICTP,TWAS,CODATA,ACU,RDAEurope,GEOandGODAN.
§ Attendedby70studentsfromallaround theworld.
#DataTrieste
CODATACODATAIISSUU
#DataFoo…
§ Programme for#datatriestehttp://bit.ly/School_of_Research_Data_Science-Programme
§ SchoolwillrepeatatTriestein2017and2018,at least…
§ PossiblywithadditionofoneweekmoreadvancedonBigData.
§ WillrunfoundationaltwoweekcourseatICTPINESPinSaoPaolo,Brazil,December2017.
§ Schoolscanberunwithagreaterorlesserdegreeofsupportandcoordinationfromtheinternationalconvenors.
§ Keentoencourageanetworkofschools,butalsolocalschoolswithlowercentralinput.
§ DiscussionswithpossiblepartnersinSouth AfricaandIndia.§ KeentoexploreopportunitieswithCODATANationaland
UnionMembers.
CODATACODATAIISSUU
ResearchDataInfrastructureRoadmaps
§ DataIntensiveResearchInfrastructureSA(DIRISA)andSAResearchInfrastructureRoadmap(SARIR)identifykey issuesindevelopmentofresearchinfrastructure.§ SARIRidentifies17keyresearchinfrastructuresforSA,basedonanESFRI-typemethodology.
§ ESFRIapproachandthedevelopmentoftheERICs /researchinfrastructuresimportant.§ Whatisthedatalandscapeandecosystem?Whatisprovidedbynationalandinternational
infrastructuresandwhatbyresearchinstitutions?§ ComparableexercisewillbeperformedwithpartnersofDataScienceCapacityBuildingInitiative.
§ ImportanttoensurethatresearchinfrastructureadequatelyaddressesOpenSciencerequirements>developroadmapResearchDataInfrastructures
§ WhataretheRDIrequirementsataregionallevelandforAfricannations?§ Whatistheroleofdisciplinaryinfrastructuresandofresearchinstitutions?§ Importanceofafull-lifecycleapproach.
Plan
Create
Use
AppraisePublish
Discover
Reuse
Store
Annotate
Select
DiscardDescribe
Identify HandOver?
Access
SupportingtheResearchDataLifecycle
CODATACODATAIISSUU
Whereshouldresearchdatago?
• Earthobservationdata;• Geneticdata;• Socialsciencesurveydata…
Homogenousdatacollectionsessentialforresearch
• Significantdataoutputs fromfundedprojects;
• Rawandanalysedexperimentaldata…
Significantdataoutputsof
publiclyfundedresearch
• Rawandanalysed dataforreproducibility (evidence);
• Databehind thegraph…
Dataunderpinning
researchpublications
Nationalandinternationaldata
archives
Nationalorinstitutionaldataarchives;data
papers
Dedicateddataarchives(e.g.
Dryad)
CODATACODATAIISSUU
ResearchDataInfrastructureRoadmaps
§ Researchprioritiesandgapanalysis.§ Ecosystem:whatistheprovisionofRDIs forparticular
disciplinesthroughnationalandinternationalinitiatives?
§ RoleofResearchInstitutions: Islifecyclesupportandlongtailbeingsupported ininstitutions.
§ RDIs arenotjusthardware,but ‘partofaresearchecosystem’,somustaddress:governance;training,personnelandcareerstructures’sustainablefunding;accessandoutreachtonational,publicandcommercialpartners.
§ RoadmapforDataInfrastructure§ Co-designtomeetnationalneedsandpriorities.§ Researchpriorities,opportunities forshared
infrastructures,examplesofgoodgovernanceandsustainablefundingmodels.
§ SustainableBusinessModelsforRDIs
CODATACODATAIISSUU
TheChallenge:SustainableBusinessModelsforDataRepositories
§ Researchfunder policies– quiterightly– mandatedatastewardship.§ OECDPrinciplesandGuidelines, 2007§ G8ScienceMinistersStatement,2013§ Majorfunders inUS,UK,ECHorizon2020datapolicyetc.
§ Increasingneedfordatarepositoriesanddatastewardship.§ Increasingvolumepresentsachallenge.§ Requirementsforstewardshippresentagreaterchallenge.
§ Sustainingdigitaldatainfrastructureisamajorissueforsciencepolicy!§ Genuineconcernthatcurrentfundingmodelswillproveinelasticandnotmeetthegrowing
requirements– concernonthepartofrepositoriesandfunders.§ Witnessing Innovation
§ Changesinfunding /businessmodels(ADS,TAIR;DANS,ICPSR)§ Innovativebusinessmodels (Dryad,FigShare)
CODATACODATAIISSUU
OECDGlobalScienceForumProject:SustainableBusinessModelsforDataRepositories
§ Questions toaddress:1. Howaredatarepositoriescurrentlyfunded?2. Whatinnovativeincomestreamsareavailable?3. Whatmeansofrestrainingcostsareavailable?4. Howdoincomestreamsmatchwillingness/ability topayofvariousstakeholders?5. Howdoincomestreams/willingness topayfittogetherintoasustainablebusiness
model?§ BuildsonpreviousworkofRDA-WDSInterestGroup:
http://dx.doi.org/10.5281/zenodo.46693§ Broaderlandscapesurveyofcurrentfundingmodels,May-Sept2016.§ Focusgrouponinnovativeincomestreamsandoncostrestraint,workshopNov2016.§ Microandmacroeconomicanalysisofbusinessmodels,Nov2016-Mar2017.§ Testbusinessmodelswithstakeholdergroups,workshopApril2017.§ Policyrecommendationsbasedonconcretebusinessmodeloptions, April-June2017.
CODATACODATAIISSUU
iCEOD ValueChain:DataandSocietyAgricultureandNutrition
CODATACODATAIISSUU
DataRevolution:howcanweimprove…withopendata?
§ GODAN-ODIReport:improvingagriculture,foodandnutritionwithopendata.
§ ‘Althoughtheamountofdataopenlyavailableisconstantlyincreasing,therearestillchallengesrelatedtodatamanagement,licensing, interoperabilityandexploitation.Thereisaneedtoevolvepolicies,practicesandethicsaround closed,shared,and opendata.’
§ Enablingmoreefficientandeffectivedecisionmaking >lowerscostofaccessinginformationandunderpinstoolsthatfarmersthemselvescanuse.
§ Fosteringinnovationtobenefiteveryone>anopportunitythatmustnotbemissedforcreatingnewbusinessesandjobs in‘newdata-poweredinnovationecosystems’.
§ Drivingorganisational andsectorchangethroughtransparency>opendataisessentialtounderstandingcomplexsystems,interventions,targets,change.
§ Availabilityisnotenough>essentialthatthedatabeinteroperableandmachine-readable.
§ Problemorientedandsolution-baseddatastrategies.
§ Developinfrastructureandhumancapacity.
CODATACODATAIISSUU
TheValueofOpenDataSharing
§ ReportbyCODATAforGEO,theGrouponEarthObservation.
§ Providesaconcise,accessible,highlevelsynthesisofkeyargumentsandevidenceofthebenefitsandvalueofopendatasharing.
§ Particular,butnotexclusive,referencetoEarthObservationdata.
§ Benefitsintheareasof:
§ EconomicBenefits§ SocialWelfareBenefits§ ResearchandInnovationOpportunities§ Education§ Governance
§ Availableathttp://dx.doi.org/10.5281/zenodo.33830§ GEODSWGisbuildingonthisworkwithfurther
examples:wouldbevaluabletoworkwiththiscommunity.
CODATACODATAIISSUU
Thank you foryour attention!
Credits forslides:inc.GeoffreyBoulton, JosephMuliaro WafulaCredit forphotos: Andjani Gatzweiler
SimonHodsonExecutive Director CODATA
www.codata.orghttp://lists.codata.org/mailman/listinfo/codata-international_lists.codata.org
Email:[email protected]:@simonhodson99
Tel(Office):+33145250496|Tel(Cell):+33686304259
CODATA (ICSU Committee on Data for Science and Technology), 5 rue Auguste Vacquerie, 75016 Paris, FRANCE
CODATACODATAIISSUU
ExtraSlide
§ XXXX§ XXXX
CODATACODATAIISSUU
MotivationsandDrivers
§ Rigour andreproducibility§ Researchbenefitsofdatareuse
80%ofecologydatairretrievableafter20years(516studies)
VinesTH etal.(2013)Current Biology DOI:10.1016/j.cub.2013.11.014
CODATACODATAIISSUU
DataRevolution:AWorldthatCounts!
§ Creatingaworldthatcounts:Mobilising theDataRevolutionforSustainableDevelopment.
§ Tomeetthenewsustainablity goals‘thereisanurgentneedtomobilise thedatarevolution forallpeople andthewhole planetinordertomonitorprogress,holdgovernmentsaccountableandfostersustainable development.’
§ Withoutimmediateaction,gapsbetweendevelopedanddeveloping countries,betweeninformation-rich andinformation-poorpeople,andbetweentheprivateandpublic sectorswillwiden, andrisksofharmandabuses ofhuman rightswillgrow.
§ Dataqualityandintegrity§ Datadisaggregation(no-oneshouldbeinvisible)§ Datatimeliness§ Datatransparencyandopenness
§ Datausabilityandcuration§ Dataprotectionandprivacy§ Datagovernanceandindependence§ Dataresourcesandcapacity§ Datarights
How can we improve agriculture, food and nutrition with open data? | Open Data Institute 2015 17
Use cases
Improving crop varieties with open data on breeding trials: AgTrials Cultivar testing is an important means of improving crop varieties. A wide range of trials are taking place on sites all over the world, addressing issues such as drought tolerance, heat stress, and soil management. However, almost all of the data generated OHZ�ILLU�PUHJJLZZPISL�[V�V[OLY�YLZLHYJOLYZ�¶�ÄSLK�H^H`�VU�SHIVYH[VY`�OHYK�KYP]LZ��VY�sometimes lost completely due to bad data management.
By compiling data from agronomic and plant breeding trials and making it open, the Global Agricultural Trial Repository (AgTrials)38 hosted by a CGIAR Research Programme VU�*SPTH[L�*OHUNL��(NYPJ\S[\YL�HUK�-VVK�:LJ\YP[`��**(-:���VɈLYZ�H�YPJO�RUV^SLKNL�base to inform ongoing, collaborative research, while eliminating unnecessary and JVZ[S`�K\WSPJH[PVU�VM�LɈVY[Z��
:JPLU[PZ[Z�\ZLK�����VWLU�(N;YPHSZ�KH[HZL[Z�[V�I\PSK�JYVW�TVKLSZ�ZWLJPÄJ�[V�[OL�>LZ[�Africa region. The models are used to project the local impacts of climate change, and KLÄUL�IYLLKPUN�WYVNYHTTLZ�MVY�HKHW[H[PVU�39
)YPUNPUN�HNYPJ\S[\YHS�YLZLHYJO�[V�[OL�THZZLZ!�-(6�(.90:�WVY[HS(.90:40 is an international network of research institutions and information nodes making agricultural research information globally available. It collects and disseminates bibliographic information on diverse food and agricultural publications, from over 150 KH[H�WYV]PKLYZ�PU����KPɈLYLU[�JV\U[YPLZ�
AGRIS uses bibliographic data as an aggregator for locating related content online and organises it via an open data repository (of over 8 million records). An application combines records with other open data repositories and links to other quality sources of data such as the World Bank, Nature, and the Chinese Germplasm Database.
38 AgTrials website: www.agtrials.org, accessed 03/05/15
39 CGIAR CCAFS (2015), AgTrials helps repurpose data for adaptation research, http://ccafs.cgiar.org/AgTrials, accessed 15/05/15
40 FAO AGRIS Portal: http://agris.fao.org, accessed 03/05/15
CODATACODATAIISSUU
BenefitsofOpenData:someexamplesfromGEO
§ BarbaraRyan,DirectorofSecretariatGEO,TED-XTalkBarcelona
§ In2008USGovernmentwasconvincedtomakeLandsat Dataopenlyavailable,forfree.
§ Undercharging,thehighestnumberofdownloadswas53scenesperday.
§ Nowover5700scenesperdayaredownloaded.
§ Spanishdeforestationresearch:underthechargingregimedataaccessalonewouldhavecost€260M
§ CODATAproducedaWhitePaperontheValueofDataSharingfortheGEO-XIIPlenary:http://dx.doi.org/10.5281/zenodo.33830
https://www.youtube.com/watch?v=9umWTFgFIVs
CODATACODATAIISSUU
EconomicBenefitsofDataSharing:LandSat
§ 2006StudyestimatedthelossincaseofadatagapasequivalenttoUS$935M.
§ 2011Studyestimatedbenefitsoflandsat-sourcedinformation foragricultureasUS$858MjustforthestateofIowa.
§ 2015StudyestimatedworldwideeconomicbenefitofUS$2.19BN.
§ Estimatedbenefit inUSofUS$1.8BN.§ ValuingGeospatialInformation:UsingtheContingent
ValuationMethod toEstimatetheEconomicBenefitsofLandsat SatelliteImagery:http://dx.doi.org/10.14358/PERS.81.8.647 (Paywall…Irony…)
§ Opendataandopendatainfrastructurehasasignificanteconomicbenefit.
CODATACODATAIISSUU
EconomicBenefitsofDataSharing
§ ‘Manystudiesandreportshavedocumentedthepositivevalueofopenness forEOdata,specifically,andforvariousothertypesofdataandinformation,moregenerally.’
§ Weiss2002:quantifiedconsiderableeconomicbenefitsofmakingmeteorologicaldataopen($400-700Mingrossreceipts;businessesandemployment).
§ Houghton2011:apartfromeconomicbenefits,grosssavingforAustralianBureauofStatisticsofAU$3.5Mbyeliminatingchargingandmanagementstructure.
§ Houghton2014:Estimateunrealised benefitsofresearchdataofAU$1.4-4.9BNsetagainstestimatedAU$130-200Mcostofdatainfrastructure.
§ Interestedtoknowwhatstudiesofthebenefitsofdataavailabilityhavebeenconductedinthisareaofresearch?