Legal, Ethical, and Policy Issues of “Big Data 2.0” Collaborative Ventures and Roles for Info Pros

Sheila Corrall, Kip Currier

45th LIBER Annual Conference: Libraries Opening Paths to Knowledge, Wednesday, June 29, 2016



Information Culture & Data Stewardship


Outline

• Background: Library literature – Definition of key terms
• Case studies: Pittsburgh Health Data Alliance – UK Biobank – Big Data Europe – Personal Genome Project – Precision Medicine Initiative – Oncology Research Information Exchange Network
• Implications: Legal, ethical, policy
• Conclusions: Roles and competencies


Library literature

• Educating students about how companies use big data and advising users on how to find datasets for research (Bieraugel, 2013; Hoy, 2014)
• Moving beyond research data management to define and discuss other specialized data-related roles (Lyon & Brenner, 2015; Lyon et al., 2016)
• Exposing (linked) library collections data and making them reusable for resource discovery (Campbell & Cowan, 2016; Teets & Goldner, 2013)
• Carrying out their own big data projects to analyze collection use and conduct cross-disciplinary comparisons (Huwe, 2014; Tattersall, 2016)
• Helping communities create local data infrastructures and make big data more useful, by creating taxonomies, designing metadata schemes, and systematizing retrieval methods, and also assisting with policy concerns (Bertot et al., 2014; Bieraugel, 2013; Reinhalter & Wittmann, 2014)
• Serving as authorities on copyright and intellectual property issues arising from big data (Gordon-Murnane, 2012)


What are data? (When are data?)

Data are forms of information that may be defined by example, processing level, origin, and preservation value

“In addition to digital manifestations of literature (including text, sound, still images, moving images, models, games, or simulations), [the term] refers as well to forms of data and databases that generally require the assistance of computational machinery and software in order to be useful, such as various types of laboratory data including spectrographic, genomic sequencing, and electron microscopy data; observational data, such as remote sensing, geospatial, and socioeconomic data; and other forms of data either generated or compiled, by humans or machines.”

(Uhlir & Cohen in Borgman, 2015, p. 19)


Critical questions

• What are the chief legal, ethical, and policy issues triggered by Big Data (and Little Data)?
• What best practices can be identified to address these kinds of legal, ethical, and policy issues?
• What are the roles that information professionals and research libraries can and will assume in contributing to considerations of the legal, ethical, and policy issues raised?
• What are the competency implications in terms of the knowledge, skills, and abilities libraries need to acquire or develop for the Big Data world?


“The healthcare field generates an enormous amount of data every day. There is a need, and opportunity, to mine this data and provide it to the medical researchers and practitioners who can put it to work in real life, to benefit real people. Many organizations can fulfill part of this process, but none of them are equipped to begin with raw data, develop an idea and move that idea directly into a practice setting.”

What roles can information professionals and research libraries play in such endeavors?

World-class CS/machine learning

Medical + research + expertise

Deep data, clinical setting, commercialization


Background

“A major national health resource”

• Registered charity
• Est. by Wellcome Trust, MRC, Dept. of Health, Scottish Gov., and NW Regional Dev. Agency; funded by Welsh Dev. Agency, BHF, and Diabetes UK
• Hosted by U. Manchester, supported by NHS
• Open to bona fide researchers anywhere in the world, including those funded by academia and industry
• Aims to improve prevention, diagnosis and treatment of life-threatening illnesses
• Recruited 500,000 people aged 40-69 in 2006-2010
• Participants have undergone measures, provided blood, urine and saliva samples, and detailed personal information
  – and agreed to have their health followed

“…to help scientists discover why some people develop particular diseases and others do not”


Best Ethical Practice?

UK Biobank wants to be “a model not only for best science but for best ethical practice too, in relation to these big biobank projects”

Professor Roger Brownsword, Chair (2011-2015), UK Biobank Ethics and Governance Council (UKEGC)
http://www.ukbiobank.ac.uk/ethics/

What are some of the “best science” and “best ethical practice” lessons that can be learned from UK Biobank?


Big Data Europe

Who is Big Data Europe for?

➢ “Small, Medium and large-sized entities coming from any sector within industry, research or the public sector, that have much to gain from making sense of large volumes of data (of both static or dynamic nature, and from various sources) to realise new and innovative use-cases, not just within their domain but also across different sectors”
➢ 16 European partners at present, representing a diverse range of academic, for-profit, and government entities in 10 countries

Big Data partnership projects – A key question

➢ Given current political uncertainties (e.g., BREXIT), what can be done to ensure stability and continuity of Big Data partnerships (like Big Data Europe), while providing leeway for accommodating changes and course corrections that may be periodically warranted?


About PGP

Harvard PGP is “an open science research project … designed to create public scientific resources that everyone can access by bringing together genomic, environmental, and human trait data donated by our participants”

• Founded at Harvard Medical School in 2005, now a Global Network involving Canada (University of Toronto), the UK (UCL) and Austria (Austrian Academy of Sciences)
• Harvard PGP is staffed by a small, largely volunteer group of researchers, engineers, and ethicists who are all pioneers in their fields.
• Members of the Global Network follow a common set of guidelines, but the quantity and quality of information on national sites varies significantly

“Privacy, confidentiality and anonymity are impossible to guarantee in a ... research study where public sharing of genetic data is an explicit goal”


Pretty Good Privacy?

Guidelines of the Global PGP Network

a) Public Data. Participants are invited to share genomic and trait data using a CC0 waiver
b) Non-anonymous. Risks of participant re-identification are addressed up front as part of the consent and enrollment process
   – Neither anonymity nor confidentiality of their data is promised to participants
c) Equal access. Participants are given timely and complete access to their individual data, i.e., raw data and not just summary results, “where feasible”
d) Oversight. Each member must maintain current Institutional Review Board [Research Ethics] or local equivalent approval
e) Not for profit. Managed or sponsored by a non-profit organization (or local equivalent).
   – A member shall not sell or license participant data or tissues “other than purposes of reasonable cost recovery”


Precision Medicine Initiative

• Launched by President Obama in his January 2015 State of the Union address
• Aims to leverage advances in genomics, emerging methods for managing and analyzing large data sets, and health ICTs to accelerate biomedical discoveries
  – while protecting privacy
• Plans to enroll one million or more volunteers and may include children

“committed to engaging multiple sectors and forging strong partnerships with academic and other non-profit researchers, patient groups, and the private sector to capitalize on work already underway”


Big projects, Big problems

➢ Very large scale
➢ Interdisciplinary
➢ Human subjects
➢ Inter-state/international/global
➢ Multiple jurisdictions
➢ Cross-sector partners (public/private)
➢ Cultural differences


Legal issues arising from Big Data

Compliance with
➢ Privacy laws
➢ Data protection/security laws
➢ Genetic information laws
➢ Freedom of information
➢ Right to be forgotten
➢ Intellectual property
   e.g., patenting of human genes/synthetic human genes cf. EU and US (Myriad Genetics case, 2013)
➢ Licensing and contractual issues
➢ Publishing


Ethical issues arising from Big Data

➢ Privacy
  – of donors
  – how to comply with privacy laws of different nations/groups
➢ Maintaining anonymity of specimen donors
  – protection against bad actors, e.g., cybercriminals, hacktivists
  – triangulation of data from multiple sources used to circumvent anonymization of donors
➢ Monetization, Commodification
  – selling of health data to commercial interests
  – use of indigenous knowledge/traditional knowledge
  – should specimen donors share in any potential profits?


Ethical issues arising from Big Data

➢ Peaceful/Public Good/Public Interest uses vs. Military/National Security uses vs. Terrorist applications
  – who will determine the societally acceptable/desirable uses and applications for health data/big data?
➢ Psychological well-being/Informed consent of donors
  – fully advising donors of their rights and of the obligations of the respective data-gathering and data-using entities to donors
  – taking account of the best interests of donors in making their data available to them
➢ Solicitation of specimen donors for participation in studies
  In 2015 the UK Biobank Ethics and Governance Council faced a policy issue over its proposed use as a recruitment platform by researchers who wanted to identify people for a separate study


“…a precedent-setting case”

• Researchers wanted to use UK Biobank to identify people to invite into a separate study
• They asked UK Biobank to send an introductory email to its participants pointing to the website of the new study
• Offering such a recruitment mechanism could benefit the research community
  – But take time and resources that could be used elsewhere
• In what circumstances would it be acceptable for Biobank to divert resources in this way?
  – How should ad hoc third-party re-contacts be accommodated?
• UKBEGC proposed two options
  – Create a dedicated webpage to provide neutral information about (approved) studies
  – Provide a withdrawal category allowing Biobank participants to opt out of email invitations

The project was approved as a pilot subject to fitting with Biobank’s timetable of re-contacts and will be used to draw up a framework for future requests

UK BIOBANK ETHICS AND GOVERNANCE COUNCIL ANNUAL REVIEW 2015


Policy issues arising from Big Data

• How and by whom will health data/big data be preserved and made retrievable for and by future stakeholders?
• What guidelines and requirements are needed for publishing related to health data/big data?
• Who needs to have a voice in policy-setting and policy-making, and who should craft the governing policies and codes of ethics?
  ☞ Given the pace of change, how often should policies and codes be reviewed and updated?
• What oversight and enforcement mechanisms are needed to ensure compliance?
  ☞ What are the penalties for piracy of health data or malfeasance, negligence, willful blindness, and harmful impacts on human subjects?
  ☞ What protections are available or need to be developed and codified for whistleblowers who report lapses and breaches of compliance?


Library Roles and Competencies

• Data are forms of information requiring stewardship
  – like the many other knowledge resources libraries manage
• Big data 2.0 initiatives pose particular challenges
  – because of their scale, variety, complexity, and openness
• Libraries are well positioned to assume a proactive role
  – building on their existing work in scholarly communication
• Potential roles for libraries in the big data arena require professional, technical, organizational, managerial, personal, and interpersonal knowledge, skills, and abilities
  – including expertise associated with other professions and enhanced competencies in relationship management


Potential Library Roles in Open Domains

Types: Open Content, Open Process, Open Infrastructure

Domains: OA, OData, OER, OBib, OSS, OD, OEP, OPR, OSci, OI, OStd, OSys

Roles: Use, Educate, Advocate, Facilitate, Mediate, Collaborate, Coordinate, Integrate, Lead

(Corrall, 2016, in press)


References

Bertot, J., Butler, B., & Travis, D. (2014). Local big data: The role of libraries in building community data infrastructures. dg.o 2014: Proceedings of the 15th Annual International Conference on Digital Government Research (pp. 17-23). doi:10.1145/2612733.2612762.

Bieraugel, M. (2013). Keeping up with... big data. ALA, ACRL. Retrieved from http://www.ala.org/acrl/publications/keeping_u_with/big_data.

Borgman, C. L. (2015). Big data, little data, no data: Scholarship in the networked world. Cambridge, MA: MIT Press.

Campbell, D. G., & Cowan, S. R. (2016). The paradox of privacy: Revisiting a core library value in an age of big data and linked data. Library Trends, 64(3), 492-511. doi:10.1353/lib.2016.0006.

Gordon-Murnane, L. (2012). Big data: A big opportunity for librarians. Online, 36(5), 30-34.

Hoy, M. B. (2014). Big data: An introduction for librarians. Medical Reference Services Quarterly, 33(3), 320-326. doi:10.1080/02763869.2014.925709.

Huwe, T. K. (2014, March). Big data and the library: A natural fit. Computers in Libraries, 34(2), 17-18.


Lyon, L., & Brenner, A. (2015). Bridging the data talent gap: Positioning the iSchool as an agent for change. International Journal of Digital Curation, 10(1), 111-122. doi:10.2218/ijdc.v10i1.349.

Lyon, L., Acker, A., Mattern, E., & Langmead, A. (2016). Applying translational principles to data science curriculum development. iPres 2015: Proceedings of the 12th International Conference on Digital Preservation (pp. 109-117). Retrieved from https://phaidra.univie.ac.uk/view/o:429552.

Reinhalter, L., & Wittmann, R. J. (2014). The library: Big data's boomtown. The Serials Librarian, 67(4), 363-372. doi:10.1080/0361526X.2014.915605.

Tattersall, A. (2016). Big data – what is it and why it matters. Health Information & Libraries Journal, 33(2), 89-91. doi:10.1111/hir.12147.

Teets, M., & Goldner, M. (2013). Libraries' role in curating and exposing big data. Future Internet, 5(3), 429-438. doi:10.3390/fi5030429.

Any Questions?

Sheila Corrall scorrall@pitt.edu

Kip Currier [email protected]

Department of Information Culture & Data Stewardship

School of Information Sciences