rcsb protein data bank advisory committee€¦ · rcsb protein data bank advisory committee...

Post on 12-May-2020

6 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

RCSB Protein Data BankAdvisory Committee

Teleconference Monday November 19, 2018

Meeting Participants§ AdvisoryCommittee• Participating:CynthiaWolberger(Chair),PaulAdams,PeterAndolfatto,JudyBlake,AndyByrd,BridgetCarragher,Wah Chiu,KirkClark,PaulCraig,RolandDunbrack,CathyPeishoff,SueRhee,Torsten Schwede,JillTrewhella

• Absent:RobertB.Darnell,PaulFalkowski,ThomasFerrin,AndrejSali*

§ RCSBPDB• Rutgers:StephenK.Burley,HelenM.Berman,JohnWestbrook,JasmineYoung,ChristineZardecki

• UCSD:ColeH.Christie

*Sali/UCSFwillformallyjoinRCSBPDBin2019 1

Highlights: 2017 - Present

13,049 structuresdeposited into the PDB

New structures added to the archive for a total of 136,472 entries

Over 1 million unique users served

>679 million data files downloaded from wwPDB

web and FTP sites

Annual IQB Boot Camp Single Particle Cryo-Electron Microscopy

Rutgers Undergraduate Course on Antimicrobial Resistance

wwPDB Summit

Molecule of the Month on Biodegradable

Plastic

wwPDB AC Meeting

RCSB PDB AC Meeting4th Annual Video Challenge Results Molecular View of Diabetes Treatment and Management

Year in the Life of the RCSB PDB Community

IQB Crash Course: Anti-cancer Immune

Checkpoint Therapies

3

Responses to 2017 RCSB PDB AC ReportCommitteestronglyencouragestheRCSBleadershiptousetherenewalasanopportunitytoexplainthesignificanceofeachactivityandhowasanintegratedwholetheyaddresstheneedsoftheresearch,industryandeducationcommunities.

Seriesofposters/flyersdocumentingRCSBPDBimpactandsupportforfederalfundingagencygoals

RCSBPDBimpactanalysespublishedinProteinScience,ScientificData

PDBImpactonRecentUSFDADrugApprovalsnowinpressatStructure

TheCommitteealsoencouragestheRCSBPDBtoaggressivelypursuenewsourcesofsupportbyapproachingprivatefoundations,pharmaceuticalcompaniesandNIHinstitutesthatutilizeRCSBPDBresourcesbutdonotcurrentlyprovidefunding.

Ongoing;Conversationsinitiatedwith• HHMI• NCI• SciencePhilanthropyAlliance• ScienceGatewaysCommunityInstitute

4

RCSB PDB: Four Interoperating Services

CustomerServiceHelpDeskandITSupport

5

Deposition/Biocuration

Archive Management/Access

1 2

DataExploration

3 4

Outreach/Education

• Deposition• Validation• Biocuration

• Datastandards• Dataintegration• Datastorage• Dataaccess

• Portal• Search• Browse• 3Dvisualization

• PDB-101

Deposition/Biocuration

Archive Management/Access1 2 Data

Exploration3 4 Outreach/Education

1. Deposition/Biocuration in 2017§ Ontrackfor~12,100depositionsin2018

§ 3DEMgrowthcontinuingin2018

Method 2017Depositions

2016Depositions

MX 11,889(91.2%)

10583

NMR 460(3.5%) 474

3DEM 658(5.0%) 531

Other 44(0.3%) 27

6

48%

31%

21%

2017ProcessingSites

PDBj

PDBe

RCSB PDB

30%

38%

21%

1%3% <1%

7%

2017DepositorLocationsNorth America

Europe

Asia

South America

Oceania

Africa

Commercial

1. Deposition/Biocuration in 2018§ OneDep• ORCiD nowmandatory• Biocuration moreefficient• SupportsSFX/XFELentries• Bettersoftwaremanagementvia GitHub

§ CarbohydrateRemediation• Collaborationwithglycoscience community

• Projectannounced atwwpdb.org

• PDBx/mmCIF Dictionaryextension and example filesavailable via GitHub

7

400450500550600650700750800

2009

2010

2011

2012

2013

2014

2015

2016

2017

#ofEntriesProcessed

Year

NewStructures/Biocurator

OneDep launched*

1. Deposition/Biocuration in 2019§ LigandValidationenhancement§ NMRRestraintValidationimplementation§ Author-initiatedCoordinatereplacement§ Carbohydrateremediation§ ChemicalComponentversioning§ OngoingDeposition/Biocuration efficiencyimprovement

8

Master Copy ofPDB FTP

Repository

Depositors

DepositionBiocurationValidation

Archive keeping; Release coordination

Data Integration ServicesExchange DB

External Data Resources

OneDep

Archive Management/Access

Sequence &3D ClusteringServices

PDB Archive Data External Data

Sequence and 3D Data

RCSB PDB Copy of PDB

FTP Repository

GraphQL/REST APIs

ftp/RSYNCServices

Data Exploration

RCSB PDB Content Delivery Network

rcsb.org

Search Aggregator APIs

pdb101.rcsb.org

Data harvesting; Pre-deposition validation

Outreach/Education

Users

Programmers;External data resources

Researchers

Students; Teachers

Archive replicators; Power users; External data resources

Search Services

RCSB PDB Data Architecture Redesign

PDBx/mmCIF Data Schema Throughout!9

2. Archive Management/Access in 2018§ ExtendedPDBx/mmCIF dataschemaacrossallfourRCSBPDBservices

§ IntegratedArchiveManagement/AccessandDataExplorationbydevelopingnewAPIs(ApplicationProgramInterface)andWebServices

§ Legacysearchanddatadeliveryinfrastructurereplacedbycloudfriendlytechnologies(inbeta)• Searchindexingandsuggestions(ApacheSolr)• Archivingservices/updatestransitionedtoadistributedobjectstore(MongoDB)

• DataAccessservicestransitionedtoGraphQL API• Specializedsearch(Sequence&3D)featuresre-packagedasindependentWebServices

10

2. Archive Management/Access in 2019§ UpgradeArchiveManagementdatastoragesystem§ ContinuetoproductionizenewservicearchitectureinsupportofthenewRCSB.org websitedesignandexpandedprogrammaticdataaccess

§ Continuetargetedremediation(carbohydrates)andextendeddataintegration(PubChem&CARD)

§ Continuecloudmigrationoftheweeklyupdateoperations

§ MigrateservicestoamoreportablepackagingusingDocker

11

3. Data Exploration in 2017RCSB.org Users§ >395,000monthly,>1millionannually

§ 3%annualgrowthinnon-bounceuniqueusers

RCSB.org Sessions§ 35%growthsince2010§ Highaveragesessionduration(~6minutes)

§ Lowfractionof0-second“bounce”sessions

GlobalPDBDataDownloads§ Total:679,421,200total• FTP:454,723,083• Websites:224,698,117

12

Potassium Channel (PDB 1bl8)Doyle et al. (1998) Science 280, 69-77

Frequently access structure––Structure data downloaded ~281K times since 2007

Cited >4700 times

3. Data Exploration in 2018§ Solr textsearchfunctionalityimplementedonRCSB.org (pilotedonPDB101.RCSB.org)

§ NewNGLvisualizationfeatures• Electrondensitymaps• Ligand-proteininteractions• Validationreportin3D

§ Newwebsitearchitecturedesigned/developed• Improvesspeedandscalingofexistingservices• Acceleratessoftwaredevelopmentofnewservices

13

3. Data Exploration in 2019§ NewwebsitedesignutilizingAPIsfordeliveryofdatatoRCSB.org users

§ SameAPIssupportingprogrammaticaccesstoRCSB.org dataforpowerusers,externalresources

§ Newwebsitecapabilitiessupporting• Enhancedsearching(Solr plusotherdatatypes)• AutoSuggest,DrillDown,andAdvancedSearch• TabularReporting• BatchDataDownload

§ Mol*(mol-star)communitygraphicslibraryforincreasing/extendingNGLcapabilities

14

4. Outreach/Education in 2017/2018/2019§ >620KPDB-101Usersin2017

§ HealthFocus:Diabetes,AntibioticResistance• VideoChallenge• Curricularmaterials• GlobalHealthResources

15

What is an Enzyme?>142K views since 2017

>313K views since 2017

HPV

Any Questions About Recent Milestones?

16

wwPDB AC Meeting November 2, 2018§ IntroducedChair-Elect(PeterRosenthal,UK)

§ Reviewed2017metrics

§ Reviewed2017/2018progressversus goals

§ DescribednewwwPDBorganizationalstructure

§ ExplainednewfeaturesofrevisedwwPDBCharter(totakeeffectJanuary1st 2019)

§ Outlined2018/2019goals

§ Obtainedconcurrenceonvariouspolicymatters

§ ThankedoutgoingChair(AndyByrd,US)

17

New wwPDB Organizational Structure

18

CORE ARCHIVES

PDBBMRBEMDB

EMPIAR

SASBDB

MX Images

CORE MEMBERSRCSB PDB

PDBePDBj

BMRB EMDB

FEDERATED RESOURCES

wwPDB Core Archives

Definition:AwwPDB“CoreArchive”isaglobalstructuralbiologydataresourcejointlymanagedbywwPDBCoreMembers.

§ CurrentwwPDBCoreArchives:• PDBCoreArchive:3DStructureDataResourcehousingmultiscale/atomicstructuralmodelsplusmoleculardataandmetadata,MXexperimentaldataandmetadata,andotherexperimentaldata.(ArchiveKeeper:RCSBPDB)

• BMRBCoreArchive:BiomolecularNMRDataResource housingmoleculardataandmetadata,NMRexperimentaldataandmetadata,andotherexperimentaldata.(ArchiveKeeper:BMRB)

§ NextCoreArchiveexpectedtojoinwwPDB:• EMDBCoreArchive:MolecularandCellularEMDataResource housingmolecular/biologicaldataandmetadata,experimentalelectricpotentialmapdata,andotherexperimentaldata.(ArchiveKeeper:EMDB) 19

Any Questions About wwPDB AC?

20

Discussion Topics

21

Urgent Matters§ Fundraising:Othersuggestions?

§ MembershipTransitions• Chair:2019- 2021• NSFReviewPanelsuggestedinclusionofadditionaldimensionsofdiversity,especially...membersfromunderrepresentedcommunities,andexperienceindiverseorganizationtypes

§ RCSBPDBACMeetingschedule2019andbeyond

22

Membership Transitions§ NewChair:PaulD.Adams• DivisionDirectorMolecularBiophysics&IntegratedBioimagingLawrenceBerkeleyNationalLaboratory

§ NewMember:Mandë Holford• AssociateProfessorDepartmentofChemistry andBiochemistryHunterCollegeBelfer ResearchBuildingandCUNYGraduateCenter

• ResearchAssociateSacklerInstitutefor ComparativeGenomicsInvertebrateZoologyAmericanMuseumof NaturalHistory

23

RCSB PDB AC Meeting Schedule

24

§ OurFallMeetingsconflictwithwwPDBAC

§ Proposedmeetingplanfor2019-2023• Spring2019• TargetWindowMondayApril1- ThursdayApril4

• Spring2020• PDB50in2021• RepeatWashington,DCareameetingtoenableprogramofficerparticipationinSpring2022

Planning ongoing for PDB 2021

25October 20, 1971Nature New Biology

Many Thanks to the RCSB PDB AC§ Commentsontherenewalproposalweremuchappreciated

§ FeedbackonourMay2018SiteVisitpresentationsalsoprovidedsignificantbenefit

§ Lookforwardtoyourongoingfeedbackon• New2019RCSB.org websitedesign• SFX/XFEL• 3DEM(single-particleandtomography)• Integrative/HybridMethods

§ Yourhelpwithfundraisingactivitiesgoingforward

26

Many Thanks to Cynthia Wolbergerfor 10 Years of Advice and Support

27

RCSB PDB AC member since 2009 RCSB PDB Chair, 2013-2018

Celebration of Open Access in Structural Biology Symposium, 2013

RCSB PDB AC 2010

Join the RCSB Protein Data Bank at University of California San Diego

Open Positions:

Postdoctoral Fellows

The Challenge:Develop innovative analysis, integration, query, and visualization tools for 3D biomolecular structures to help accelerate research and training in biology, medicine, and related disciplines.

RCSB PDB Team

29

RCSB PDB is funded by a grant (DBI-1338415) from the National Science Foundation, the National Cancer Institute, the National Institute of General Medical Sciences, and the US Department of Energy

RCSB PDB is a member of the Worldwide Protein Data Bank partnership (wwPDB; wwpdb.org)

RCSB.ORGinfo@rcsb.org

Funding

Management

Follow us

RCSB PDB is hosted by:

Executive Session

30

top related