parsalliance conference 20071115 1 ©john womersley/keith jeffery/stfc developing tomorrow’s...
TRANSCRIPT
©John Womersley/Keith Jeffery/STFC PARSAlliance Conference 20071115 1
Developing Tomorrow’s Infrastructure for
Science
John WomersleyDirector, Science Strategy
Science and Technology Facilities Council
presented by:Keith Jeffery
Director, IT & International StrategyScience and Technology Facilities Council
©John Womersley/Keith Jeffery/STFC PARSAlliance Conference 20071115 2
Overview
1. Some STFC Science
2. Tomorrow’s Digital Infrastructure for Science
3. Supporting the Research Lifecycle
4. Some Policy Frameworks
5. Conclusion
©John Womersley/Keith Jeffery/STFC PARSAlliance Conference 20071115 3
What is STFC?
The Science and Technology Facilities Council (UK) Created on April 1, 2007 It is responsible for
– fundamental research in particle physics, nuclear physics, astronomy, space
– major UK facilities for the physical and life sciences synchrotrons, light sources, lasers, neutrons
– national laboratories at RAL, Daresbury, UKATC– international science projects
CERN, ESO, ESA, ILL, ESRF… Over 2000 staff and an annual budget of over £700M
©John Womersley/Keith Jeffery/STFC PARSAlliance Conference 20071115 4
The Science we Address
Some examples
Why is there a universe? What is the origin of mass? Was there ever life on Mars? How are the chemical elements created? How can we design better treatments for cancer? How do cells work? How can we create new materials to store energy?
©John Womersley/Keith Jeffery/STFC PARSAlliance Conference 20071115 5
STFC Facilities
data
ComputingAnalysisModelling
knowledgebeam
sample
Imaging detector
Neutrons and photons Provide complementary views of matter:
Photons “see” electric charge – high atomic number nuclei
Neutrons “see” nucleons – especially hydrogen atoms
©John Womersley/Keith Jeffery/STFC PARSAlliance Conference 20071115 6
Some STFC Projects
ESA centre
ISIS TS2 phase 3
Diamond phase 3
Sapphire
Materials Innovation Institute
©John Womersley/Keith Jeffery/STFC PARSAlliance Conference 20071115 7
Some STFC Projects
ESRF upgrade
4GLS
Hartree CentreComputational Science
©John Womersley/Keith Jeffery/STFC PARSAlliance Conference 20071115 8
Some STFC Projects
HIPER
Future neutron sources:ESS/MW neutron sourceILL 20/20 upgrade
ELI
DIPOLE laser
©John Womersley/Keith Jeffery/STFC PARSAlliance Conference 20071115 9
Some STFC Projects
European ELT
SKA
Next generationGravitational waveobservatory
FAIR
©John Womersley/Keith Jeffery/STFC PARSAlliance Conference 20071115 10
Some STFC Projects
Neutrino factory
International Linear Collider
LHC upgrades
Underground scienceNeutrinos, dark matter
©John Womersley/Keith Jeffery/STFC PARSAlliance Conference 20071115 11
XFEL
Project “launched” on 5th June– This means DESY is now authorised to spend
XFEL GmbH to be set up by end of year Our goal is to maximise our in-kind contributions within the
£30M already allocated in LFCF– Pixel detector, streak camera…
©John Womersley/Keith Jeffery/STFC PARSAlliance Conference 20071115 12
Part 2Tomorrow’s Digital
Infrastructure for Science
©John Womersley/Keith Jeffery/STFC PARSAlliance Conference 20071115 13
The 7 C’s
Creation Collection Capacity Computation Curation Collaboration Communication
©John Womersley/Keith Jeffery/STFC PARSAlliance Conference 20071115 14
Its all about scale
Creation: Examining the
detector arrays on the MAPs spectrometer at ISIS
©John Womersley/Keith Jeffery/STFC PARSAlliance Conference 20071115 15
Its all about scale
Collection: An ATSR
image of Sicily with Mount Etna eruption; taken 24 July 2001
©John Womersley/Keith Jeffery/STFC PARSAlliance Conference 20071115 16
Its all about scaleEstd Data Storage CCLRC to 2010
0
2000
4000
6000
8000
10000
12000
14000
16000
18000
20000
2003-04 2005-06 2007-8 2009-10
Year
Vol
ume
(TB
)
CSE
BADC
E-SCI
Diamond
PP
External
Total (TB)
Cum Total (TB)
Capacity:
eg at CCLRC
20PB by 2010
1PB = 1015 Bytes
Billions of Floppys
Millions of CDs
Thousands of PCs
(today’s)
©John Womersley/Keith Jeffery/STFC PARSAlliance Conference 20071115 17
Its all about scale
Computation: 3-D rabbit heart
MRI rendered at 512 x 512 x 1400 using 12 GPUs
Data needs interpretation and analysis
Picture of heart
©John Womersley/Keith Jeffery/STFC PARSAlliance Conference 20071115 18
Its all about scale
Curation: Some CCLRC based Repositories
– The Atlas Datastore
– The British Atmospheric Data centre
– The CCLRC Data Portal
– The CCLRC Publications Archive
– The CCPs (Collaborative Computational Projects)
– The Chemical Database Service
– The Digital Curation Centre
– The EUROPRACTICE Software service
– The HPCx Supercomputer
– The JISCmail service
– The NERC Datagrid
– The NERC Earth Observation Data Centre
– The Starlink Software suite
– The UK Grid Support Centre
– The UK Grid for Particle Physics Tier 1A
– The World Data Centre for Solar-Terrestrial Physics
Atlas Datastore Tape Robot
©John Womersley/Keith Jeffery/STFC PARSAlliance Conference 20071115 19
Its all about scale
Collaboration: Barrel toroid magnet
and detector module from ATLAS at CERN
ATLAS: 2000 scientists 150 Universities 30 countries
©John Womersley/Keith Jeffery/STFC PARSAlliance Conference 20071115 20
It’s all about scaleCommunication: “The web has changed
everything...”
Technology enables:– access to everything
distributed,searchable information sources
Interlinking enables:– Revalidation of results
‘repeat experiment’
Discovery enables:– new knowledge from old
Archiving enables:– Recording unique events
Antarctic environmental data
CCLRC’s “e-pubs” Institutional Repository has records of 20,000 publications spanning 20 years
©John Womersley/Keith Jeffery/STFC PARSAlliance Conference 20071115 21
Part 3 Supporting
the Research Lifecycle
©John Womersley/Keith Jeffery/STFC PARSAlliance Conference 20071115 22
The Body of Knowledge
The GovernmentProcess
The ResearchProcess
Aggregation of Knowledge lies at the heart of the innovation lifecycle
Enabling Knowledge Creation
Enabling Wealth Creation
Quality Assessment
Strategic Direction
Improved Quality of Life
Improved Understanding
The Innovation Lifecycle
©John Womersley/Keith Jeffery/STFC PARSAlliance Conference 20071115 23
The Body of Knowledge
The Information Infrastructure
Creation
Archival
Access
Storage ComputeNetwork
Services
Curation
the researcher actsthrough ingest and access
Virtual Research Environment
the researcher shouldn’t have to worry about the information infrastructure
Information Infrastructure
©John Womersley/Keith Jeffery/STFC PARSAlliance Conference 20071115 24
Current View
Raw DataData Analysis
Analysed Data
Publication Data
Publications
Facility 1
Raw DataData Analysis
Analysed Data
Publication Data
Publications
Facility 2
Raw DataData Analysis
Analysed Data
Publication Data
Publications
Facility 3
Distinct Infrastructures / Distinct User Experiences
©John Womersley/Keith Jeffery/STFC PARSAlliance Conference 20071115 25
Future View
Raw DataData Analysis
Analysed Data
Publication Data
Publications
Facility 1
Raw DataData Analysis
Analysed Data
Publication Data
Publications
Facility 2
Raw DataData Analysis
Analysed Data
Publication Data
Publications
Facility 3CapacityStorage
Publications Repositories
Standards/Converters
Data Repositories
Raw Data Catalogue
Data Analysis
Analysed Data Catalogue
Publication Data Catalogue
Publications Catalogue
Common Infrastructure / Common User Experience
©John Womersley/Keith Jeffery/STFC PARSAlliance Conference 20071115 26
Part 4
Some Policy Frameworks
©John Womersley/Keith Jeffery/STFC PARSAlliance Conference 20071115 27
Some Policy Frameworks
– UK Research Councils’ initiative on access to research outputs 2005 and 2006 statements of principles
– OECD Guidelines on Access to Research Data 2004 Declaration, 2007 Guidelines
– UK Office of Science and Innovation Report (2006) Developing the UK’s e-infrastructure for science and innovation
©John Womersley/Keith Jeffery/STFC PARSAlliance Conference 20071115 28
RCUK Policy (2005, 2006)
Four principles:
Ideas and knowledge derived from publicly-funded research are made available and accessible for public use, interrogation, and scrutiny, as widely, rapidly, and effectively as practicable
Effective mechanisms are in place to ensure that published research outputs are subject to rigorous quality assurance, through peer review
The models and mechanisms for publication and access to research results are both efficient and cost-effective in the use of public funds
The outputs from current and future research can be preserved and remain accessible not only for the next few years but for future generations
©John Womersley/Keith Jeffery/STFC PARSAlliance Conference 20071115 29
OECD Recommendation (2006)OECD Recommendation on Access to
research data from public funding
13 principles:A – Openness Openness means access on equal terms for the international research
community at the lowest possible cost, .... B – Flexibility, C – Transparency, D – Legal conformity, E –
Protection of intellectual property, F – Formal responsibility, G – Professionalism
H – Interoperability Technological and semantic interoperability is a key consideration in
enabling and promoting international and interdisciplinary access to and use of research data. ...
I – Quality, J – Security, K – Efficiency, L – Accountability M – Sustainability ... taking administrative responsibility for the measures to guarantee
permanent access to data that have been determined to require long-term retention.
http://webdomino1.oecd.org/horizontal/oecdacts.nsf/Display/3A5FB1397B5ADFB7C12572980053C9D3?OpenDocument
©John Womersley/Keith Jeffery/STFC PARSAlliance Conference 20071115 30
OSI e-Infrastructure Steering Group
“Developing the UK’s e-infrastructure for science and innovation”
– Cross departmental view– 6 working groups:
1. Data and Information creation2. Preservation and curation3. Search and navigation4. Virtual research communities5. Networks, compute power and storage hardware6. Middleware, AAA and digital rights management
– Reports available on UK National eScience Centre Website
http://www.nesc.ac.uk/documents/OSI/index.html
– Note here report on Data and Information Creation
©John Womersley/Keith Jeffery/STFC PARSAlliance Conference 20071115 31
OSI e-Infrastructure Steering Group Data and Information Creation
Key findings:
1. The future e-infrastructure should directly support the management of data throughout its lifecycle ’from cradle to grave’
2. The future e-infrastructure should reduce the cycle time from conducting research, through analysis, publication and feedback into new research
3. There should be a much greater use of simulation-based research and its much closer integration with physical research
4. The future e-infrastructure should support the use for research purposes of data collected for other purposes
5. The future e-infrastructure should be based upon standards which support uniform classification, integration, certification and citation of data across all sources
©John Womersley/Keith Jeffery/STFC PARSAlliance Conference 20071115 32
Conclusion
STFC has massive holdings of data and information The benefits of ready, online, open access to research
are self-evident – wealth creation, improvement in quality of life
The data and information requires:– Preservation: making it available indefinitely– Curation: making it understandable indefinitely
This implies use of metadata– Needs to be ‘more intelligent’ (semantics on syntax)– Needs standards (for interoperation)
This is what the PARS Alliance is all about
©John Womersley/Keith Jeffery/STFC PARSAlliance Conference 20071115 33
TheThe
EndEnd