Social Science Datasets
November 2011
John Kaye – Social Sciences Dataset Lead
http://www.slideshare.net/johnkayebl
2
What is a dataset?
Seismic measurements taken by a geologist.
Genetic data collected by a medical researcher.
A survey of public opinions collected by a sociologist.
3
The Foundation for Research
Data is a crucial component of the scholarly record.
Re-acquisition may be impossible
Datasets are essential to the British Library’s mission to advance the world’s knowledge.
4
The British Library Datasets Programme
We envision a future where researchers can:
Discover, access, reuse, and reference datasets.
Track the impact of the data that they generate and receive appropriate credit.
Our approach is to:
Provide a focus for the community to establish needs, requirements and agreement.
Explore novel technology and creative solutions.
5
Datasets in Explore The British Library
6
Explore The British Library (Portals)
7
Explore The British Library (Portals)
8
British Library Resource Guides
Topical Bibliographies and Dataset Resource Guides are currently in production
http://www.bl.uk/reshelp/findhelpsubject/socsci/topbib/bibliographies.html
Quantitative methods in social research
Management and Business Studies
Coming soon
Free GIS resource guide
Local Area Statistics
UK government data
Sport and Society
9
Economic and Social Data Service - (ESDS)
Data search and download
Research method guides
Thematic guides
Online analysis
10
Economic and Social Data Service - (ESDS)
http://www.esds.ac.uk/
ESDS Government: large-scale government surveys, such as the Labour Force Survey and the General Household Survey
ESDS International: multi-nation databanks, such as the World Bank's World Development Indicators, and survey data including Eurobarometer
ESDS Longitudinal: major UK surveys following individuals over time, such as the British Household Panel Survey
ESDS Qualidata: a range of multimedia qualitative data sources
11
Other Sources of Data – EDINA - Spatial Data
Go Geo! Search
http://www.gogeo.ac.uk/cgi-bin/index.cgi
EDINA Digimap and UK Borders
http://edina.ac.uk/digimap/
http://edina.ac.uk/ukborders/
12
Other Sources of Data – Other Spatial Data
Ordnance Survey Open Data
http://www.ordnancesurvey.co.uk/oswebsite/products/os-opendata.html
Landmap
http://landmap.mimas.ac.uk/
13
Census Dissemination Unit
http://cdu.mimas.ac.uk/
1971-2001 Census statistics - http://casweb.mimas.ac.uk/
Experian Geodemographic Data http://cdu.mimas.ac.uk/experian/index.htm
Infuse 2001 Census analysis tool http://infuse.mimas.ac.uk/
Geoconvert – Postcode Data http://geoconvert.mimas.ac.uk/
14
UK Government Open Data
http://data.gov.uk/
Admin and Statistical data portal
Office for National Statistics
http://www.statistics.gov.uk/default.asp
http://www.neighbourhood.statistics.gov.uk/dissemination/
https://www.nomisweb.co.uk/Default.asp
National Digital Archive of Datasets
http://www.ndad.nationalarchives.gov.uk/
Regional Government
http://data.london.gov.uk/
15
Other Sources of Data – International Organisations
United Nations
http://data.un.org/
European Union
http://epp.eurostat.ec.europa.eu/portal/page/portal/eurostat/home/
OECD
http://www.oecd.org/statsportal/
World Bank
http://data.worldbank.org/
IMF
http://www.imf.org/external/data.htm
16
Examples of Other Sources of Data
Arts and Humanities Data Service (AHDS)
http://ahds.ac.uk
Guardian Data Store
http://www.guardian.co.uk/data-store
Financial Times
http://www.ft.com/home/uk
Economist Intelligence Unit
http://www.eiu.com/Default.aspx
Web Archive
http://www.webarchive.org.uk/ukwa/
17
Analysis Tools and Software
Statistical - SPSS, Stata, R (open source)
GIS - ArcGIS, MapInfo, Quantum GIS (open source)
Excel
Online Tools
18
Examples of Online Analysis Tools
ESDS NESSTAR
http://nesstar.esds.ac.uk
ESDS Spatial Tools
http://www.ccsr.ac.uk/esds/gis/
Economists Online Dataverse
http://dvn.iq.harvard.edu/dvn/dv/NEEO
United Nations
http://data.un.org/Explorer.aspx
London Profiler
http://www.londonprofiler.org/
London Heat Map
http://www.londonheatmap.org.uk/Mapping/
19
Online Mapping Tools using Google Maps
MapTube
http://www.maptube.org/
Google Fusion Tables
http://www.google.com/fusiontables/Home/
Gmap Creator
http://www.casa.ucl.ac.uk/software/gmapcreator.asp
20
Data Visualization
Presenting data in a useful and interesting manner
Allowing concepts to be easily understood
Many examples are available online, e.g.:
http://flowingdata.com/
http://datavisualization.ch/
http://www.guardian.co.uk/news/datablog
21
Citing Data
22
DataCite
DataCite is an international consortium which aims to:
Establish easier access to research data on the Internet
Increase acceptance of research data as legitimate, citable contributions to the scholarly record
Support data archiving that will permit results to be verified and re-purposed for future study
http://datacite.org/
23
Digital Object Identifiers (DOIs) offer a solution
Most widely used identifier for scientific articles
Researchers, authors, publishers know how to use them
Put datasets on the same playing field as articles
Connecting an Article with the Underlying Data
Dataset: Yancheva et al. (2007). Analyses on sediment of Lake Maar. PANGAEA. doi:10.1594/PANGAEA.587840
URLs are not persistent
(e.g. Wren JD: URL decay in MEDLINE- a 4-year follow-up study. Bioinformatics. 2008, Jun 1;24(11):1381-5).
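As a rough illustration of how a DOI makes a dataset citable, the sketch below turns the PANGAEA DOI from this slide into a resolvable link and assembles a citation string. It assumes the public resolver at https://doi.org/ and a "Creator (Year). Title. Publisher. Identifier" layout in the style DataCite recommends; the function names are hypothetical and chosen only for this example.

```python
from urllib.parse import quote

# Assumption: the public DOI resolver; a DOI appended to this prefix
# redirects to wherever the object currently lives.
DOI_RESOLVER = "https://doi.org/"


def doi_to_url(doi: str) -> str:
    """Turn a DOI such as 'doi:10.1594/PANGAEA.587840' into a resolver URL."""
    doi = doi.strip()
    if doi.lower().startswith("doi:"):
        doi = doi[4:]
    # Percent-encode unusual characters but keep the prefix/suffix slash.
    return DOI_RESOLVER + quote(doi, safe="/")


def format_dataset_citation(creators, year, title, publisher, doi):
    """Hypothetical helper: Creator (Year). Title. Publisher. Identifier."""
    return f"{creators} ({year}). {title}. {publisher}. {doi_to_url(doi)}"


citation = format_dataset_citation(
    "Yancheva et al.",
    2007,
    "Analyses on sediment of Lake Maar",
    "PANGAEA",
    "doi:10.1594/PANGAEA.587840",
)
print(citation)
```

Because the link goes through the resolver rather than pointing at a particular web page, the citation should keep working even if the hosting repository reorganises its site.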
24
Depositing and Archiving Data
Why Archive?
Institutional Repositories
UK Data Archive/ESDS
Metadata and Code!
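To make the "Metadata" point concrete, here is a minimal sketch of a descriptive record that could travel with a data file when it is deposited. The field names are hypothetical, loosely modelled on the kind of core properties a citation needs (identifier, creators, title, publisher, year); they are not the schema of any particular repository, and the sample values are placeholders.

```python
import hashlib
import json

# Hypothetical minimum set of descriptive fields for this example.
REQUIRED_FIELDS = ("identifier", "creators", "title", "publisher", "publication_year")


def build_metadata(data: bytes, **fields) -> dict:
    """Return a metadata record for `data`, refusing incomplete descriptions."""
    missing = [name for name in REQUIRED_FIELDS if not fields.get(name)]
    if missing:
        raise ValueError("missing metadata fields: " + ", ".join(missing))
    record = dict(fields)
    # A checksum and size let a later user confirm the file is unchanged.
    record["sha256"] = hashlib.sha256(data).hexdigest()
    record["size_bytes"] = len(data)
    return record


# Placeholder data and values, for illustration only.
sample = b"respondent_id,answer\n1,agree\n2,disagree\n"
record = build_metadata(
    sample,
    identifier="example-survey-2011",
    creators=["A. Researcher"],
    title="Example opinion survey",
    publisher="Example Repository",
    publication_year=2011,
)
print(json.dumps(record, indent=2, sort_keys=True))
```

Keeping a record like this, together with any analysis code, alongside the data is what lets someone else find, verify, and reuse the dataset later.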
25
John Kaye
Lead Curator – Datasets, Social Sciences
The British Library
96 Euston Road
London NW1 2DB
Telephone: 020 7412 7450
Email: [email protected]
Twitter: @johnkayebl
Slides - http://www.slideshare.net/johnkayebl