finding and accessing data on speech and language using the uk data service beate lichtwardt uk data...

73
Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded Born Talking seminar series University of East Anglia Norwich, 11 May 2015

Upload: dylan-lane

Post on 23-Dec-2015

227 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Finding and accessing data on speech and language using the UK Data Service

Beate Lichtwardt

UK Data Service

UK Data Archive

University of Essex

ESRC-funded Born Talking seminar series

University of East Anglia

Norwich, 11 May 2015

Page 2: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Who

Where

How

… are we?

… to find relevant data?

… to access the data?

Roadmap

Page 3: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

UK Data Service

ukdataservice.ac.uk

Integrates ESDS, Survey Question Bank and Census.ac.uk

Page 4: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

What is the UK Data Service?

• a comprehensive resource funded by the ESRC

• a single point of access to a wide range of secondary social science data (6,000 datasets)

• support, training and guidance

(listen to the recorded webinars at /ukdataservice.ac.uk/news-and-events/webinars.aspx )

Page 5: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Who is it for?

• academic researchers and students

• government analysts

• charities and foundations

• business consultants

• independent research centres

• think tanks

ukdataservice.ac.uk

Page 6: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

What does the UK Data Service do?

• put together a collection of the most valuable data and enhance that over time

• preserve data in the long term for future research purposes

• make the data and documentation available for reuse

• provide data management advice for data creators

• provide support for users of the service

• information about the use to which data are put

• easy access through a website - ukdataservice.ac.uk

Page 7: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

What do users do with the data ?

• comparative research, restudy or follow-up study

• re-analysis/secondary analysis

• research design and methodological advancement

• replication of published statistics

• teaching and learning

Page 8: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Data sources

• official agencies - mainly central government

• international statistical time series

• individual academics - research grants

• market research agencies

• public records/historical sources

• access to international data via links with other data

archives worldwide

Page 9: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Types of data collections

• Survey microdata- cross-sectional- Panel/ Longitudinal- International microdata

• Aggregate statistics- International macrodata

• Census data- Aggregate data for 1971 - 2011- Microdata for 1991, 2001, 2011

• Qualitative and mixed methods data- in-depth interview transcripts, diaries, anthropological field notes, answers to open-ended survey questions, audio-visual recordings and images (mixed methods: combined with numeric data)

Page 10: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Overview service

Kinds of data:

UK Surveys InternationalLongitudinal

large-scale government funded surveys

Census Business

major UK surveys following individuals over time

multi-nation aggregate databanks and survey data

range of multimedia qualitative data sources

Census data 1971 – 2011

microdata

Qualitative

Page 11: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Key data

ukdataservice.ac.uk/get-data/key-data.aspx

Page 12: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Longitudinal data – some examples• National Child Development Study (NCDS)

• 1970 British Cohort Study (BCS70)

• Millennium Cohort Study (MCS)

• British Household Panel Survey (BHPS)

• Understanding Society/ UK Household Longitudinal Study (UKHLS)

• Families and Children Study (FACS)

• English Longitudinal Study of Ageing (ELSA)

• Next Steps (Longitudinal Study of Young people in

England - LSYPE)

• Growing Up in Scotland (GUS)

• Life Study

Page 13: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

The UK Household Longitudinal Study

• started in 2009: UKHLS or ‘Understanding Society’ ; First data: 2010

• study of the socio-economic circumstances and attitudes of 100,000 individuals in 40,000 British households

• UKLHS replaces and incorporates the BHPS (1991-2009 , 5500 hh’s), retaining the latter's longevity whilst adding - to the sample size - to the scope of the study (new components: innovation panel, greater detail on ethnic minority groups, qualitative and biomedical data collections)

• coverage includes:

current employment and earnings; employment status; parenting and childcare arrangements; family networks; benefit payments; political party identification; household finances; environmental behaviours; consents to administrative data linkage (health and education)

Page 14: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

British Birth Cohort Studies

impact of childhood conditions on later life and understanding children and families in the UK

• National Child Development Study follows a cohort born in a single week in 1958 - data collected at birth & ages 7, 11, 16, 23, 33, 42, 50 (7 Up TV series), and 55 (60)

• 1970 British Cohort Study follows a cohort born in a single week in 1970 - data collected around birth & ages 5, 10, 16, 26, 30, 34, 38, and 42 (46)

• Millennium Cohort Study focuses on children born in 2000/ 2001 - sweeps at 9 months, 3, 5, 7 and 11years (14)

wide range of social, economic, health, medical and psychological issues

Page 16: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

UK surveys - examples

• Health Surveys (Health Survey for England 1991-, Scottish Health Survey 1995-, Welsh Health Survey 1998-)

• Labour Force Survey, 1973-

• Life Opportunities Survey 2009-2012

• British Election Study Ethnic Minority Survey, 2010

• Phonological Memory as a Predictor of Language Development in Down Syndrome, 1995 and 2001

• ONS Omnibus Survey, 1990-2007 (from 2008 changed its name to ONS Opinions Survey (OPN) and became part of the Integrated Household Survey (IHS))

Page 17: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

HSE: 1991- 2013

links to Nesstar

Page 18: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

International data - examples

• European Community Survey, 1970-1992

• Eurobarometer,1970 –(GB 1973 -)

(Source: Eurobarometer 81, 2014)

Page 19: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Qualitative dataThe UK Data Service collections contain examples from a range of qualitative methods including:

• online data• observation notes• documents• audio data• newspaper clippings

• interview transcripts• visual data• focus groups• essays • diaries

etc.

Interview transcripts are the most commonly deposited qualitative data with the UK Data Archive.

Most contemporary data is deposited in a digital form, but we also hold (and sometimes digitise) legacy data which is a paper (handwritten or typewritten) format.

Page 20: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Qualitative/mixed methods data - examples

• The Relation Between Parenting, Children's Social Understanding and Language, 2006-2009(builds upon longitudinal study examining relations between parenting and 3-to-5 year-olds' social understanding (SU) and language;follows up children between 8 and 11 years, examining mothers' own SU and general language ability to determine whether it is mothers' talk that facilitates children's SU, or there is some kind of genetic predisposition (evident in mothers' own SU and language) that causes the link)

• British Sign Language Corpus, 2008-2011(collection of video clips showing deaf people using BSL, together with background information about the signers and some written descriptions of the signing)

Page 21: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Qualitative/mixed methods data - examples• From communication disorders research to conversation-

based interventions for adults with aphasia, 2012-13(consists of video recordings of everyday conversations in the home between people with aphasia (a post-stroke communication disorder that affects speaking, understanding, reading and writing) and a family member; video recordings of conversation therapy for aphasia)

“Effective interventions to improve the everyday conversation skills of adults with aphasia are considered vital to long term psychosocial health and wellbeing, and have a growing evidence base. However this type of therapy places heavy demands on the skill-set of speech and language therapists (SLTs). In response, we developed a free e-learning resource called Better Conversations with Aphasia for professionals, and for people with aphasia and their families. Our project partners were Connect, and the Tavistock Trust for Aphasia. The resource allows SLTs to learn how to do conversation therapy, with access to: a complete therapy package interactive learning materials advice from experienced clinicians over 60 video clips of real therapy in action study resources including reflective logs and quizzes. People with aphasia and their families can find out what conversation therapy is, and what other people with aphasia think about it, in order to consider whether it is right for them. In addition, there are online discussion forums where users can share experiences and ideas.“

• Auditory processing and language in children with mild to moderate sensorineural hearing loss, 2010-2013(investigate why some children with permanent mild or moderate hearing loss (MMHL) have impaired language, while others do not)

Page 22: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Finding data on speech and language

Page 23: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Search using Discover

discover.ukdataservice.ac.uk/

Page 24: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded
Page 25: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Variable and question search

discover.ukdataservice.ac.uk/variables

Page 26: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Variable and question search – my variables

Page 27: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Variable and question search – my variables

Page 28: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

QualiBank search

https://discover.ukdataservice.ac.uk/QualiBank

Page 29: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

QualiBank search (login first!)

Page 30: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Data access

Page 31: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Who can access the data?

All registered users

However, which data can be accessed and the

particular access conditions vary according to:

• User type (UK HE/FE, Non HE/FE, Non UK)

• Usage/project characteristics (Commercial, non-commercial)

• Specific data access conditions attached tochosen data

(EUL, SC, SL (AR), Secure Lab data access)

Page 32: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Data access

• web access to data and metadata

• data are freely available for use in higher education institutions

• data supplied in a variety of formats- statistical package formats (e.g. SPSS, STATA) - databases and spreadsheets- word processed documents, PDF documents etc.

Page 33: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Accessing data – step by step

• register with us via UK Federation using your university/UKDA username and password

• agree to an End User Licence (EUL)

• select the desired data from the Data Catalogue by clicking on the ‘Download/Order’ button

• specify a project for which the data is to be used (minimum: 30-word description)

• then: - download the data to local machine selecting your preferred format (SPSS, STATA, TAB etc.)or- place an order for the data (SC, if no online agreement; SL) and complete all relevant forms

Page 35: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Data access conditions

• End User Licence (EUL)

• Special Conditions (SC)

• Special Licence (SL) - ONS - Approved Researcher Status (SL September 2005 - April 2008)- Other depositors - SL (e.g. ISER, CLS)

• Secure Lab Data access (UK researchers only !)- Approved Researcher- Training- Password and username

Page 36: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

End User Licence

“Every user is required to agree to an End User Licence outlining the

terms and conditions of use of the data. This includes, for example,

- not to share data with anyone who is not authorised to access them;

- not to use the data for commercial purposes, except with permission;

- not to try to use the data to identify individuals, households or

organisations;

- to cite the data correctly and inform the UK Data Achive/UK Data

Service of any publications;

- not to disclose your login details to anyone else;

- agreeing to meet any charges…”

Page 37: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Special Licence Access

• Strategy to provide access to datasets that are detailed, yet anonymised

• As these data pose a higher risk of disclosure, they have additional special conditions attached to them

• The SL requires the signature(s) of the researcher(s) and the institution with responsibility for the researcher; also needs the explicit permission of the data owner to release the data to the researcher(s)

Page 38: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Secure Lab data access

• data not available for download

• access requires accreditation as an Approved Researcher, completion of face-to-face training, and agreement to the Service's User Agreement and Breaches Penalties Policy

• applications are screened by the UK Data Archive and the individual or institution having ownership of the data (or their designated authority), and access is only granted to those researchers requiring data for statistical research purposes and who can justify their need for the data

Page 39: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Secure Lab data access

Restriction: Secure Lab funded by the ESRC to provide access to academic researchers in the UK

Page 40: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Secure Lab data access – Example

• ‘Millennium Cohort Study, 2001-2008: Linked Education Administrative Dataset, England: Secure Access’ (SN 6862)

• Includes all the main MCS study and Teacher Survey files also held under End User Licence at the UK Data Archive, plus:

- a data file containing selected Key Stage 1 information from the Department for Education's National Pupil Database;

- additional data from School Census 2010 for those cohort members attending a school in England at the time of MCS4 interview;

- anonymised Local Education Authorities (LEA) and anonymised School Numbers, to allow comparison of results across LEA and school

Page 41: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

File structure

data download

Page 43: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

MCS – EUL access data

Page 44: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Millennium Cohort Study: Fifth Survey, 2012 - file structure (SPSS download)

(SN 7464)

Page 45: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

SN 7464 file structure – file-information

Page 46: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

SN 7464 file structure – mrdoc folder

Data dictionaries

Questionnaire documentation and user guide

Study and citation information

Page 47: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

SN 7464 file structure – data dictionaries

Page 48: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

SN 7464 file structure – UKDA (data citation)

Page 49: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Exploring data online

Page 50: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Online analysis with Nesstar

• Online data browsing and analysis system

• Allows users to search for, locate, browse and analyse and download a wide variety of statistical data within a web browser

• UK Data Archive, as service provider for the UK Data Service, hosts a Nesstar server populated by its most popular data series

• Registration is required for analysis such as crosstabulations and regressions

Page 51: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Nesstar catalogue

Source: nesstar.ukdataservice.ac.uk/webview

Page 52: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Nesstar - help links

• Nesstar website:www.nesstar.com/

• Demo video Nesstar:ukdataservice.ac.uk/get-data/themes/ageing/analysing.aspx

• Thematic videos using NESSTAR:ukdataservice.ac.uk/get-data/themes/education/analysing.aspx ukdataservice.ac.uk/get-data/themes/ageing/analysing.aspx ukdataservice.ac.uk/get-data/themes/crime/analysing.aspxukdataservice.ac.uk/get-data/themes/environment/analysing.aspx ukdataservice.ac.uk/get-data/themes/health/analysing.aspxukdataservice.ac.uk/get-data/themes/housing/analysing.aspx ukdataservice.ac.uk/get-data/themes/labour/analysing.aspx

Page 54: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

BSA - Pay gap

Page 55: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Each year the British Social Attitudes series feeds data into International Social Survey Programme allowing international comparisons

ISSP – Pay gap

Page 56: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

GHS - Time series (Nesstar)

Page 57: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Mapping (Nesstar) – Sunburn by GOR

Page 58: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

International macrodata outputs

Proportion of seats held by women in national parliaments (%)World Bank (2013): World Development Indicators (Data downloaded: 30 January 2013). Mimas, University of Manchester. DOI: http://dx.doi.org/10.5257/wb/wdi/2013-01-30

Page 59: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

International macrodata

Download the series you want from the appropriate online tool: Beyond 20:20, UKDS.Stat or IMF eLibrary.

UKDS.Stat Beyond 20:20

Page 61: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Example census data - religion

Page 62: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Support and resources

Page 63: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

• video tutorials/webinars

• case studies

• guides

• themes

• advice on managing and sharing data

• teaching data and resources

• helpdesk, individual user support

Support and resources

Page 64: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

ukdataservice.ac.uk/news-and-events/webinars.aspx

Page 65: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

ukdataservice.ac.uk/news-and-events/webinars.aspx

Page 66: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

ukdataservice.ac.uk/use-data/data-in-use.aspx

Case studies

Page 67: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Managing and sharing data

(ukdataservice.ac.uk/manage-data.aspx)

Page 68: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Teaching resources

ukdataservice.ac.uk/use-data/teaching.aspx

Page 69: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Teaching datasets in Discover

Page 70: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Teaching resources – interview methods

ukdataservice.ac.uk/teaching-resources/interview.aspx

Page 71: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Help

ukdataservice.ac.uk/help/faq.aspx

Page 72: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Contact• Helpdesk:

ukdataservice.ac.uk/help/get-in-touch

Tel: 01206 872143

• Subscription emails, facebook, twitter and youtube:

to subscribe to UK Data Service emails: www.jiscmail.ac.uk/cgi-bin/webadmin?A0=UKDATASERVICE

to follow UK Data Service on Twitter: @UKDataService

Facebook

Youtube: www.youtube.com/user/UKDATASERVICE

Page 73: Finding and accessing data on speech and language using the UK Data Service Beate Lichtwardt UK Data Service UK Data Archive University of Essex ESRC-funded

Questions?