data documentation and metadata for data archiving and sharing managing research data well workshop...
TRANSCRIPT
![Page 1: Data documentation and metadata for data archiving and sharing Managing research data well workshop London, 30 June 2009 Manchester, 1 July 2009](https://reader035.vdocuments.us/reader035/viewer/2022062423/56649ec45503460f94bced37/html5/thumbnails/1.jpg)
Data documentation and metadatafor data archiving and sharing
Managing research data well workshop London, 30 June 2009
Manchester, 1 July 2009
![Page 2: Data documentation and metadata for data archiving and sharing Managing research data well workshop London, 30 June 2009 Manchester, 1 July 2009](https://reader035.vdocuments.us/reader035/viewer/2022062423/56649ec45503460f94bced37/html5/thumbnails/2.jpg)
2
Why document data?
• enables you to understand/interpret data• needed to make data independently understandable• ensures informed and correct use, reduces chance of
incorrect use/misinterpretation• if using your data for the first time, what would you need to
know?
• UKDA uses data documentation to: – create user guide(s) for dataset– ensure accurate processing and archiving– supplement information for catalogue record
![Page 3: Data documentation and metadata for data archiving and sharing Managing research data well workshop London, 30 June 2009 Manchester, 1 July 2009](https://reader035.vdocuments.us/reader035/viewer/2022062423/56649ec45503460f94bced37/html5/thumbnails/3.jpg)
3
What is data documentation?
1. Wider contextual information about project(Study-level metadata)
• background, history, aims, objectives
• academia: end-of-award reports
• Government/voluntary sector: published reports, e.g. Family Spending (EFS), Living in Britain (GHS)
• publications based on dataset
![Page 4: Data documentation and metadata for data archiving and sharing Managing research data well workshop London, 30 June 2009 Manchester, 1 July 2009](https://reader035.vdocuments.us/reader035/viewer/2022062423/56649ec45503460f94bced37/html5/thumbnails/4.jpg)
4
2. Methodology and processes: technical reports (also Study-level metadata)
• sample construction
• collection process - fieldwork, interviewer instructions
• instruments - questionnaires, showcards, interview schedules
• data validation - cleaning, error-checking
• data characteristics - temporal/geographic coverage
• variables - labels, coding, classifications, missing values
• derived variables - compilation
• dataset structure - files, relationships, cases, variables
What is data documentation?
![Page 5: Data documentation and metadata for data archiving and sharing Managing research data well workshop London, 30 June 2009 Manchester, 1 July 2009](https://reader035.vdocuments.us/reader035/viewer/2022062423/56649ec45503460f94bced37/html5/thumbnails/5.jpg)
5
2. Methodology and processes: technical reports (contd.)
• confidentiality measures: anonymisation carried out – aggregation, banding, coding and top-coding,
disclosure control?– editing of sensitive material in interview transcripts
• weighting: factors and variables, weighting process• any secondary data sources used?
What is data documentation?
![Page 6: Data documentation and metadata for data archiving and sharing Managing research data well workshop London, 30 June 2009 Manchester, 1 July 2009](https://reader035.vdocuments.us/reader035/viewer/2022062423/56649ec45503460f94bced37/html5/thumbnails/6.jpg)
6
3. researcher may add metadata routinely to files (Data-level metadata)
• quantitative data: variable/value labels; worksheet information; table relationships and queries in relational database; GIS data layers/tables
• qualitative data/text documents: interview transcript speech demarcation; respondent details
• technical reports (back to Study-level metadata)
• Data Documentation Initiative (DDI) (Study or Data-level metadata)
• http://www.ddialliance.org/codebook/index.html• metadata tools: http://tools.ddialliance.org• German Institute for Educational Progress (IQB) – educational
data codebooks www.iza.org
What is data documentation?
![Page 7: Data documentation and metadata for data archiving and sharing Managing research data well workshop London, 30 June 2009 Manchester, 1 July 2009](https://reader035.vdocuments.us/reader035/viewer/2022062423/56649ec45503460f94bced37/html5/thumbnails/7.jpg)
7
UKDA metadata
• UKDA collects and creates structured metadata for each archived dataset
• created during ingest data processing (Data-level metadata) – data dictionaries, format transfer, data listing, ingest processing details and
information gathered in ‘readme’ file for users
• Catalogue record and keyword index(mix of Study-/Data-level metadata - ‘Catalogue metadata’. Also contains ‘Administrative metadata’, such as access conditions, date of publication, etc.)– data deposit form – keyword index covers data elements and concepts– international standards: DDI, METS, ISAD(G), TEI– standardised elements + controlled vocabularies = consistent search and retrieval– sufficient information for users to decide if the data suitable– information on the provenance of a dataset– record of publications
![Page 8: Data documentation and metadata for data archiving and sharing Managing research data well workshop London, 30 June 2009 Manchester, 1 July 2009](https://reader035.vdocuments.us/reader035/viewer/2022062423/56649ec45503460f94bced37/html5/thumbnails/8.jpg)
8
Providing good documentation
• quality of the information provided by the data creator determines ease of discovery and appropriate re-use– comprehensive and comprehensible documentation and
metadata– complete the deposit form as fully as possible
• contact the UKDA if not sure what to produce or provide:– see advice on our Managing and Sharing web pages:
http://www.data-archive.ac.uk/sharing/metadata.asp– contact [email protected]
![Page 9: Data documentation and metadata for data archiving and sharing Managing research data well workshop London, 30 June 2009 Manchester, 1 July 2009](https://reader035.vdocuments.us/reader035/viewer/2022062423/56649ec45503460f94bced37/html5/thumbnails/9.jpg)
9
Recap – why document data?
• enables you to understand/interpret data• needed to make data independently understandable• ensures informed and correct use, reduces chance of
incorrect use/misinterpretation• if using your data for the first time, what would you
need to know?
![Page 10: Data documentation and metadata for data archiving and sharing Managing research data well workshop London, 30 June 2009 Manchester, 1 July 2009](https://reader035.vdocuments.us/reader035/viewer/2022062423/56649ec45503460f94bced37/html5/thumbnails/10.jpg)
10
Examples
• English Longitudinal Study of Ageing (ELSA) – very large study
• Quantitative dataset – depends on size and scale– Health Survey for England (HSE)– BHPS provides link to documentation site– smaller scale study, less documentation
• Qualitative dataset – depends on size and scale– data listing, interview schedules, methodology
![Page 11: Data documentation and metadata for data archiving and sharing Managing research data well workshop London, 30 June 2009 Manchester, 1 July 2009](https://reader035.vdocuments.us/reader035/viewer/2022062423/56649ec45503460f94bced37/html5/thumbnails/11.jpg)
11
ELSA documentation
![Page 12: Data documentation and metadata for data archiving and sharing Managing research data well workshop London, 30 June 2009 Manchester, 1 July 2009](https://reader035.vdocuments.us/reader035/viewer/2022062423/56649ec45503460f94bced37/html5/thumbnails/12.jpg)
12
Quantitative study
• smaller-scale study - user guide may just contain survey questionnaire, methodology information
• example from HSE 2007 – documents separated, bigger study
![Page 13: Data documentation and metadata for data archiving and sharing Managing research data well workshop London, 30 June 2009 Manchester, 1 July 2009](https://reader035.vdocuments.us/reader035/viewer/2022062423/56649ec45503460f94bced37/html5/thumbnails/13.jpg)
13
Qualitative study 1
• User guide contains variety of documents
![Page 14: Data documentation and metadata for data archiving and sharing Managing research data well workshop London, 30 June 2009 Manchester, 1 July 2009](https://reader035.vdocuments.us/reader035/viewer/2022062423/56649ec45503460f94bced37/html5/thumbnails/14.jpg)
14
Qualitative study 2
• Data Listing