research data management: what is it and why does it matter to … · 2019. 4. 1. · “data are...

74
Research Data Management: What is it and Why does it matter to me? Jane Fry, MacOdrum Library April 2, 2019

Upload: others

Post on 06-Oct-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

Research Data Management:

What is it and

Why does it matter to me?

Jane Fry, MacOdrum Library

April 2, 2019

Page 2: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

Agenda

Why the Library

What data resources are out there

What are the key components of RDM

What is an RDMP

2

Page 3: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

Where are we? Where are we?

3

Page 4: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

4https://library.carleton.ca/find/data

Page 5: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

What do we offer?

Main services| Access to data

• different data sources and portals

• different types of data

| Research Data Management

• Data management plans

| Stats Consulting

• SPSS, Stata, SAS, R

One restriction for data use| Can ONLY be used for academic research or teaching

5

Page 6: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

Canadian data sources

Statistics Canada| Data Liberation Initiative

| Includes

• All Statistics Canada’s public use data files, databases and

geographic files

Canadian Election Surveys| 1965 - 2015

6

Page 7: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

Public Opinion Polls

7

Page 8: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

How do you access the data?

Different data portals| Search for questions or variables across datasets

| Perform online analyses

| Create tables, graphs and charts

| Download data in SPSS, SAS, Stata, …

8

Page 9: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

Canadian data portals

ODESI| Data portal for social science data for Ontario Universities

| Data from StatCan, public opinion polls, …

| http://www.library.carleton.ca/find/data

Statistics Canada| Public use microdata files (E/F)

| Metadata for Research Data Centre Master files (E/F)

| http://dli-idd-nesstar.statcan.gc.ca/webview/

9

Page 10: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

ODESI

10

Page 11: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

11

Page 12: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

Statistics Canada

http://www5.statcan.gc.ca/subject-sujet/index.action?&lang=eng

12https://www.statcan.gc.ca/eng/subjects

Page 13: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

International Data Sources

ICPSR | Inter-university Consortium for Political and Social

Research

| https://www.icpsr.umich.edu/

| World renowned summer program in quantitative

methods of social statistics

13

Page 14: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

International Data Portals

CESSDA| Consortium of European Social Science Data Archives

| http://www.cessda.eu/

GESIS| Leibniz Institute for the Social Sciences

| https://www.gesis.org/home/

ESS ERIC| European Social Survey, European Research Infrastructure

| http://www.europeansocialsurvey.org/

14

Page 15: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

Confidential data

COOL Research Data Centre| Located at University of Ottawa

• available for all Carleton researchers

| Secure access to detailed microdata from Statistics Canada's

surveys, Canadian censuses' data, as well as an increasing

number of administrative data sets.

| Data can only be used for research and teaching purposes

• Check out metadata here:

• “Statistics Canada metadata for Master Files (RDC)”

http://dli-idd-nesstar.statcan.gc.ca/webview/

| Money for grad students!

https://crdcn.org/carleton-ottawa-outaouais-rdc-cool-rdc

Reference: Prof. Jennifer Stewart 15

Page 16: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

Recent RDC Data additions

December 2018

DES (Digital Economy Survey) 2018

EIB (Employment Insurance Beneficiaries) January 1997 - June 2018

EICS (Employment Insurance Coverage Survey) 2017

LFS (Labour Force Survey) November 2018

LISA (Longitudinal and International Study of Adults) 2016

ROE (Record of Employment) 1987 - current

November 2018

APS (Aboriginal Peoples Survey) 2017

CTADS (Canadian Tobacco Alcohol and Drugs Survey) 2017

ELMLP (Education and Labour Market Longitudinal Linkage Platform)

NCS (National Cannabis Survey) 2018 Wave 3

LAD (Longitudinal Administrative Databank) 2016

LFS (Labour Force Survey) October 2018

Reference: Prof. Jennifer Stewart 16

Page 17: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

What else can we do to help you?

Consultation

| on the use of data

| Research data management (RDM)

• RDM Plans

Provide general tips and help on using data

17

Page 18: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

Acronyms

Research data management| aka RDM

| aka data management

• aka DM

Research data management plan| aka RDMP

| aka data management plan

• aka DMP

18

Page 19: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

Why the Library?

Research partner

Support the research endeavor

RDM expert

Partner with CU Research Office

The scholarly life-cycle

Discipline-agnostic

19

Page 20: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

Why the Library? (cont’d)

Our role| Information

| Consultation

Challenge| Determine how we can help researchers advance their

research

References: Rambo Neil; Shorish, Yasmeen20

Page 21: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

Research data

Exercise #1| Get together in groups

| Come up with a definition for Research data

| 2 minutes!

| GO!

21

Page 22: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

What are Research Data?

“Research data are the original sources or

material that you have created or collated to

conduct your research project. They can be

digital or non-digital. The response to your

research question is based on the analysis of

these research data.”

Source: https://blogs.ucl.ac.uk/rdm/2015/09/what-is-research-data/22

Page 23: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

“Data are facts, observations or experiences on which an

argument or theory is constructed or tested. Data may be

numerical, descriptive, aural or visual. Data may be raw,

abstracted or analysed, experimental or observational. Data

include but are not limited to: laboratory notebooks; field

notebooks; primary research data (including research data

in hardcopy or in computer readable form); questionnaires;

audiotapes; videotapes; models; photographs; films; test

responses. Research collections may include slides;

artefacts; specimens; samples.”

Source: https://blogs.ucl.ac.uk/rdm/2015/09/what-is-research-data/23

Page 24: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

Research Data

Why are research data important?

Sharing research data

Check out the following examples …

24

Page 25: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

Example: Reproducibility

Political Persuasion and Attitude

Change Study: The Los Angeles

Longitudinal Field Experiments, 2013-

2014

Principal Investigator(s):| Michael J. LaCour

Reference: https://www.openicpsr.org/openicpsr/project/100037/version/V8/view25

Page 26: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

26Reference: http://stanford.io/2bzRWFo

Page 27: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

27

Summary

We report a number of irregularities in the replication

dataset posted for LaCour and Green … that jointly suggest

the dataset (LaCour 2014) was not collected as described.

These irregularities include baseline outcome data that is

statistically indistinguishable from a national survey and

over-time changes that are unusually small and

indistinguishable from perfectly normally distributed noise.

Other elements of the dataset are inconsistent with patterns

typical in randomized experiments and survey responses

and/or inconsistent with the claimed design of the study. …

Reference: http://stanford.io/2bzRWFo

Page 28: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

28Reference: http://bit.ly/1NxWG5M

Page 29: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

Another Example

“New Study Links Vaccines To Autism.

There's Just One Tiny Problem With It”

“… one of its own co-authors claimed

that figures in the paper were

deliberately altered before

publication. The data had been tampered

with. …”

Reference: http://bit.ly/2zSwAxo 29

Page 30: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

“Researchers from the University of British Columbia

are retracting their scientific paper linking aluminum in

vaccines to autism in mice, because one of the co-

authors claims figures published in the study were

deliberately altered before publication — an issue he

says he realized after allegations of data manipulation

surfaced online.”

“…original data cited in the study is inaccessible, which

would be a contravention of the university's policy

around scientific research. ”

“…the original data is in China, with an analyst who

worked on the paper.”

(October 16, 2017)

Reference: https://bit.ly/2kSjMRJ 30

Page 31: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

31

Page 32: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

And another Example

“A top Cornell food researcher has had

13 studies retracted. That’s a lot.”| September 21, 2018

| Brian Wansink

| “committed academic misconduct,”

| “he would retire from the university on June 30, 2019”

| “has been removed from all teaching and research,”

| “will spend his remaining time … in an “ongoing review of

his prior research.”

Reference: https://bit.ly/2xocIjs 32

Page 33: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

Wansink refuted these findings. “There was no fraud, no

intentional misreporting, no plagiarism, or no

misappropriation,” he wrote. “I believe all of my findings

will be either supported, extended, or modified by other

research groups.”

“In a press release, JAMA said Cornell couldn’t “provide

assurances regarding the scientific validity of the 6

studies” because they didn’t have access to Wansink’s

original data. So, Wansink’s ideas aren’t necessarily

wrong, but he didn’t provide credible evidence for them.”

Reference: https://bit.ly/2xocIjs

33

Page 34: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

34Source: https://bit.ly/2OyCH1N

Page 35: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

And next …

What is RDM?

35

Page 36: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

RDM

What is RDM?| “ …describes the activities researchers perform as they

create and save their research data.”

• Source: http://researchdata.library.ubc.ca/learn/

Includes| Sound practices

| Data curation

| Data stewardship

36

Page 37: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

Benefits of RDM

Confirmation of original findings

Further research

Planning follow-up studies

Bonus …

37

Page 38: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

Why RDM Now

Requirement by funders

| Tri-Council (SSHRC, CIHR and NSERC)

| CFI

| Genome Canada

Tri-Agency Statement of Principles on Digital

Data Management

We should be ahead of the curve in this

You are at the beginning of a research career38

Page 39: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

Metadata

Exercise | Groups

| What have I given you?

| Define it!

| What can you tell me about it?

| 2 minutes

| GO!

39

Page 40: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

Metadata

What is it

Explains …

Why is it important

Who enters it

40

Page 41: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

Metadata (cont’d)

Why keep metadata| Researchers re-use data

| Good research practice

When to record it

What to keep

End goal

41

Page 42: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

Time to think again!

Exercise| Is this dataset ready for deposit?

| Is there enough metadata in these variable and value

labels?

42

Page 43: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

Metadata (cont’d)

Survey metadata | Questionnaire

| Data collection

| Interviewer instructions

| ???

43

Page 44: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

44

http://taitegallery.net/wp-content/uploads/2012/02/unanswered-questions.jpg

Page 45: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

The Research Data Lifecycle

Exercise| What is a Research data lifecycle?

| Get into groups

| Define it, draw it, …

| 5 minutes

| Go!

45

Page 46: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

46

University of Central Florida Libraries

Source: http://stars.library.ucf.edu/cgi/viewcontent.cgi?article=1058&context=lib-docs

Page 47: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

47

UKDA RDM Lifecycle

Source: http://www.data-archive.ac.uk/create-manage/life-cycle

Page 48: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

48

http://www.data-archive.ac.uk/create-manage/life-cycle

Page 49: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

49

http://www.data-archive.ac.uk/create-manage/life-cycle

Page 50: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

http://www.data-archive.ac.uk/create-manage/life-cycle

50

Page 51: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

http://www.data-archive.ac.uk/create-manage/life-cycle

51

Page 52: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

http://www.data-archive.ac.uk/create-manage/life-cycle 52

Page 53: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

http://www.data-archive.ac.uk/create-manage/life-cycle 53

Page 54: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

54

UKDA RDM Lifecycle

Source: http://www.data-archive.ac.uk/create-manage/life-cycle

Page 55: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

What’s next?

Need an RDMP

Why an RDMP| Safety

| Efficiency

| Quality

If no RDMP?| Potential problems

55

Page 56: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

56

Reference: https://portagenetwork.ca/

Page 57: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

57

Reference: https://portagenetwork.ca/

Page 58: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

RDMP (Cont’d)

Portage DMP Assistant

| Data Collection

| Documentation and Metadata

| Storage and Backup

| Preservation

| Sharing and Re-use

| Responsibilities and Resources

| Ethics and Legal Compliance

58

Page 59: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

Data collection

Types of data

File formats

Conventions and procedures

59

Page 60: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

Documentation and metadata

Documentation

Consistency

Metadata standard and tools

60

Page 61: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

Storage and backup

Storage requirements

Storage and backup

Access to data

61

Page 62: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

Preservation

What data

Where will you deposit your data

Preservation ready

62

Page 63: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

Sharing and reuse

What data

How

End-user license

Promotion

63

Page 64: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

Responsibilities and resources

Who

How to handle change

Resources

64

Page 65: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

Ethics and legal compliance

Sensitive data| Primary use

| Secondary use

Legal, ethical and IP issues

65

Page 66: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

DMP Assistant

Anyone

Step-by-step

The length

Different agencies

Remember …

66

Page 67: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

Some tips

Mark it down!

It is not written in stone!

Easy!

An example of another DMP

67

Page 68: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

Still don’t believe me?

What could happen if you don’t practice

good RDM?

https://www.youtube.com/watch?v=N2zK3sAtr-4#t=17

68

Page 69: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

RDM help

Help with RDM| https://library.carleton.ca/services/research-data-

management

| Consultations

Help with RDMPs| Portage: https://assistant.portagenetwork.ca/

| Word template:

https://library.carleton.ca/services/research-data-

management#how

69

Page 70: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

In summary

You are now able to:| Explore different data sources

| Define the key components of RDM

| Define an RDMP

| Create an RDMP

70

Page 71: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

Resources

RDM at Carletonhttps://library.carleton.ca/services/research-data-management

Portage DMP Assistant

https://portagenetwork.ca/

Research Data Lifecycle (UK Data Archive)http://www.data-archive.ac.uk/create-manage/life-cycle

Tri-Agency Statement of Principles on Digital

Data Managementhttp://www.science.gc.ca/default.asp?lang=En&n=547652FB-1

71

Page 72: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

References

Rambo, Neil (October 22, 2015). “Research data

management roles for Libraries” .http://www.sr.ithaka.org/publications/research-data-management/

Shorish, Yasmeen (November 23, 2015). “The

Library as Research Partner”. ACRL

TechConnect Blog. http://acrl.ala.org/techconnect/post/the-library-as-research-partner

72

Page 73: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

http://dilbert.com/strip/2016-01-0673

Page 74: Research Data Management: What is it and Why does it matter to … · 2019. 4. 1. · “Data are facts, observations or experiences on which an argument or theory is constructed

Contact Information

Jane Fry

Data Services Librarian

Rm 122, MacOdrum Library

613.520.2600 x1121

[email protected]

Chris Shoniker

Data Support Specialist

Rm 122, MacOdrum Library

613.520.2600 x8140

[email protected]

http://www.library.carleton.ca/find/data74