take control research data 2016-10

Take control of your PhD journey: Manage your research data according to best practice. Philipp Conzett & Lars Figenschou, UiT University Library, October 27, 2016. Slides will be available at: http://www.slideshare.net/UiT_takecontrol

Upload: university-library-uit

Post on 10-Jan-2017



TRANSCRIPT

Page 1: Take control research data 2016-10

Take control of your PhD journey:

Manage your research data according to best practice

Philipp Conzett & Lars Figenschou
UiT University Library
October 27, 2016

Slides will be available at: http://www.slideshare.net/UiT_takecontrol

Page 2: Take control research data 2016-10

Outline

• Course objectives and layout (Lars)

• General background and rationale for research data management and sharing (Lars)

• Best practice during the life cycle of research data
– Searching/reusing (Philipp)
– Collecting (Philipp)
– Processing (Philipp)
– Archiving (Philipp)
– Planning (Lars)

• Support at the UiT Library

• Course evaluation

www.business.mcmaster.ca

Page 3: Take control research data 2016-10

Course objectives and layout

• Objectives
- Give you a glimpse into how research data should be managed
- Prepare a data management plan at the outset of your project
- Show how to structure, document, and preserve data
- ... and how you can archive and share your data

• Goals
- Help you crack some codes within “The Lifecycle of Research Data Management”
- Be better prepared to fulfill present and future requirements from research funding agencies (and your home institution)
- Understand (better) the background and rationale of data sharing as such
- Encourage you to share and re-use data

www.powerful-sample-resume-formats.com

Page 4: Take control research data 2016-10

Course objectives and layout

• Layout
– VERY dynamic ...!!
– Large variation in your needs, so let us know!
– Disruptions are necessary (and welcome ...)
– Presentation
– Tasks and discussions
– Working with your own (or others') data
– 15 min. tea/coffee/fruit/chat break

layout.fm

Page 5: Take control research data 2016-10

Background

• This module about research data ... is the first ever ... organized at UB
- dependent on the students
- we need input, suggestions, and contributions from you (initiative)

• The University Library is building competence in the field to enable OPEN SCIENCE

• E.g., UiT Open Research Data (launched in Sep. 2016)

Page 6: Take control research data 2016-10

Background

• Open Access to Research Data
• Share and Re-use
• Being transparent supports your integrity
• Increased visibility – more citations
• Makes research more efficient

• Funders – EU & NFR (and others)
• Obligatory

• Add value for yourself
• Add value for others
• ... a part of «OPEN SCIENCE»

• https://www.youtube.com/watch?v=2JBQS0qKOBU

www.fosteropenscience.eu

Page 7: Take control research data 2016-10

UiT and Open Science

• “As open as possible, as closed as necessary” (H2020 Programme: Guidelines on Open Access to Scientific Publications and Research Data in Horizon 2020)

• “Open by default” (“åpen som standard”) (Tilgjengeliggjøring av forskningsdata, Norges forskningsråd)

• Right now (fall 2016), UiT is establishing a general policy for research data management. This policy should facilitate effective, responsible and future-oriented management of research data. By 2020, UiT will have established services for storage of all types of research data (Styringssignaler, Universitetsstyret UiT 2016)

1. UiT publiseringsfond (uit.no/ub)
2. UiT Open Research Data (opendata.uit.no)

Page 8: Take control research data 2016-10

Background

Invest some time now – save time later
«It is the planning itself that matters ...»

So, be organized – from the start
Keep track of changes and pass on (your) knowledge

Checklist for (a) Data Management (Plan): http://www.dcc.ac.uk/resources/data-management-plans

«Let your datasets live happily ... ever after ...»

Page 9: Take control research data 2016-10

Background: Discussion (or should we wait..?)

• Pros and cons of:
- having good routines/following best practice for research data management
- research data sharing and/or Open Science

• From the video:
– What if someone in your lab quits?
– What if you need to use your old data?
– What if you are accused of fraud?
– What if your laptop is stolen?
– What if you could get more credit for your work?
– What if ...

depositphotos.com

Page 10: Take control research data 2016-10

The Lifecycle of Research Data Management

Phases:
• Planning
• Collecting
• Processing
• Archiving
• Searching / reusing

Page 11: Take control research data 2016-10

Searching / reusing

• Good practice at the outset of your research project:
– Literature survey
– AND data survey

• Sources / places to search:
– Research group, supervisor, colleagues, ...
– (Electronic) literature
– Directories, e.g. Registry of Research Data Repositories
• Browse or search
• Reference to research data: Persistent Identifier (PID)
– Research data archive(s)
– Databases, e.g. DataCite
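
A common kind of persistent identifier for datasets is the DOI, which resolves through https://doi.org/. The following is a minimal sketch of how a DOI found in a paper or a database record can be turned into a resolvable link. The function name `doi_to_url` and the DOI used in the usage note are hypothetical placeholders, not taken from the slides.

```python
# Prefixes under which a DOI is commonly written; all are stripped before resolving.
KNOWN_PREFIXES = (
    "https://doi.org/",
    "http://doi.org/",
    "https://dx.doi.org/",
    "http://dx.doi.org/",
    "doi:",
)


def doi_to_url(identifier: str) -> str:
    """Return a resolvable https://doi.org/ link for a DOI (hypothetical helper)."""
    doi = identifier.strip()
    for prefix in KNOWN_PREFIXES:
        if doi.lower().startswith(prefix):
            doi = doi[len(prefix):]
            break
    # A DOI starts with the directory indicator "10." and has a prefix/suffix pair.
    if not doi.startswith("10.") or "/" not in doi:
        raise ValueError(f"Not a DOI: {identifier!r}")
    return "https://doi.org/" + doi
```

For example, `doi_to_url("doi:10.1234/example")` gives a link of the form `https://doi.org/10.1234/example`; the identifier here is made up and will not resolve.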

Page 12: Take control research data 2016-10

Data Storage

Page 13: Take control research data 2016-10

Data Storage: Backups

Avoid data loss by establishing good backup routines:
• Regular backup
• Several backups:
– Here: e.g. your computer
– Near: e.g. your home directory at UiT (\\homer.uit.no)
– Far: e.g. a cloud service like myDoc/OneDrive at UiT (https://mydoc.uit.no/)
• Shared storage areas, e.g. uDoc at UiT (https://udoc.uit.no/)
• Versioning = keep track of changes

Check http://orakel.uit.no/ for help.
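
The "several backups" routine can be partly automated. The sketch below is one possible approach, not a UiT service: it copies a data folder into a dated subfolder at each backup location and compares checksums to confirm the copies are intact. The function names and the folder layout are assumptions made for illustration; the destinations could be, for instance, a mounted home directory ("near") and a synced cloud folder ("far").

```python
import hashlib
import shutil
from datetime import date
from pathlib import Path


def sha256_of(path: Path) -> str:
    """Return the SHA-256 checksum of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest()


def backup_folder(source: Path, destinations: list[Path]) -> list[Path]:
    """Copy `source` into a dated subfolder of each destination and verify it.

    Hypothetical helper: the dated folder name (name_YYYY-MM-DD) gives a
    simple form of versioning, one snapshot per day.
    """
    stamp = date.today().isoformat()  # YYYY-MM-DD
    created = []
    for destination in destinations:
        target = destination / f"{source.name}_{stamp}"
        shutil.copytree(source, target, dirs_exist_ok=True)
        # Compare every original file with its copy.
        for original in source.rglob("*"):
            if original.is_file():
                copy = target / original.relative_to(source)
                if sha256_of(original) != sha256_of(copy):
                    raise IOError(f"Checksum mismatch for {copy}")
        created.append(target)
    return created
```

Running this on a schedule (for example once a day) covers "regular backup"; it does not replace the versioning offered by a proper version control or cloud service.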

Page 14: Take control research data 2016-10

File and Folder Naming and Organizing

Some fundamental file naming recommendations:
• Files should be named consistently
• File names should be descriptive, but short (< 25 characters)
• Use underscores ( _ ) instead of spaces
• Avoid characters like “ / \ : * . ? ‘ < > [ ] ( ) & $ æÆ øØ åÅ ...
• Use the international dating convention YYYY-MM-DD

Possible strategies:
• Order by date
• Order by subject
• Order by type
• Forced order with numbering

Folder naming and organization:
• Choose a consistent system, stick to it, and document it in a ReadMe-file
• Main structure should be visible in the file names
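
The naming recommendations above can be checked mechanically. The following sketch is an illustration only: the helper names are hypothetical, and two interpretations are assumptions, namely that the length limit applies to the name without its extension and that only ASCII letters, digits, underscores and hyphens count as safe characters.

```python
import re
import string
from datetime import date
from pathlib import PurePath

MAX_STEM_LENGTH = 24  # "short (< 25 characters)", applied to the name without extension
SAFE_CHARS = set(string.ascii_letters + string.digits + "_-")


def check_file_name(name: str) -> list[str]:
    """Return a list of problems found in a bare file name (no folders)."""
    stem = PurePath(name).stem  # file name without the final extension
    problems = []
    if len(stem) > MAX_STEM_LENGTH:
        problems.append(f"name is longer than {MAX_STEM_LENGTH} characters")
    if " " in stem:
        problems.append("contains spaces; use underscores instead")
    unsafe = sorted({ch for ch in stem if ch not in SAFE_CHARS and ch != " "})
    if unsafe:
        problems.append("contains characters to avoid: " + " ".join(unsafe))
    return problems


def build_file_name(day: date, description: str, extension: str) -> str:
    """Build a name of the form YYYY-MM-DD_description.ext."""
    slug = re.sub(r"\s+", "_", description.strip())  # spaces become underscores
    slug = re.sub(r"[^A-Za-z0-9_-]", "", slug)  # drop characters to avoid
    return f"{day.isoformat()}_{slug}.{extension}"
```

Putting the date first means that files sort chronologically in a folder listing, which corresponds to the "order by date" strategy; a subject or type prefix would work the same way for the other strategies.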

Page 15: Take control research data 2016-10

Documentation / metadata: ReadMe files

ReadMe file = description of your dataset / user guide to your data

Best practice recommendations for ReadMe files:
• Start early
• Describe
– contact information
– what the dataset is about
– file structure and naming conventions
– where to find which data = overview of your files
– methods and workflow
– column headings in tabular data
– abbreviations
– units of measure
– ...
• Save as Unicode .txt
• Check specific metadata requirements (discipline, archive, ...)
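
To make "start early" easier, a ReadMe skeleton can be generated when a project folder is created. This is a sketch under assumptions: the function name, the section wording, and the file name ReadMe.txt are illustrative choices, and the "TODO" entries are placeholders to be filled in by hand. The file is written as UTF-8 plain text, in line with "Save as Unicode .txt".

```python
from pathlib import Path

# Sections left for the researcher to fill in by hand.
OPEN_SECTIONS = [
    "Methods and workflow",
    "Column headings in tabular data",
    "Abbreviations",
    "Units of measure",
]


def write_readme(folder: Path, title: str, contact: str, about: str, naming: str) -> Path:
    """Write a ReadMe.txt skeleton into `folder` and return its path."""
    # Overview of files: every file below the folder, except the ReadMe itself.
    files = sorted(
        path.relative_to(folder).as_posix()
        for path in folder.rglob("*")
        if path.is_file() and path.name != "ReadMe.txt"
    )
    lines = [
        title,
        "=" * len(title),
        "",
        "Contact information",
        contact,
        "",
        "What the dataset is about",
        about,
        "",
        "File structure and naming conventions",
        naming,
        "",
        "Overview of files",
    ]
    lines += [f"- {name}" for name in files]
    for section in OPEN_SECTIONS:
        lines += ["", section, "TODO"]
    readme = folder / "ReadMe.txt"
    readme.write_text("\n".join(lines) + "\n", encoding="utf-8")
    return readme
```

Re-running the function after adding files refreshes the file overview, but it also overwrites hand-written sections, so in practice it is best used once at the start of a project.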

Page 16: Take control research data 2016-10

Archiving: Preparing

Preparing your data for archiving:
• Selection
• Do not exclude negative / null data
• Include raw version and analyzed version(s)
• Provide your data in original AND persistent file format

Persistent file formats are usually
• non-proprietary,
• open, with documented international standards,
• in common usage by the research community,
• using standard character encodings (e.g. ASCII, UTF-8), and
• uncompressed (space permitting)

Page 17: Take control research data 2016-10

Archiving: Persistent file formats

Persistent file formats for common document types:

Detailed data guide available here.

| Type of document | Non-persistent format (examples) | Persistent format |
| --- | --- | --- |
| Text | MS Word (.docx) | PDF/A |
| Spreadsheet | MS Excel (.xlsx) | Tab-separated Unicode text (.txt) |
| Image | Windows Bitmap (.bmp) | Uncompressed TIFF |
| Sound | AAC (.m4a) | WAV |
| Video | Quicktime (.mov) | MPEG-4 |
| Databases | MS Access (.accdb) | XML or tab-separated Unicode text (.txt) |
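
As an illustration of the spreadsheet row, the sketch below converts a table to tab-separated Unicode text. It assumes the spreadsheet has already been exported to CSV from the spreadsheet program, because reading .xlsx directly requires a third-party library; the function name and its parameters are hypothetical.

```python
import csv
from pathlib import Path


def csv_to_tab_separated(source: Path, target: Path, source_encoding: str = "utf-8") -> int:
    """Rewrite a CSV file as tab-separated UTF-8 text; return the number of rows."""
    rows = 0
    with source.open(newline="", encoding=source_encoding) as infile, \
            target.open("w", newline="", encoding="utf-8") as outfile:
        reader = csv.reader(infile)
        # Tab as delimiter, plain "\n" line endings, output always in UTF-8.
        writer = csv.writer(outfile, delimiter="\t", lineterminator="\n")
        for row in reader:
            writer.writerow(row)
            rows += 1
    return rows
```

If the export was saved in another encoding (older spreadsheet exports often are), pass it as `source_encoding` so that characters such as æ, ø and å survive the conversion. Keep the original .xlsx alongside the .txt, in line with providing data in original AND persistent format.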

Page 18: Take control research data 2016-10

Archiving: Choosing your archive

• Research group, supervisor, colleagues, ...
• Registry of Research Data Repositories (http://www.re3data.org/)
• UiT's own archive for open research: UiT Open Research Data (https://opendata.uit.no/)

Page 19: Take control research data 2016-10

Planning

• Context dependent
- which phase are you planning for? (designing, collecting, processing or analyzing, saving, archiving, dissemination)

• General rules
- procrastination (postponing ...) does not work
- use common sense
- ask for help/guidance

• DATA MANAGEMENT
Keep in mind: How would you ...
- collect
- store
- describe
- and share your data?

Page 20: Take control research data 2016-10

Exercises

1. Find a relevant research data archive (within your discipline).

2. Check whether your own data comply with best practice for research data management. If you do not have your own data available yet, you may use the following data set: http://tinyurl.com/zgb34mp.

Page 21: Take control research data 2016-10

Test base: http://henry2.ub.uit.no:8080/

User name: test01, test02, ... test10
Password: test2016

Page 22: Take control research data 2016-10

Exercise: Create a dataset

1. Log in (production: Feide)
2. Choose Add Data and New Dataset
3. Fill in the required information in the form. Try to specify at least 3 keywords.
4. Upload one or more files.

Page 23: Take control research data 2016-10

Exercise: Register additional metadata

1. Go to the dataset and select the tab Metadata.
2. Select Add + Edit Metadata
3. Look through the extended metadata schema, and check if there is additional information that is relevant to the data set you have created.

Page 25: Take control research data 2016-10

Course evaluation

• Please take 5 min. to fill in our electronic evaluation form at http://bit.ly/ubevalen

• Teachers' names: Lars Figenschou, Philipp Conzett
• Date: 27.10.2016
• Title of course: TC

• Thanks, and good luck!