take control research data 2016-10
TRANSCRIPT
Take control of your PhD journey:
Manage your research data according to best practice
Philipp Conzett & Lars FigenschouUiT University LibraryOctober 27, 2016
Slides will be available at: http://www.slideshare.net/UiT_takecontrol
Outline
• Course objectives and layout (Lars)
• General background and rationale for research data management and sharing (Lars)
• Best practice during the life cycle of research data– Searching/reusing (Philipp)– Collecting (Philipp)– Processing (Philipp)– Archiving (Philipp)– Planning (Lars)
• Support at the UiT Library
• Course evaluationwww.business.mcmaster.ca
Course objectives and layout
• Objectives- Give you a glimpse into how research data should be managed - Data management plan at the outset of your project- Show how to structure, document, and preserve data - …and how you can archive and share your data
Goals- To help you crack some codes within “The Lifecycle of Research Data Management”- Be more prepared to fulfill present and future requests from research funding agencies (and your home institution)- Understand (better) the background and rationale of data sharing as such- Make you share and re-use data
www.powerful-sample-resume-formats.com
Course objectives and layout
• Layout– VERY dynamic…!!– Large variation in your needs, so let us know!– Disruptions are necessary (welcome…)– Presentation – Tasks and discussions– Working with your own (or others) data– 15 min. tea/coffee/fruit/chat break
layout.fm
Background
• This Module about Research Data …is the first ever… organized at UB
- dependent on the students- we need inputs, suggestions, contributions from you (initiative)
• The University Library are building competence on the field to enable OPEN SCIENCE
• I.e., UiT Open Research Data (launched in Sep. 2016)
Background
• Open Access to Research Data• Share and Re-use• Being transparent support your integrity• Increased visibility – more citations• Makes research more efficient
• Funders – EU & NFR (and others) • Obligatory
• Add value for your self• Add value for others• …a part of «OPEN SCIENCE»
• https://www.youtube.com/watch?v=2JBQS0qKOBU
www.fosteropenscience.eu
UiT and Open Science
• “As open as possible, as closed as necessary“(H2020 Programme: Guidelines on Open Access to Scientific Publications and
Research Data in Horizon 2020)
• “Open by default” (“åpen som standard”)(Tilgjengeliggjøring av forskningsdata, Norges forskningsråd)
• Right now (fall 2016), UiT are establishing a general policy for research data management. This policy should facilitate effective, responsible and future-oriented management of research data.By 2020, UiT will have established services for storage of all types of research data(Styringssignaler, Universitetsstyret UiT 2016)
1. UiT publiseringsfond (uit.no/ub)2. UiT Open Research Data (opendata.uit.no)
Background
Invest some time now – save time later«It is the planning itself that matters….»
So, be organized – from the startKeep track of changes and pass on (your) knowledge
Checklist for (a) Data Management (Plan): http://www.dcc.ac.uk/resources/data-management-plans
«Let your dataset`s live happily… ever after….»
Background: Discussion (or should we wait..?)
• Pros and cons of:- having good routines/following best practice for research data management
- research data sharing and/or Open Science
• From the video:– What if someone in your lab quits?– What if you need to use your old data?– What if you are accused of fraud?– What if you laptop is stolen?– What if you could get more credit for your work?– What if ...
depositphotos.com
The Lifecycle of Research Data Management
Planning
Phases:
Collecting
Processing
Archiving
Searching / reusing
Searching / reusing
• Good practice at the outset of your research project:– Literature survey– AND data survey
• Sources / places to search:– Research group, supervisor, colleagues, ...– (Electronic) literature– Directories, e.g.
Registry of Research Data Repository• Browse or search• Reference to research data:
Persistent Identifier (PID)– Research data archive(s)– Databases, e.g. DataCite
Data Storage
Data Storage: Backups
Avoid data loss by establishing good backup routines:• Regular backup• Several backups:
– Here: e.g. your computer– Near: e.g. your home directory at UiT (\\homer.uit.no)– Far: e.g. a cloud service like myDoc/OneDrive at UiT (https://
mydoc.uit.no/)• Shared storage areas, e.g. uDoc at UiT (https://udoc.uit.no/)• Versioning = keep track of changes
Check http://orakel.uit.no/ for help.
File and Folder Naming and Organizing
Some fundamental file naming recommendations:• Files should be named consistently• File names should be descriptive, but short (< 25 characters)• Use underscores ( _ ) instead of spaces• Avoid characters like “ / \ : * . ? ‘ < > [ ] ( ) & $ æÆ øØ åÅ ...• Use the international dating convention YYYY-MM-DDPossible strategies:• Order by date• Order by subject• Order by type• Forced order with numberingFolder naming and organization:• Choose a consistent system, stick to it, and document it in a ReadMe-file• Main structure should be visible in the file names
Documentation / metadata: ReadMe files
ReadMe file = description of your dataset / user guide to your dataBest practice recommendations for ReadMe files:• Start early• Describe
– contact information– what the dataset is about– file structure and naming conventions– where to find which data = overview of your files– methods and workflow– column headings in tabular data– abbreviations– units of measure– ...
• Save as Unicode .txt• Check specific metadata requirements (discipline, archive, ...)
Archiving: Preparing
Preparing your data for archiving:• Selection• Do not exclude negative / null data• Include raw version and analyzed version(s)• Provide your data in original AND persistent file format
Persistent file formats are usually• non-proprietary,• open, with documented international standards,• in common usage by the research community,• using standard character encodings (i.e. ASCII, UTF-8), and• uncompressed (space permitting)
Archiving: Persistent file formats
Persistent file formats for common document types:
Detailed data guide available here.
Type of Document Non-persistent format (examples)
Persistent format
Text MS Word (.docx) PDF/A
Spreadsheet MS Excel (.xlsx) Tabulator separated Unicode text (.txt)
Image Windows Bitmap (.bmp) Uncompressed TIFF
Sound AAC (.m4a) WAV
Video Quicktime (.mov) MPEG-4
Databases MS Access (.accdb) XML or tabulator separated Unicode text (.txt)
Archiving: Choosing your archive
• Research group, supervisor, colleagues, ...• Registry of Research Data Repository (http://www.re3data.org/)• UiTs own archive for open research: UiT Open Research Data (https://
opendata.uit.no/)
Planning
• Context dependent- which phase are you planning for?(designing, collecting, processing or analyzing, saving, archiving, dissemination)
• General rules- procrastinate behavior (postpone….) does not work- use common sense- ask for help/guidance
• DATA MANAGEMENTHave in mind:How would you…- collect- store- describe- and share your data?
Exercises
1. Find a relevant research data archive (within your discipline).
2. Check whether your own data comply with best practice for research data management. If you do not have own data available yet, you me use the following data set: http://tinyurl.com/zgb34mp.
Test base: http://henry2.ub.uit.no:8080/
User name: test01, test02, ... test10Password: test2016
Exercise: Create a dataset
1. Log in (production: Feide)2. Choose Add Data
and New Dataset3. Fill in the required
information in the form.Try to specify at least 3 keywords.
4. Upload one or more files.
Exercise: Register additional metadata
1. Go to the dataset and select the tab Metadata.
2. Select Add + Edit Metadata3. Look through the extended
metadata schema, and check if there is additional information that is relevant to the data set you have created.
Support at the UiT Library
• Check out our research support pages athttps://en.uit.no/ub/forskningsstotte#linje2 (English version) https://uit.no/ub/forskningsstotte#linje2 (Norwegian version)
• Contact us at [email protected] or contact your subject librarian, see https://en.uit.no/ub/fag#linje2 (English), https://uit.no/ub/fag#linje4 (Norwegian) for more info.
Course evaluation
• Please use 5 min. to fill in our electronic evaluation form athttp://bit.ly/ubevalen
• Teachers’ name: Lars Figenschou, Philipp Conzett• Date: 27.10.2016• Title of course: TC
• Thanks, and good luck!