Just Digitise It - Daniel Wilksch of the Public Record Office Victoria

25 October 2017 Just Digitise It Community Heritage Grants Program digitisation workshop

Upload: national-library-of-australia

Post on 28-Jan-2018


TRANSCRIPT

Page 1: Just Digitise It - Daniel Wilksch of the Public Record Office Victoria

25 October 2017

Just Digitise It
Community Heritage Grants Program digitisation workshop

Session outline

1.35 pm Arrival and welcome

1.40 - 2.40 pm Planning a digitisation project

• Setting standards
• Resources needed
• Care of your originals
• Care of your copies

2.40 - 3.00 pm Digitisation facility tour / afternoon tea (first half)

3.00 - 3.20 pm Digitisation facility tour / afternoon tea (second half)

3.30 - 4.00 pm Negotiating rights before you digitise
Michael Proud, NLA

4.00 - 4.30 pm Providing access

• Getting images online
• Metadata and sharing images

4.30 - 5.00 pm Q & A

Notes for the session

http://beta.prov.vic.gov.au/community/managing-your-collection/just-digitise-it

6 project stages described:

• Planning
• Preparing
• Creating
• Describing
• Editing
• Publishing

Setting standards

Matching standards to the project

Two main impetuses for digitisation:

• Preservation
  – OHIO (only handle it once)
  – colour management, ‘master’ copies
• Access
  – search / discoverability
  – crowdsourcing

Factors

• How much material to copy?
• What condition? (preservation needs assessment)
• How much time / money do you have?
• Has somebody already digitised it? (books…)
• What is its significance? (significance statement, etc.)

How do we see?

http://en.wikipedia.org/wiki/Color_vision

Describing light with numbers

Smooth gradient

Broken into 16 steps (4-bit)

No intensity (0 in 8-bit scale)

Step 10 of 16 (160 in 8-bit scale)

Step 16 (255 in 8-bit scale)

Hint: the smooth gradient is in 8-bit steps – each level of intensity is 2 px wide in the original drawing.
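
The idea of breaking a smooth 8-bit gradient into 16 steps can be sketched in a few lines of Python. This is only an illustration: the function names are made up for the example, and the multiply-by-17 mapping back to the 8-bit scale is an assumption (it puts the top step exactly on 255), so the values it gives for the in-between steps may differ slightly from the labels on the slide.

```python
def to_4bit(value_8bit: int) -> int:
    """Reduce an 8-bit intensity (0-255) to one of 16 levels (0-15)."""
    if not 0 <= value_8bit <= 255:
        raise ValueError("expected a value between 0 and 255")
    return value_8bit >> 4  # keep only the top four bits


def to_8bit(level_4bit: int) -> int:
    """Spread a 4-bit level (0-15) back over the 8-bit scale (0-255)."""
    if not 0 <= level_4bit <= 15:
        raise ValueError("expected a value between 0 and 15")
    return level_4bit * 17  # assumption: 15 * 17 = 255, so the top level is full intensity


# A smooth gradient uses every 8-bit value once.
smooth_gradient = list(range(256))

# The same gradient after being squeezed through 4 bits.
stepped_gradient = [to_8bit(to_4bit(value)) for value in smooth_gradient]
distinct_levels = sorted(set(stepped_gradient))

print(len(smooth_gradient), "levels in the smooth gradient")
print(len(distinct_levels), "levels in the stepped gradient")
```

Fewer bits means fewer distinct intensities, which is what makes the banding visible in the stepped version.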

Bit-depth

Channel  Decimal  Hex  Binary
Red      255      ff   1111 1111
Green    255      ff   1111 1111
Blue     255      ff   1111 1111

24 BInary digiTs
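
As a small sketch of how the three 8-bit channels add up to 24 bits, the Python below writes one pixel in the same three notations as the table (decimal, hexadecimal, binary). The function name is invented for this example.

```python
def describe_pixel(red: int, green: int, blue: int) -> tuple[str, str, int]:
    """Return a pixel as hex text, binary text and a single 24-bit number."""
    channels = (red, green, blue)
    for value in channels:
        if not 0 <= value <= 255:
            raise ValueError("each channel must fit in 8 bits (0-255)")
    hex_text = "".join(f"{value:02x}" for value in channels)
    binary_text = " ".join(f"{value:08b}" for value in channels)
    # Pack the three 8-bit channels side by side into 24 bits.
    packed = (red << 16) | (green << 8) | blue
    return hex_text, binary_text, packed


hex_text, binary_text, packed = describe_pixel(255, 255, 255)
print(hex_text)      # white in hexadecimal
print(binary_text)   # the same pixel as 24 binary digits
print(packed)        # the largest value 24 bits can hold
```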

What is a digital image?

4d4d 002a 0000 ea68 ffff ff00 0000 0000 0000 0000 0000 0000 0000 0000 0000

.... Black pixels (0's) left out ....

0000 0000 00ff ffff 000e 0100 0003 0000 0001 0064 0000 0101 0003 0000 0001 00c8 0000 0102 0003 0000 0003 0000 eb16 0103 0003 0000 0001 0001 0000 0106 0003 0000 0001 0002 0000 0111 0004 0000 0001 0000 0008 0112 0003 0000 0001 0001 0000 0115 0003 0000 0001 0003 0000 0116 0003 0000 0001 00c8 0000 0117 0004 0000 0001 0000 ea60 0118 0003 0000 0003 0000 eb1c 0119 0003 0000 0003 0000 eb22 011c 0003 0000 0001 0001 0000 0153 0003 0000 0003 0000 eb28 0000 0000 0008 0008 0008 0000 0000 0000 00ff 00ff 00ff 0001 0001 0001

from http://local.wasp.uwa.edu.au/~pbourke/dataformats/tiff/

ffffff = 255,255,255 (r, g, b) in hexadecimal notation
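
The first eight bytes of the dump above are the TIFF header: a two-letter byte-order mark, the number 42, and the position in the file where the list of tags begins. A minimal Python sketch of reading those bytes follows. The function name is made up for the example, and it handles only the header, not the tags or the pixels.

```python
import struct

# The first eight bytes of the dump: 4d4d 002a 0000 ea68
HEADER_BYTES = bytes.fromhex("4d4d002a0000ea68")


def parse_tiff_header(data: bytes) -> tuple[str, int, int]:
    """Return (byte order mark, magic number, offset of the first tag list)."""
    if len(data) < 8:
        raise ValueError("a TIFF header is 8 bytes long")
    mark = data[:2]
    if mark == b"MM":
        byte_order = ">"  # big-endian: most significant byte first
    elif mark == b"II":
        byte_order = "<"  # little-endian: least significant byte first
    else:
        raise ValueError("not a TIFF byte-order mark")
    magic, first_ifd_offset = struct.unpack(byte_order + "HI", data[2:8])
    if magic != 42:
        raise ValueError("not a TIFF file")
    return mark.decode("ascii"), magic, first_ifd_offset


mark, magic, first_ifd_offset = parse_tiff_header(HEADER_BYTES)
print(mark, magic, first_ifd_offset)
```

Reading the tags in the dump the same way suggests an image 100 pixels wide (0064) and 200 pixels high (00c8) with 3 bytes per pixel. That is 60,000 bytes of pixel data which, after the 8-byte header, ends at byte 60,008, the offset ea68 given in the header.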

Resolution

A square 1 inch x 1 inch:

• @ 72 dpi = 5,184 pixels (‘screen’ resolution)
• @ 300 dpi = 90,000 pixels (standard ‘print’ resolution)

How many dpi is enough?

Original 75 x 53 mm (VPRS 8609/P30 unit 3, item 6/108)

600 dpi

4800 dpi. Could back off a little…
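
To get a feel for what these dpi figures mean in practice, the sketch below works out the pixel dimensions and uncompressed size of a scan of the 75 x 53 mm original. It assumes 24-bit colour (3 bytes per pixel) and no compression; the function names are invented for the example.

```python
MM_PER_INCH = 25.4


def pixels_per_square_inch(dpi: int) -> int:
    """Number of pixels in a 1 inch x 1 inch square at the given resolution."""
    return dpi * dpi


def scan_size(width_mm: float, height_mm: float, dpi: int,
              bytes_per_pixel: int = 3) -> tuple[int, int, int]:
    """Return (width in pixels, height in pixels, uncompressed size in bytes)."""
    width_px = round(width_mm / MM_PER_INCH * dpi)
    height_px = round(height_mm / MM_PER_INCH * dpi)
    return width_px, height_px, width_px * height_px * bytes_per_pixel


for dpi in (600, 4800):
    width_px, height_px, size_bytes = scan_size(75, 53, dpi)
    print(f"{dpi} dpi: {width_px} x {height_px} px, "
          f"about {size_bytes / 1_000_000:.0f} MB uncompressed")
```

Because the pixel count grows with the square of the resolution, going from 600 to 4800 dpi (8 times the dpi) gives roughly 64 times the data.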

Colour management

https://en.wikipedia.org/wiki/RGB_color_space

sRGB (monitor standard)

CIE Chart with sRGB gamut by spigget - own work. Licensed under CC BY-SA 3.0

Adobe RGB (1998)

Compression

http://en.wikipedia.org/wiki/JPEG

83,261 bytes 1,523 bytes

Image generations

Cute kitten (modified) courtesy: https://www.flickr.com/photos/foundanimalsfoundation/8469463762/

Original scene Photographic Negative Print

Digital ‘masters’

Modified copy

Histograms

Cropped image

‘Levels’ tool

‘Curves’ tool

Checking the levels

The most colours vs. the right colours

-- ‘positive’ images: scan for colour fidelity

-- negatives: scan for maximum tonal range (consider 16/48-bit)

It’s better to make a good scan than to correct a ‘poor’ one. A ‘Levels’ adjustment made when scanning is different to one made to an existing digital image.

Fidelity
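
One way to read the remark about ‘Levels’ is that stretching an existing 8-bit image cannot create tonal information that was never captured. The sketch below is a simplified, assumed model of a levels adjustment (a straight-line stretch between a black point and a white point), not the algorithm of any particular scanning or editing program.

```python
def apply_levels(values: list[int], black_point: int, white_point: int) -> list[int]:
    """Stretch 8-bit values so black_point maps to 0 and white_point to 255."""
    if not 0 <= black_point < white_point <= 255:
        raise ValueError("need 0 <= black_point < white_point <= 255")
    span = white_point - black_point
    stretched = []
    for value in values:
        clipped = min(max(value, black_point), white_point)
        stretched.append(round((clipped - black_point) * 255 / span))
    return stretched


# A 'dull' scan that only uses the middle of the scale: 151 distinct tones.
dull_scan = list(range(50, 201))

adjusted = apply_levels(dull_scan, black_point=50, white_point=200)
tones_used = len(set(adjusted))

print("darkest:", min(adjusted), "lightest:", max(adjusted))
print("distinct tones after adjustment:", tones_used)
print("unused levels (gaps in the histogram):", 256 - tones_used)
```

The adjusted image now runs from black to white, but it still holds only the tones the scan recorded, spread further apart. Setting the range at scan time lets the scanner record the full set of tones in the first place.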

PROV camera setup

Still image standards

National Library of Australia
http://www.nla.gov.au/standards/image-capture

Public Record Office Victoria
http://prov.vic.gov.au/government/standards-and-policy/capture

Photographing objects

Museums Australia (Victoria)

http://www.mavic.asn.au/resources/practical-training

Sound and moving pictures (advice)

National Film and Sound Archive
http://www.nfsa.gov.au/preservation/care/caring-for-film/
http://www.nfsa.gov.au/preservation/care/caring-for-audio/
http://www.nfsa.gov.au/preservation/services/

Resources needed

Physical resources

• Space
  – managed, secure (fire, flood, pests, ancient wiring, not about to be reclaimed by Council for boutique carparks, etc.)
  – flat (shelving, tables)
• Supplies
  – rehousing materials for copied originals
  – acid free paper, plastic film, gloves, pencils, spirit level, measuring tape / rulers, gaffer tape, extension cables, USB sticks, random things that aren’t too grubby

Physical resources

• Toys!
  – Scanning equipment
  – Colour calibration equipment
  – Workstations
  – Storage

Human resources

• Project manager
• Project committee (for when the manager heads off to Noosa)
• Tame experts
• Volunteers
  – what do you need from them?
  – what do they get out of it?

Documentation

• Digitisation policy/ strategy/ plan

• Project statement/ plan

• Risk management framework

• Specific policies/ procedures

• Written agreements with donors and digitisers

• Passwords. Write them down.

Permission

• From your group

• From your stakeholders

• From your donors/ owners of the material

• From your funders

Care of your originals

Collection management

• Are the items catalogued? (Does the catalogue make sense?)
• Are they securely stored?
• Do you know who owns what?

… things go missing.

Preservation management

• put it in a box (controls light, humidity, physical safety)
• wrap it in plastic (anything except PVC)
• write on the enclosure, not the object
• only take it out when you have to

… things get old.

Relationship management

• have some handling rules (gloves, induction)
• digitisation providers should be able to describe their security and preservation measures
• don’t break the original to digitise it

… things get dropped.

Further further reading

National Standards for Museums and Galleries
• http://www.collectionsaustralia.net/sector_info_item/107

Keeping Archives

Care of your copies

Hardware failure

• backups, offsite preferably
• understand the limits of the storage technology
• checksums – or just look at your images every so often
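
As a rough illustration of the checksum idea, the Python sketch below records a SHA-256 checksum for every file in a folder and later reports any file whose contents no longer match. The function names and the choice of SHA-256 are assumptions made for this example; the slides do not name a particular tool or algorithm.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the SHA-256 checksum of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(folder: Path) -> dict[str, str]:
    """Map each file's path (relative to folder) to its checksum."""
    return {
        path.relative_to(folder).as_posix(): sha256_of(path)
        for path in sorted(folder.rglob("*"))
        if path.is_file()
    }


def changed_files(folder: Path, manifest: dict[str, str]) -> list[str]:
    """List files from the manifest that are now missing or different."""
    current = build_manifest(folder)
    return sorted(name for name, checksum in manifest.items()
                  if current.get(name) != checksum)
```

A typical use would be to call `build_manifest` once after scanning, keep the result somewhere safe (for example saved with the backups), and run `changed_files` against it every so often.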

Hardware obsolescence

• migration, vary your storage options
• active management of collection

Software obsolescence

• open formats (image formats have been stable for decades)

• open applications (separate the data from the program)

• plan for and budget migrations

Poor management and documentation

• Bill is your IT guy. He has just fallen under a bus.

• Try not to implement systems you don’t understand.

Hardware failure

Digital Preservation

• backups, offsite preferably

• understand the limits of the storage technology

• checksums – or just look at your images every so often

Summary

Metadata

• All of your data needs to be easily extractable from the software it’s in.

• Create a simple file structure and make sure people stick to it.

• Manage your backups properly (no lending to people, manage your risks, NO shortcuts).

Getting images online

Originals and renditions

Constraints on delivering raw images

• Connection speed and bandwidth
• Screen size and resolution
• Control over rights to the image

Options for publishing

• Don’t publish at all…
• Use existing commercial tools and services (Flickr, Facebook, eHive)
• Use existing community services (Victorian Collections)
• Your own site (WordPress, Omeka)

What and why to put online

• Marketing your organisation (‘going viral’)
• Online archive (TROVE)
• Storytelling
• Online communities

Metadata and sharing images

Examples

Flickr: https://flickr.com

Trove: http://trove.nla.gov.au

Museum of Applied Arts and Sciences (Sydney): https://collection.maas.museum/object/96257

Definition

• Data about data (and data systems)
• Metadata shares and translates between
  – collections and users
  – collections and collections.

Possible stages in metadata usage

• in-house catalogue of collection
• standardisation of data captured
  card catalogue --> small museums cataloguing standard
• computerisation of data
  card catalogue --> InMagic database
• web publishing of data
  InMagic database --> CIDOC RDF in JSON via RESTful interface (http://data.culturehack.org.uk/dataset/37251018-British-Museum-object-catalog)

Kinds of metadata

Descriptive metadata
• Dublin Core (http://dublincore.org/)
• Victorian Collections (https://victoriancollections.net.au/api)

Structural metadata
• METS (http://www.loc.gov/standards/mets/)

Preservation metadata
• VERS (https://www.prov.vic.gov.au/recordkeeping-government/about-standards-framework-policies/vers-standard/vers-version-3)

Common metadata formats

• CSV (comma separated values)
• XML
• JSON

RDF (resource description framework) is a set of standards, formats and vocabularies that aims to encompass everything talked about.
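
To show how the same catalogue record looks in each of the three formats, here is a minimal Python sketch using only the standard library. The record, its values and the field names (loosely modelled on Dublin Core element names) are invented for illustration and are not taken from any real collection.

```python
import csv
import io
import json
import xml.etree.ElementTree as ET

# A hypothetical record; every value is a plain string.
record = {
    "identifier": "EX-0001",
    "title": "Example photograph",
    "date": "1920",
    "format": "image/tiff",
}


def to_csv(records: list[dict[str, str]]) -> str:
    """One header row of field names, then one row per record."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=list(records[0]), lineterminator="\n")
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()


def to_json(records: list[dict[str, str]]) -> str:
    """A list of objects, each with named fields."""
    return json.dumps(records, indent=2)


def to_xml(records: list[dict[str, str]]) -> str:
    """A <records> element holding one <record> per item."""
    root = ET.Element("records")
    for item in records:
        element = ET.SubElement(root, "record")
        for field, value in item.items():
            ET.SubElement(element, field).text = value
    return ET.tostring(root, encoding="unicode")


print(to_csv([record]))
print(to_json([record]))
print(to_xml([record]))
```

The content is identical in all three; only the packaging differs, which is why data kept in any of these open formats is relatively easy to move between systems.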

Metadata

• Think about how your existing data can be:
  – Categorised into different functions (descriptive, discovery, preservation, etc.)
  – Standardised (e.g. Dublin Core), enabling matches with other collections and websites.

Metadata

• Important things to record:
  – Identity (title, ‘control symbol’)
  – Classification (subject, function)
  – History (dates, purposes)
  – [Description]
  – Relationships

Identity

• Some items may not have titles. What is the thing that distinguishes one item from the next in a collection?
• Remember that physical cues are not the same as digital ones. Perhaps the filename of your image is the title?
• ‘Control Symbol’: catalogue / collection / record-keeping number.

Classification

• Information to manage items and help narrow down searches.
• Library: ‘subject’ – what is it about?
• Archive: ‘function’ – what does it do?
• Internet: ‘tagging’ – where did I put it again?
• Subject / topic list for images: http://www.picturethesaurus.gov.au/

History

• Archive / Museum: ‘provenance’ – where is it from? (which collection, which donor)
• Management history: what has happened to it? e.g. what date was it scanned?
• Scanning adds another layer to the existing management history that might be recorded in your collection database.

Description

• Extended stories about item (mum on a bike)
• Description of physical original – dimensions, special features
• Description of digital copy – dpi, file format

Relationships

• This object is part of this collection
• This book was owned by this person between these dates

The Resource Description Framework expresses all metadata in terms of relationships:

subject -> predicate -> object
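
The subject, predicate, object pattern can be sketched with plain Python tuples. This is not a real RDF library, and the identifiers and predicate names below are invented for illustration. Real RDF uses URLs for these names.

```python
from typing import NamedTuple


class Triple(NamedTuple):
    subject: str
    predicate: str
    object: str


# Hypothetical statements about a photo and a book.
triples = [
    Triple("photo-001", "isPartOf", "example-collection"),
    Triple("photo-001", "title", "Mum on a bike"),
    Triple("book-042", "ownedBy", "person-7"),
]


def describe(subject: str, statements: list[Triple]) -> dict[str, list[str]]:
    """Gather everything said about one subject, grouped by predicate."""
    description: dict[str, list[str]] = {}
    for statement in statements:
        if statement.subject == subject:
            description.setdefault(statement.predicate, []).append(statement.object)
    return description


print(describe("photo-001", triples))
```

Every fact is one small statement, so new kinds of information (another owner, a scan date) can be added as extra triples without redesigning a table.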

Modern web applications

• Every URL is a reference to a resource (book, photo, object, etc.)
• Information about that resource is attached to that URL in human-readable (web page) or machine-readable (Dublin Core, Resource Description Framework, etc.) form


Contributing to Trove

An example of how to share metadata.

• Harvesting tools built in to commonly used applications: http://help.nla.gov.au/trove/becoming-partner/for-smaller-collections
• Metadata embedded on pages (Sitemap method): http://help.nla.gov.au/trove/html-record

Q & A