principles of information systems

74
Principles of Information Systems Session 05 Recording and Remembering

Upload: pink

Post on 23-Feb-2016

33 views

Category:

Documents


0 download

DESCRIPTION

Principles of Information Systems. Session 05 Recording and Remembering. Recording and Remembering. Chapter 4. “ Life without memory is life with no connections to your past, present and future and no ties to the events and people around you ”. Susumu Tonegawa. Overview. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Principles of Information Systems

Principles of Information SystemsSession 05Recording and Remembering

Page 2: Principles of Information Systems

2

Recording and Remembering

Chapter 4

Page 3: Principles of Information Systems

3

“Life without memory is life with no connections to your past, present and future and no ties to the events

and people around you”

Susumu Tonegawa

Page 4: Principles of Information Systems

4

OverviewLearning objectives

1. Introduction

2. The act of recording

3. Remembering

4. Other aspects of memory

5. Summary

Page 5: Principles of Information Systems

5

Learning objectives• Explain why it is essential for ideas or knowledge to

persist through time for individuals and groups• Compare the issues involved in written versus non-

written records for a culture• Choose an appropriate form and medium to record

something in• Outline the principles of electronic records

management• List and describe several types of digital and non-

digital storage media

Page 6: Principles of Information Systems

6

Learning objectives• Describe some strategies to ensure digital data is not

lost over time• Describe several ways in which stored information can

become inaccurate over time• List and describe the problems and some strategies

for long term digital preservation• Explain how memory and ideas can persist through

time in a society or an organisation• Explain the role of translators, librarians and archivists

in ensuring that stored memory can be understood in the future

Page 7: Principles of Information Systems

What did you do today?

How will you remember this tomorrow?

Page 8: Principles of Information Systems

8

• I tell my friends • I keep a diary…• I have a blog …• I write to do lists in my PDA…• I wrote an essay…• I got a receipt from the ATM …• I take photos with my mobile phone…• I wrote a letter…• I backed up my computer …• …

Page 9: Principles of Information Systems

9

Introduction

• We have seen how observations, ideas and discovered data may be communicated and represented between humans

• To ensure this information can be used in the future, or outside its immediate context, it must be recorded in some way

• So what do you record, and how do you store it?

• And how do you make sure it will still be there when you come back for it?

1. Introduction2. The act of recording3. Remembering4. Other aspects of memory5. Summary

Page 10: Principles of Information Systems

10

Introduction

• Making a record also reduces the burden on human memory – you don’t have to know everything, just where to find it

-What you store and how you store it determines how readily you can find it again

Page 11: Principles of Information Systems

11

Digital information• The vast majority of information now produced is in

digital format, either:-Born digital (such as emails, text messages, digital photos)

-Digitised (such as scanned paper documents, digitised versions of old movies)

• Lots of this is ephemeral – needed only for an immediate purpose

• But what should be kept, what has ongoing value:-To an individual?

-To society as a whole?

Page 12: Principles of Information Systems

12

The act of recording

• One of the most basic technologies for recording is writing

-Any visible form of language, including pictograms and hieroglyphics

• Writing makes information potentially available to all members of a culture and across time

1. Introduction2. The act of recording3. Remembering4. Other aspects of memory5. Summary

Page 13: Principles of Information Systems

13

When information is communicated verbally, there must be a continuous, living link – often across generations

view

Page 14: Principles of Information Systems

Recording information enables it to be removed from its original context and retrieved later, in a different one

view

Page 15: Principles of Information Systems

15

Written and non-written languages

• Written language “can be collected, stored, examined, manipulated and analysed in ways (traditionally) impossible for spoken language”

• In earlier societies trained scribes preserved social remembering and culture using languages available to them

Chafe & Tannen, 1987

Page 16: Principles of Information Systems

16

Written and non-written languages• Knowledge that is explicitly not intended for collection

and manipulation may not be written down

-Sacred or secret material or practices may be held only by elders or privileged members of the society

• Some languages may not even have a written form

-A purely spoken language, however, is only one generation away from extinction, and depends on a living link

Page 17: Principles of Information Systems

17

Should everything be recorded?• Researchers may try to preserve a spoken language

and the ideas and cultural heritage embedded in its stories it from extinction

• There are risks of writing down stories that are traditionally oral:

-No control over who may read them-Misinterpretation of meaning-Exploitation of the original culture by outsiders

• What do you think – is it worth it

Page 18: Principles of Information Systems

18

Recording information

• A recording is never as good as the original-A film of a concert compared with being there-A translation of a poem-The record of your day in your diary compared with living it

• This means that choosing what to record, and how to record it, is an important decision

Page 19: Principles of Information Systems

19

Recording information • Choosing a representation for the record to be stored

-Who will use the record, does it have enough information for their purpose, is it in a form they can understand?

• What storage medium?-Paper or digital document; film or digital image; analogue music or digital audio…

• Which digital format?

Page 20: Principles of Information Systems

20

Which digital format to choose?• Whether it has the features you need for the particular

purpose-Quality - ‘true colour’ image, or CD-quality audio-Amount of storage required by the format

• Portability-How widely supported the format is by application software

• Whether it is likely to remain available in the future- ‘Official’ standards are guaranteed to persist, though may not be the best quality for the immediate purpose

Page 21: Principles of Information Systems

21

Formats and standards• Some formats are developed by official standards

bodies, such as ISO or W3C -JPEG images, PNG images, MPEG video and audio

• Other formats are proprietary, or owned by a software company

-GIF images, TIFF images in scanners

• Some proprietary formats are so widely used and supported that they are known as‘de facto’ standards

-PDF documents, MOV, WMA

Page 22: Principles of Information Systems

22

Formats• Sometimes the simplest format is the most likely to

persist

• Project Gutenberg, which aims to record all the world’s great books, is storing them as plain ASCII text

-This is almost guaranteed to be readable in the future, though does not have many features of more sophisticated formats (such as fancy fonts or page layout)

Page 23: Principles of Information Systems

23

Recorded information is never as good quality as the original

experience, so it is important to choose the right medium and

format for a digital record

Recap

Page 24: Principles of Information Systems

24

Managing electronic documents and records

• Records management (and information management generally) is likely to be a key skill for informatics professionals

-Huge amounts of digital information is being produced as a by-product of normal commercial transactions

-Many government, business or medical organisations require information to be kept for legal purposes

Page 25: Principles of Information Systems

25

Managing electronic documents and records involves…• Storing information • Tracking and locate information, finding gaps in a

collection, or missing items• Answering queries about information• Keeping valuable assets secure, accessible only to the

authorised• Deciding what is worth keeping

- trade off between information collection and storage costs, and its value in a future context

Page 26: Principles of Information Systems

26

Managing electronic documents and records• Storage• Indexing• Access• Preservation• Security• Governance• Social continuation of knowledge

Page 27: Principles of Information Systems

27

Remembering

• Things are recorded so that they can be remembered- By an individual

- By an organisation

- By a government

- By a society

• The record substitutes for an individual’s memory or personal experience

1. Introduction2. The act of recording3. Remembering4. Other aspects of memory5. Summary

Page 28: Principles of Information Systems

28

Storage

• Storage is a way of ensuring something persists-Any record will last as long as the medium it is stored on (digital or non-digital) lasts

• A storage strategy involves:-Choice of storage medium-Storage strategy – what to back up, and how frequently

-Replication – multiple copies are usually made

Page 29: Principles of Information Systems

29

Backing up a movie collection over several disks

3 movies, 4 disks

Page 30: Principles of Information Systems

Backing up a movie collection over several disks

If this disk is lost we can still

recover Movie 1 from the other

disks

Page 31: Principles of Information Systems

Backing up a movie collection over several disks

If this disk is lost we can still

recover Movie 2 from the other

disk

Page 32: Principles of Information Systems

Backing up a movie collection over several disks

If this disk is lost we can still

recover Movie 2 from the other

disk

Page 33: Principles of Information Systems

Backing up a movie collection over several disks

If this disk is lost we can still

recover Movie 3 from the other

disks

Page 34: Principles of Information Systems

34

“Lots of Copies Keep Stuff Safe”

• LOCKSS is a strategy for making sure a library can continue to maintain its digital collection of article through time

• Individual libraries in the LOCKSS collective don’t have to keep backups of all their collections, because the entire collection is constantly circulating among all members

• Peer-to-peer checking between documents makes sure there is always a correct copy in circulation

Page 35: Principles of Information Systems

35

Storage media• Magnetic media

-Hard disks, floppy disks, external drives

• Optical media-CD-ROM (CD-RW, CD-R)

• Flash memory-USB sticks, memory sticks

• Storage Area Networks-Link different types of storage together over a high speed network

• Online storage-Photos, videos etc stored on remote website

Page 36: Principles of Information Systems

36

“Archives are the documents accumulated by a natural process in the course of the conduct of affairs of any kind, public or private, at any date, and preserved thereafter for reference, in their own custody, by the persons responsible of the affairs in question of their successors” Sir Hilary Jenkinson

written documents

mapsnewspapers

photos

sound recordings

personal collections

filmsdances

buildings

carvingsoral records

artistic works

cave paintings

CDs

Page 37: Principles of Information Systems

37

Archiving

• Archives provide a record of all types of documents relevant to cultural memory

-The counterpart to this is memoricide, where museums and cultural archives are deliberately destroyed under ‘ethnic cleansing’ regimes

• Archives are physically housed in dedicated institutions-This involves expertise in security, climate control, space usage, etc

Page 38: Principles of Information Systems

38

Digital asset management

• Increasingly, the documents that are stored as records or archives are digital (either born digital, or digitised)

• Similar issues apply to categorising, labelling, searching, etc

• But long term preservation of digital artefacts brings its own issues and challenges

Page 39: Principles of Information Systems

39

Records management is a key skill for informatics professionals.

It involves issues of storage, indexing, access, preservation, security,

governance, and the social continuation of knowledge.

Recap

Page 40: Principles of Information Systems

What software do you currently have on your computer?

What are your backups stored on?

Page 41: Principles of Information Systems

What would your answer have been this time last year?

What would it have been 5 years ago?

Page 42: Principles of Information Systems

42

“Digital documents last forever or five years, whichever is sooner”

Jeff Rothenberg

Page 43: Principles of Information Systems

43

Preservation and digital preservation

• Preservation is an aspect of archiving that ensures what is stored survives in good condition

• Digital preservation is doing this for digital or digitised artefacts

Digital preservation: A process by which digital data is preserved in digital form in order to ensure the usability, durability and intellectual

integrity of the information contained therein

Page 44: Principles of Information Systems

44

Digital preservation• The definition emphasises the preservation of the

information contained in the digital artefact, rather than the artefact itself

• This means that the strategies for preserving digital objects are different from those traditionally used to preserve physical objects

• A book may decay, but the words in it are preserved in digital form

Page 45: Principles of Information Systems

45

Digital longevity

Howard Besser identified 5 classic problems:

• The viewing problem- We cannot see what is on a disk or file without the relevant software

• The scrambling problem- Many files are encrypted or compressed, and the schemes for de-crypting may become lost

Page 46: Principles of Information Systems

46

Digital longevity

• The translation problem- Changing the form of a digital artefact may change its meaning

• The inter-relation problem- Many digital artefacts such as websites have many component parts – where is the ‘boundary’ of the object?

• The custodial problem- Who is responsible for preserving digital information?

Page 47: Principles of Information Systems

47

Digital longevity• The problem of authenticity is also significant –

especially for ‘born digital’ artefacts

- How do we know that a digital artefact has not been modified (intentionally or otherwise) since it was created?

- This is a growing problem in legal informatics, where digital documents may be required as evidence

Page 48: Principles of Information Systems

48

Strategies for digital preservation

?• Replication

- Simply copying the artefact on to different media, or more up to date media

- Avoids the problem of media rot but not format obsolescence

• Migration- Moving to a different format and/or updating the media on which the artefact is stored

- Standard archival formats are used to counter obsolescence

Page 49: Principles of Information Systems

49

Strategies for digital preservation

• Emulation- Preserving the original ‘look and feel’ of the artefact through emulating the hardware and software platform of the original in software

-Old arcade games

• Encapsulation- Storing the artefact together with metadata and other information required to run it

Page 50: Principles of Information Systems

50

Version control

• As soon as a digital document is revised and saved elsewhere you have two versions – so need some form of version control

• This involves - A consistent naming convention to identify different copies- Controlling who has access to the various copies, and how they can modify them

Page 51: Principles of Information Systems

51

Preserving the Internet• There is a huge amount of digital content on the Internet

which is added to constantly and which disappears again almost as fast

• But nobody owns the Internet, so nobody is directly responsible for preserving its resources

• Some efforts exist though:- Internet Archive

-International Internet Preservation Consortium

-National libraries, such as PANDORA archive at National Library of Australia

Page 52: Principles of Information Systems

52

Internet Archive www.archive.org

• Captures and stores old web pages-Wayback Machine allows viewing of old pages

• Collaborates with institutions such as Library of Congress and Smithsonian to preserve cultural material

• Also houses many digital collections of audio, text and moving images, such as Live Music Archive

• The Archive’s aims are to make its collections available to everyone

Page 53: Principles of Information Systems

53

Preserving your own stuff

• How do you keep your own digital stuff?- CD, Disk drives …

• Many websites retain copies of your photos, videos or writings indefinitely

- Flickr

- Youtube

- Internet Archive

Page 54: Principles of Information Systems

54

Data archiving standards and metadata• To be able to access stored data, it must be stored in a

way that enables it to be retrieved

• Metadata is information describing a resource (such as photo or text) which helps indicate the categories it can fit into

• Using a standard scheme can help ensure resources can be catalogued and searched as part of a wider resource

Page 55: Principles of Information Systems

55

Some cataloguing schemes

• Dublin Core Element Set-A core set of metadata elements used mostly for describing web pages

• LCSH – Library of Congress Subject Headings-A set of keywords used to describe subject

• AACR, AACR2 – Anglo-American Cataloguing Rules-Cataloguing rules and keywords for libraries

• RDA – Resource Description and Access-Revised AACR2 for digital content

Page 56: Principles of Information Systems

56

Finding stuff on the Web

• Web resources are identified by address or URL - But problem is that URLs can disappear or their contents change

• Addressing schemes for persistent identifiers:- PURL (Persistent URL)

- DOI (Digital Object Identifier)

- ARK (Archival Resources Key)

Page 57: Principles of Information Systems

57

Semantic Web

• The idea of the Semantic Web is that instead of humans needing to process the ‘meaning’ of web pages, the computer will be able to

• All documents on the web would be marked up according to standards for data description and association

- Resource Description Framework- Dublin Core Metadata Element Set- URL- XML

Page 58: Principles of Information Systems

58

Long term preservation of digital records is a significant issue.

There are various strategies for digital preservation, and international collaboration

and standards will be important.

Recap

Page 59: Principles of Information Systems

59

Persistence

• The preservation of records and digital artefacts is at one remove from our experience

- If the context of author and reader are aligned, the record may be understood as intended

• But nothing can be exactly the same

• What changes, and what persists?

1. Introduction2. The act of recording3. Remembering4. Other aspects of memory5. Summary

Page 60: Principles of Information Systems

60

Persistence

• Ensuring that something persists is a common concern for everyday life

- Objects or entities

- Ability to do things

- Social concepts

• What allows us to do this is memory• Matching memory against the present is what is

interpreted in your mind as continuity and familiarity

Page 61: Principles of Information Systems

61

Types of human memory - declarative

• Semantic memory – for facts- ‘Paris is the capital of France’

• Episodic memory – for events you have personally experienced

- ‘Last year I went on holiday to Paris’

• Semantic and episodic memory together contribute to declarative knowledge– Enables you to make statements about things

that you know and which have happened

Page 62: Principles of Information Systems

62

Types of human memory – procedural • Procedural memory – for how to do things

- How to pack a suitcase

- How to brush your teeth

- How to ride a bicycle

Page 63: Principles of Information Systems

63

• Describe how you fold your arms

• Is your left arm over your right, or your right arm over your left?

• You remember the procedural knowledge (you know how to do it) but can’t necessarily describe how to do it (declarative knowledge)

step

Page 64: Principles of Information Systems

64

Types of memory in informatics

• Informatics uses concepts of both declarative and procedural knowledge in designing artificial memories

- Reasoning with facts and rules

Jim bought nappies and beer

Jack bought nappies and beer

all men who buy nappies also buy beer

- Stored procedures for calculation

=SUM (A5:A10)

Page 65: Principles of Information Systems

65

Social memory

• Social memories transcend individuals and are built into way a society identifies and conducts itself

- National holidays

- Traditional stories, songs and art

- Telling jokes about the neighbouring region

• These traditions can persist even if individuals die or written records are lost

Page 66: Principles of Information Systems

66

Organisational memory

• Records of data and general procedures are maintained for an organisation

• There is also often a legal requirement to store records for some period of time

Page 67: Principles of Information Systems

67

Decay and obsolescence

• Physical media may degrade-Data stored on them may be lost

• Later versions of applications supersede earlier

• Information may become obsolete-Although the information itself is still there, meaning can be lost-A living system of meaning is required if knowledge is to last

• Remembering and doing may be essential to preserve meaning – not just the written record

Page 68: Principles of Information Systems

68

Social memory and culture• Entropy is the tendency of everything towards a state of

disorder, negative entropy or order acts against this• Human knowledge generally consists in stabilising some

order of things that are agreed as meaningful over some length of time

• An order that outlasts the lifetime of an individual, that enters social memory and tradition, is the stuff of cultural heritage and the social identity of a people

-Preserving such order is the function of institutions such as libraries, museums and galleries

-Theatre, colleges, art, religions explore and reinterpret themes of social value

Page 69: Principles of Information Systems

69

Which ideas will endure?

• At any time in a culture, there are themes that are ephemeral and others that turn out to be enduring

• Words that change their meaning over time are especially treacherous in preserving understanding

-Gay, wicked, bad …

• As language changes, information expressed in language is unreliable if you don’t take into account the times in which it was written

Page 70: Principles of Information Systems

70

Which ideas will endure?

• The work of translators, editors, librarians and archivists is crucial in ensuring ideas and concepts persist over time

- Must be able to distinguish concepts from the words that convey them

Page 71: Principles of Information Systems

71

Persistence of memory is what gives us our concept of reality.

The persistence of ideas and knowledge through time is essential for the

continuity of social and cultural identity.

Recap

Page 72: Principles of Information Systems

72

Summary• Written records allow data to be removed from its

original context and used in others• Records management is a significant issue for

informatics• Recorded information is never as good quality as the

original experience, so the choice of medium and format is important

• Formats and applications can become obsolete, and storage media can decay

• Active strategies are needed to preserve digital information in the long term

Page 73: Principles of Information Systems

73

Summary

• Information can become inaccurate through going out of date

• The persistence of ideas and knowledge through groups and society is essential for the continuity of social and cultural identity

• Language changes through time, so being able to distinguish concepts from the words that convey them is essential

Page 74: Principles of Information Systems

7474