information organization and taxonomy building an introduction david rashty, isaac waisberg

24
Information organization and taxonomy building An introduction David Rashty, Isaac Waisberg

Upload: reginald-hensley

Post on 12-Jan-2016

217 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Information organization and taxonomy building An introduction David Rashty, Isaac Waisberg

Information organization and taxonomy building

An introduction

David Rashty, Isaac Waisberg

Page 2: Information organization and taxonomy building An introduction David Rashty, Isaac Waisberg

Clutter and confusion are failures of design, not attributes of information

Edward Tufte

Page 3: Information organization and taxonomy building An introduction David Rashty, Isaac Waisberg

Contents

• Introduction (5 slides)• Definition (6 slides)• Building a taxonomy (8 slides)• Conclusion (1 slide)

Page 4: Information organization and taxonomy building An introduction David Rashty, Isaac Waisberg

The problem

• The world's total yearly production of print, film, optical, and magnetic content would require roughly 1.5 billion gigabytes of storage.

• This is the equivalent of 250 megabytes per person for each man, woman, and child on earth.

Introduction > The problem

1,500,000,000,000,000,000 bytes

Page 5: Information organization and taxonomy building An introduction David Rashty, Isaac Waisberg

The solution

• Organizing information in a logical manner so that it can be reliably accessed by anyone in the organization.

• In other words,

building a taxonomy

Introduction > The solution

Page 6: Information organization and taxonomy building An introduction David Rashty, Isaac Waisberg

Organizing information

Introduction > Organizing information 1

Page 7: Information organization and taxonomy building An introduction David Rashty, Isaac Waisberg

Organizing information

Introduction > Organizing information 2

IIIIIIIVXYXYXYXY

10.08.0410.09.1410.07.4610.06.588.06.958.08.148.06.778.05.7613.07.5813.08.7413.012.7

413.07.71

9.08.819.08.779.07.119.08.8411.08.3311.09.2611.07.8111.08.4714.09.9614.08.1014.08.8414.07.046.07.246.06.136.06.086.05.254.04.264.03.104.05.394.012.5

012.010.8

412.09.1312.08.1512.05.56

7.04.827.07.267.06.427.07.915.05.685.04.745.05.735.06.89

N = 11

Mean of X’s = 9.0

Mean of Y’s = 7.5

Regression line: Y=3+0.5X

Correlation = 0.82

Page 8: Information organization and taxonomy building An introduction David Rashty, Isaac Waisberg

Organizing information

Introduction > Organizing information 3

Page 9: Information organization and taxonomy building An introduction David Rashty, Isaac Waisberg

Definition

• A taxonomy is a structure for providing guidance over bits of information that relate to a specific context.

• It shows the user the groupings and relationships that can emerge from information in many patterns.

• Users can either zoom down to fine grained levels to pin down exact items, or they can scan and highlight different parts of the system.

• This process enables the potential creation of knowledge through open associations of relevant material.

Definition > Slide 1

DirectoriesThesaurus

Taxonomies

Page 10: Information organization and taxonomy building An introduction David Rashty, Isaac Waisberg

Definition > Slide 2

…providing guidance

over bits of information

Page 11: Information organization and taxonomy building An introduction David Rashty, Isaac Waisberg

Atlas of cyberspace

Definition > Slide 3

…zoom

down to fine

grained levels to

pin down exact items

Page 12: Information organization and taxonomy building An introduction David Rashty, Isaac Waisberg

Yahoo

Definition > Example 1

Page 13: Information organization and taxonomy building An introduction David Rashty, Isaac Waisberg

Dmoz

Definition > Example 2

Page 14: Information organization and taxonomy building An introduction David Rashty, Isaac Waisberg

Lycos

Definition > Example 3

Page 15: Information organization and taxonomy building An introduction David Rashty, Isaac Waisberg

Building a taxonomy

• Gather terms from as many sources as possible

• Define the preferred terms• Link synonyms and related terms• Group preferred terms by subject• Identify broader and narrower terms • Perform associative linking• Testing

Building a taxonomy

Page 16: Information organization and taxonomy building An introduction David Rashty, Isaac Waisberg

Gather terms

• Users, subject experts, the content itself, existing taxonomies.

• Entry terms should include synonyms and abbreviations, acronyms, and alternate spellings for all the important concepts in your document collection.

• Goto’s suggestion tool and similar tools can help finding relevant terms.

Building a taxonomy > stage 1

Page 17: Information organization and taxonomy building An introduction David Rashty, Isaac Waisberg

Define preferred terms

• Create guidelines for selecting preferred terms.

• In a collection of health-related documents that include terms such as cancer, oncology, skin, and dermatology, you'll need to decide whether to select medical terminology or regular English as the preferred terms, based on the type of language most appropriate to your primary audience.

Building a taxonomy > stage 2

Page 18: Information organization and taxonomy building An introduction David Rashty, Isaac Waisberg

Link synonyms

• This is where you map the synonyms, abbreviations, acronyms, and alternate spellings as "variant terms" to the preferred terms.

• Within reason, the more entry terms you have, the easier it will be for indexers and users to find the preferred terms.

Building a taxonomy > stage 3

Page 19: Information organization and taxonomy building An introduction David Rashty, Isaac Waisberg

Group preferred terms• This forms the foundation of your hierarchy.• Definition of the subject hierarchy should be

informed by a balance of top-down considerations (e.g., mission, vision, intended audiences) and bottom-up content analysis.

Building a taxonomy > stage 4

Page 20: Information organization and taxonomy building An introduction David Rashty, Isaac Waisberg

Broader/Narrower Terms

• You're defining where each term fits within the hierarchy.

• Existing taxonomies that cover your subject area or industry can prove extremely useful in generating ideas for broader and narrower terms.

Building a taxonomy > stage 5

Page 21: Information organization and taxonomy building An introduction David Rashty, Isaac Waisberg

Associative linking

• The definition of related terms is highly subjective.

• For each term ask the question: "Where will users want to go from here?"

• Choose only the most obvious and important relationships.

Building a taxonomy > Stage 6

Page 22: Information organization and taxonomy building An introduction David Rashty, Isaac Waisberg

Testing

• Taxonomies are built in order to help users find the information they need.

• The metrics for success should be users satisfaction while fulfilling information tasks using the taxonomy you have built.

Building a taxonomy > Stage 7

Page 23: Information organization and taxonomy building An introduction David Rashty, Isaac Waisberg

Conclusion

• Organizational structures within websites encourage emotional, fortuitous information seeking behavior.

• Directories are value laden tools of information organization that articulate a specific world view.

• Because taxonomies lay out synonymous, associative and hierarchical relationships, they function like guides to information.

• They support the dynamic process of finding information and facilitate associative thought.

Conclusion

Page 24: Information organization and taxonomy building An introduction David Rashty, Isaac Waisberg

We thrive in information-thick worlds because of our marvelous and everyday capacities to

the wheat from the chaff, and separate the sheep from the goats.

Edward Tufte

select

editsingle out

structure

highlight groupharmonizepair

merge synthesiz

e

boil down focus

list

catalog discrimina

teorganize

isolate

blend

abstract

look into pigeonhol

econdensereduce

choose

categorize

idealize

scan

distinguishscreen

pick over sortintegrate

inspectfilter

lump

skip

smoothchunk average

approximate

cluster

aggregateoutline

summarize itemize reviewdip into flip through

skimrefineenumerate synopsize

glean

winnow