july 7, 2008isko montréal1 isko 2008, montréal 4w vocabulary mapping across diverse reference...

25
July 7, 2008 ISKO Montréal 1 ISKO 2008, Montréal 4W Vocabulary Mapping Across Diverse Reference Genres Michael Buckland and Ryan Shaw (& others) Electronic Cultural Atlas Initiative and School of Information, Univ. of California, Berkeley Work supported by the Institute for Museum and Library Studies and by the National Endowment for the Humanities.

Post on 22-Dec-2015

214 views

Category:

Documents


0 download

TRANSCRIPT

July 7, 2008 ISKO Montréal 1

ISKO 2008, Montréal

4W Vocabulary Mapping Across Diverse Reference Genres

Michael Buckland and Ryan Shaw (& others)Electronic Cultural Atlas Initiative and School of Information, Univ. of California, Berkeley

Work supported by the Institute for Museum and Library Studies and by the National Endowment for the Humanities.

July 7, 2008 ISKO Montréal 2

Currently:

-- Distinct reference genres

-- Vocabulary mapping across similar vocabularies

-- Codex-like infrastructure

Need:

-- Interlinked reference genres

-- Vocabulary mapping across dissimilar vocabularies

-- Union index infrastructure

July 7, 2008 ISKO Montréal 3

Context determines understanding!

Five ideas about use of digital corpora. . . .

1. Understanding requires knowing the context.

July 7, 2008 ISKO Montréal 4

Five ideas about use of digital corpora. . . .

1. Understanding requires knowing the context.

2. Using Internet resources should be like using a library reference collection – and as easy and as reliable.

3. Design: Find the context of any museum object, document, or performance: What is related to it in what it is, where it came from, when it originated, and who is associated with it?

4.WHAT, WHERE, WHEN, and WHO (“4W”) as a structure.

5.Make better use of existing descriptive metadata.

July 7, 2008 ISKO Montréal 5

Any word, name, document, or event

Any resource:Audio, Images, Texts, Numeric data, Objects, Virtual reality, Webpages

Any catalog: Archives, Libraries, Museums, TV, Publishers

Connect it with its context – and other resources.

Facet Vocabulary Displays

WHAT Thesaurus Cross- e.g. LCSH references

WHERE Gazetteer Map

WHEN Period directory Timeline

WHO Biograph. dict. Personal e.g. Who’s Who relations

Context and relationships: Ireland and Irish Studies – Project diagram.

July 7, 2008 ISKO Montréal 6

WHAT Subject headings Cross-references in& between vocabularies

Kung fu movies SEE Martial Arts filmsFORMERLY Hand-to-hand fighting, oriental, in motion pictures

“Automobile” in four dialects: - PASS MOT VEH, SPARK IGN ENG (U.S. Import/Export statistics) - TL 205 (Library of Congress Classification) - 180/280 (US Patent classification) - 3711 (Standard Industrial Classification)

“HS 847120 Digital auto data proc mach contng in the same housing a CPU and input & output device.”(International Harmonized Commodity Classification System).

NEED TO MAP TO & BETWEEN UNFAMILIAR VOCABULARIES

= Computer!

July 7, 2008 ISKO Montréal 7

WHEN? What happened in IRELAND in 1690s?

Time Period Directory records in Google Earth. Zoom to Ireland and 1690s. Icon for siege of Limerick, 1690. Click link for library search. Catalog records list books and show context.

July 7, 2008 ISKO Montréal 8

WHO Biographical Dictionary Complex relationships

Life events metadata

WHAT: Actions prisoner

WHERE: Places Holstein

WHEN: Times

1261-1262

WHO: People Margaret Sambiria

But ideally we need external links to the best resources!

Current project: Context finding for biographical texts.

Example: Electronic search engine pioneer.

July 7, 2008 ISKO Montréal 9

Emanuel Goldberg, b. Moscow, 1881; son of Grigorii Goldberg; Univ. of Moscow, 1900-04; Ph.D w. Robert Luther, Leipzig Univ., 1906; Assistant, Adolf Miethe, TU Charlottenburg, 1906-07; Prof, Akad. f. graphische Künste, Leipzig, 1907-17; ICA, Zeiss Ikon, Dresden, 1917-1933; Kinamo cine camera, 1921; microdots, 1925; search engine, 1927; Contax 35 mm camera 1932; kidnapped by Nazi SA; refugee in Paris, 1933-37; Laboratory, Palestine, Israel, 1937; d. 1970.

WHO?

Click a name to search for an internet resource.

July 7, 2008 ISKO Montréal 10

Emanuel Goldberg, b. Moscow, 1881; son of Grigorii Goldberg; Univ. of Moscow, 1900-04; Ph.D w. Robert Luther, Leipzig Univ., 1906; Assistant, Adolf Miethe, TU Charlottenburg, 1906-07; Prof, Akad. f. graphische Künste, Leipzig, 1907-17; ICA, Zeiss Ikon, Dresden, 1917-1933; Kinamo cine camera, 1921; microdots, 1925; search engine, 1927; Contax 35 mm camera 1932; kidnapped by Nazi SA; refugee in Paris, 1933-37; Laboratory, Palestine, Israel, 1937; d. 1970.

WHERE?

Trace a life-path.

July 7, 2008 ISKO Montréal 11

Emanuel Goldberg, b. Moscow, 1881; son of Grigorii Goldberg; Univ. of Moscow, 1900-04; Ph.D w. Robert Luther, Leipzig Univ., 1906; Assistant, Adolf Miethe, TU Charlottenburg, 1906-07; Prof, Akad. f. graphische Künste, Leipzig, 1907-17; ICA, Zeiss Ikon, Dresden, 1917-1933; Kinamo cine camera, 1921; microdots, 1925; search engine, 1927; Contax 35 mm camera 1932; kidnapped by Nazi SA; refugee in Paris, 1933-37; Laboratory, Palestine, Israel, 1937; d. 1970.

WHAT?

July 7, 2008 ISKO Montréal 12

Initial sketch for “Context Finding / Building” interface.

Save search path

Save link & notes as “stand-off” markup.

Save link & notes as embedded mark-up.

Insert / block text

Define facet

Ranked lists of suggested resources for each facet chosen

Display of search result

July 7, 2008 ISKO Montréal 13

Scanned text Named Entities

July 7, 2008 ISKO Montréal 14

Hovering over a named entity highlights the areas where it appears in the text.

July 7, 2008 ISKO Montréal 15

Named entities are linked to specific resources or dynamic searches over relevant databases.

July 7, 2008 ISKO Montréal 16

Initially, named entities are linked to keyword searches at the appropriate name authorities and metadata services. Here we see a number of possible candidates for “Henry V”.

July 7, 2008 ISKO Montréal 17

Now that it has been disambiguated, the named entity links directly to the

appropriate record.

July 7, 2008 ISKO Montréal 18

Named entities not detected automatically can be added manually.

July 7, 2008 ISKO Montréal 19

Edmund Hogan’s Onomasticon Goedelicum : Locorum et Tribuum Hiberniae et Scotiae = An Index, with Identifications, to the Gaelic Names of Places and Tribes

If searchable online, one could, when reading an Irish studies text:

1. Search it (Context finder)

2. Markup text with links to it (Context builder);

3. Markup Hogan with reverse links to the Irish studies text (Context provider) – with rich consequences.

July 7, 2008 ISKO Montréal 20

July 7, 2008 ISKO Montréal 21

July 7, 2008 ISKO Montréal 22

July 7, 2008 ISKO Montréal 23

Facet genres include other facets

Library subject headingsTopic – Geographic subdivision – Chronological subdivision

Place name gazetteerPlace name – Type – Spatial markers (Lat & long) – When

Time Period DirectoryPeriod name – Type – Time markers (Calendar) – Where

Biographical DictionaryPerson – Activity type – Time – Where – Who else

July 7, 2008 ISKO Montréal 24

Facet genres with facets realigned.

What Where When WhoWHAT (LCSH) X X X X

WHERE (Place Gazet.) X X X -

WHEN (Period dir.) X X X -

WHO (Biogr dict.) X X X X

From LCSH “Lighthouses” to NGA Gazetteer Geographic Description Code “Lthse” (Lighthouse). Gazetteer entries give locations of instances.

Vertical mappings extend semantic links vocabularies, e.g.Horizontal links provide additional context.

July 7, 2008 ISKO Montréal 25

Facet Vocabulary Displays Reference GenreWHAT Topics Cross-references Dictionary, EncyclopediaWHERE Places Maps Atlas, gazetteerWHEN Periods Timeline Almanac, ChronologyWHO Persons Personal relationships Biogr.dictionary, Whos Who

Reference Genre Vocabulary Displays FacetDictionary, encyclopedia Topics Cross-refs WHATAtlas, gazetteer Places Maps WHEREAlmanac, chronology Time Timelines WHENBiogr. Dict., Who’s Who Persons Personal relationships WHO

Paper-based reference collection: Codex determines structure and use.

Reversed in a digital environment: Metadata forms infrastructure.

And, better, build a union index, so you know where too look!