July 7, 2008 ISKO Montréal 1
ISKO 2008, Montréal
4W Vocabulary Mapping Across Diverse Reference Genres
Michael Buckland and Ryan Shaw (& others)
Electronic Cultural Atlas Initiative and School of Information, Univ. of California, Berkeley
Work supported by the Institute of Museum and Library Services and by the National Endowment for the Humanities.
Currently:
-- Distinct reference genres
-- Vocabulary mapping across similar vocabularies
-- Codex-like infrastructure
Need:
-- Interlinked reference genres
-- Vocabulary mapping across dissimilar vocabularies
-- Union index infrastructure
Context determines understanding!
Five ideas about use of digital corpora. . . .
1. Understanding requires knowing the context.
2. Using Internet resources should be like using a library reference collection – and as easy and as reliable.
3. Design: Find the context of any museum object, document, or performance: What is related to it in what it is, where it came from, when it originated, and who is associated with it?
4. WHAT, WHERE, WHEN, and WHO (“4W”) as a structure.
5. Make better use of existing descriptive metadata.
Any word, name, document, or event
Any resource: Audio, Images, Texts, Numeric data, Objects, Virtual reality, Webpages
Any catalog: Archives, Libraries, Museums, TV, Publishers
Connect it with its context – and other resources.
Facet   Vocabulary                             Displays
WHAT    Thesaurus (e.g. LCSH)                  Cross-references
WHERE   Gazetteer                              Map
WHEN    Period directory                       Timeline
WHO     Biograph. dict. (e.g. Who’s Who)       Personal relations
Context and relationships: Ireland and Irish Studies – Project diagram.
WHAT Subject headings Cross-references in & between vocabularies
Kung fu movies SEE Martial Arts films
FORMERLY Hand-to-hand fighting, oriental, in motion pictures
“Automobile” in four dialects:
-- PASS MOT VEH, SPARK IGN ENG (U.S. Import/Export statistics)
-- TL 205 (Library of Congress Classification)
-- 180/280 (US Patent classification)
-- 3711 (Standard Industrial Classification)
“HS 847120 Digital auto data proc mach contng in the same housing a CPU and input & output device.” (International Harmonized Commodity Classification System) = Computer!
NEED TO MAP TO & BETWEEN UNFAMILIAR VOCABULARIES
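A minimal sketch of mapping between unfamiliar vocabularies, assuming a hand-built crosswalk keyed by a shared concept. The codes are the ones quoted for “Automobile”; the scheme keys (us_trade_stats, lcc, us_patent, sic) and the translate function are hypothetical names chosen for illustration, not part of any project software.

```python
# Hypothetical crosswalk: one shared concept, one label per vocabulary.
# The labels are the codes quoted on the slide; the scheme keys are invented.
CROSSWALK = {
    "automobile": {
        "us_trade_stats": "PASS MOT VEH, SPARK IGN ENG",
        "lcc": "TL 205",
        "us_patent": "180/280",
        "sic": "3711",
    },
}


def translate(term, source, target, crosswalk=CROSSWALK):
    """Map a term from one vocabulary to another via the shared concept.

    Returns None when the term or the target vocabulary is unknown.
    """
    for labels in crosswalk.values():
        if labels.get(source) == term:
            return labels.get(target)
    return None


# A Library of Congress class number expressed as an industry code.
sic_code = translate("TL 205", "lcc", "sic")
```

Routing every lookup through the concept means each new vocabulary needs one added label per concept rather than a separate table for every pair of vocabularies.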
WHEN? What happened in IRELAND in 1690s?
Time Period Directory records in Google Earth. Zoom to Ireland and 1690s. Icon for siege of Limerick, 1690. Click link for library search. Catalog records list books and show context.
WHO Biographical Dictionary Complex relationships
Life events metadata
WHAT (Actions): prisoner
WHERE (Places): Holstein
WHEN (Times): 1261-1262
WHO (People): Margaret Sambiria
But ideally we need external links to the best resources!
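The life-event metadata above could be represented along these lines. This is a sketch only: the LifeEvent class, its field names, and the events_in helper are assumptions made for illustration, and the single record reuses the values shown above.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class LifeEvent:
    """One life event described by the four facets (hypothetical structure)."""
    what: str                                # action or status
    where: Optional[str] = None              # place name
    when: Optional[Tuple[int, int]] = None   # (start year, end year), inclusive
    who: Tuple[str, ...] = ()                # associated people


def events_in(events, year):
    """Return the events whose time span includes the given year."""
    return [e for e in events if e.when and e.when[0] <= year <= e.when[1]]


event = LifeEvent(
    what="prisoner",
    where="Holstein",
    when=(1261, 1262),
    who=("Margaret Sambiria",),
)
```

Keeping each facet as its own field lets the same record be placed on a map, on a timeline, or in a network of personal relations without re-parsing free text.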
Current project: Context finding for biographical texts.
Example: Electronic search engine pioneer.
Emanuel Goldberg, b. Moscow, 1881; son of Grigorii Goldberg; Univ. of Moscow, 1900-04; Ph.D w. Robert Luther, Leipzig Univ., 1906; Assistant, Adolf Miethe, TU Charlottenburg, 1906-07; Prof, Akad. f. graphische Künste, Leipzig, 1907-17; ICA, Zeiss Ikon, Dresden, 1917-1933; Kinamo cine camera, 1921; microdots, 1925; search engine, 1927; Contax 35 mm camera 1932; kidnapped by Nazi SA; refugee in Paris, 1933-37; Laboratory, Palestine, Israel, 1937; d. 1970.
WHO?
Click a name to search for an internet resource.
WHERE?
Trace a life-path.
WHAT?
Initial sketch for “Context Finding / Building” interface.
Save search path
Save link & notes as “stand-off” markup.
Save link & notes as embedded mark-up.
Insert / block text
Define facet
Ranked lists of suggested resources for each facet chosen
Display of search result
Hovering over a named entity highlights the areas where it appears in the text.
Named entities are linked to specific resources or dynamic searches over relevant databases.
Initially, named entities are linked to keyword searches at the appropriate name authorities and metadata services. Here we see a number of possible candidates for “Henry V”.
Now that it has been disambiguated, the named entity links directly to the appropriate record.
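The two linking states described above (a keyword search first, a direct record link once disambiguated) can be sketched as follows. The URLs, the NamedEntity class, and the record identifier are all hypothetical placeholders; no real name authority or metadata service is implied.

```python
from dataclasses import dataclass
from typing import Optional
from urllib.parse import quote_plus

# Hypothetical endpoints standing in for a name authority service.
SEARCH_URL = "https://authority.example.org/search?q="
RECORD_URL = "https://authority.example.org/record/"


@dataclass
class NamedEntity:
    text: str                        # the name as it appears in the text
    record_id: Optional[str] = None  # set once a candidate has been chosen

    def link(self) -> str:
        """Keyword search until disambiguated, then a direct record link."""
        if self.record_id is None:
            return SEARCH_URL + quote_plus(self.text)
        return RECORD_URL + self.record_id

    def disambiguate(self, record_id: str) -> None:
        """Record which candidate the reader selected."""
        self.record_id = record_id


entity = NamedEntity("Henry V")
search_link = entity.link()          # points at a keyword search
entity.disambiguate("record-0001")   # made-up identifier
record_link = entity.link()          # points at one specific record
```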
Edmund Hogan’s Onomasticon Goedelicum : Locorum et Tribuum Hiberniae et Scotiae = An Index, with Identifications, to the Gaelic Names of Places and Tribes
If it were searchable online, one could, when reading an Irish studies text:
1. Search it (Context finder);
2. Mark up the text with links to it (Context builder);
3. Mark up Hogan with reverse links to the Irish studies text (Context provider) – with rich consequences.
Facet genres include other facets
Library subject headings: Topic – Geographic subdivision – Chronological subdivision
Place name gazetteer: Place name – Type – Spatial markers (Lat & long) – When
Time Period Directory: Period name – Type – Time markers (Calendar) – Where
Biographical Dictionary: Person – Activity type – Time – Where – Who else
Facet genres with facets realigned.
                        What   Where   When   Who
WHAT (LCSH)             X      X       X      X
WHERE (Place Gazet.)    X      X       X      -
WHEN (Period dir.)      X      X       X      -
WHO (Biogr. dict.)      X      X       X      X
Vertical mappings extend semantic links between vocabularies, e.g. from LCSH “Lighthouses” to NGA Gazetteer Geographic Description Code “Lthse” (Lighthouse). Gazetteer entries give locations of instances.
Horizontal links provide additional context.
Facet   Vocabulary   Displays                 Reference Genre
WHAT    Topics       Cross-references         Dictionary, Encyclopedia
WHERE   Places       Maps                     Atlas, gazetteer
WHEN    Periods      Timeline                 Almanac, Chronology
WHO     Persons      Personal relationships   Biogr. dictionary, Who’s Who

Reference Genre             Vocabulary   Displays                 Facet
Dictionary, encyclopedia    Topics       Cross-refs               WHAT
Atlas, gazetteer            Places       Maps                     WHERE
Almanac, chronology         Time         Timelines                WHEN
Biogr. Dict., Who’s Who     Persons      Personal relationships   WHO
Paper-based reference collection: Codex determines structure and use.
Reversed in a digital environment: Metadata forms infrastructure.
And, better, build a union index, so you know where to look!
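One possible reading of a union index is a single lookup table, built over several reference sources, that reports which sources hold an entry for a given name. The sketch below assumes that reading; the source names, the sample entries, and the build_union_index function are invented for illustration and do not describe real holdings.

```python
from collections import defaultdict


def build_union_index(sources):
    """Merge per-source entry lists into one name -> [(source, facet)] index.

    `sources` maps a source name to a dict of {entry name: facet}.
    Names are case-folded so that lookups ignore capitalisation.
    """
    index = defaultdict(list)
    for source, entries in sources.items():
        for name, facet in entries.items():
            index[name.casefold()].append((source, facet))
    return dict(index)


# Illustrative entries only.
sources = {
    "gazetteer": {"Ireland": "WHERE"},
    "subject_headings": {"Ireland": "WHAT", "Lighthouses": "WHAT"},
    "period_directory": {"Siege of Limerick": "WHEN"},
}

union_index = build_union_index(sources)
where_to_look = union_index.get("ireland", [])
```

A single lookup then tells the reader which reference genres to consult for a name, instead of requiring a search of each one in turn.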