use case: data edited as a book !!!

17
Use case: data edited as a book !!! CONNECTING COLLECTIONS 02-12-2014 Kepa J. Rodriguez (Gottingen State and University Library)

Upload: kepa-j-rodriguez

Post on 09-Jul-2015

249 views

Category:

Data & Analytics


0 download

DESCRIPTION

Presentation at the workshop "The Challenges of Publishing Finding Aids in a Digitally Joined-Up World" (The Hague, December 2014) about the extraction of structured data from edited books and its conversion in EAD.

TRANSCRIPT

Page 1: Use case: data edited as a book !!!

Use case: data edited as a book !!!

CONNECTING COLLECTIONS

02-12-2014

Kepa J. Rodriguez(Gottingen State and University Library)

Page 2: Use case: data edited as a book !!!

Outline

How do we import books into the portal?

Case 1: Jewish Archival Guide Belgium (ARA)

Case 2: Informator (IPN-Poland)

Some conclusions

Page 3: Use case: data edited as a book !!!

How do we import a book (1)

Page 4: Use case: data edited as a book !!!

How do we import a book (2)

Page 5: Use case: data edited as a book !!!

How do we import a book (3)

From our experience:

Important that the estructure of the book is good represented in the layout.

• A hierarchical table of contents helps to extract automatically the structure (later more in an example)

Layout and presentation should be consistent.

• Consistence in use of fonts, no spaces at the end of the lines, etc.

• Fonts and color can be useful if the document is converted/convertible into RTF.

– But... better... don't use colors in spreadsheets. Visual arts are beautiful but no useful.

Page 6: Use case: data edited as a book !!!

Case 1: Jewish Archival Guide Belgium (1)

Page 7: Use case: data edited as a book !!!

Case 1: Jewish Archival Guide Belgium (2)

Structure of the table of contents corresponds to the hierarchies of record groups.

That help us to infer the hierarchies in the EAD.

Page 8: Use case: data edited as a book !!!

Case 1: Jewish Archival Guide Belgium (3)

Page 9: Use case: data edited as a book !!!

Case 1: Jewish Archival Guide Belgium (4)

Descriptions of collections and fonds are compliant with ISAD(G) and other ICA standards.

Conversion in EAD tags using crosswalks.

The book provides the identifiers of the fonds in the hosting institutions.

Page 10: Use case: data edited as a book !!!

Case 1: Jewish Archival Guide Belgium (5)

Page 11: Use case: data edited as a book !!!

Case 1: Jewish Archival Guide Belgium (5)

Very good communication with the authors during the edition process.

Trilingual tagset (EN, NL, FR)

Use of the identifiers to find the original repositories.

Help in the selection of data using the subject keywords.

Mapping of the used keywords with terms of the EHRI thesaurus.

Page 12: Use case: data edited as a book !!!

Case 2: IPN – Informator (1)

Page 13: Use case: data edited as a book !!!

Case 2: IPN – Informator (2)

Book was written only for humans.

Part of the structure extracted by hand

Difficult to map the layout and structural information to standards.

Identifiers of the fonds in the IPN database are not provided.

At the end.... it took a lot of time and effort to produce something meaninful.

Page 14: Use case: data edited as a book !!!

Case 2: IPN – Informator (3)

Page 15: Use case: data edited as a book !!!

Some conclusions

Books and edited material are not the ideal way to share data.

Anyway they can be useful in this case if:

– Archival standards are used

– Use of standards is transparent

– Identifiers are provided

– Structure of the document reproduces the hierarchical organisation of the data in the archives.

– Layout of the doucument gives information about the different pieces of information

Page 16: Use case: data edited as a book !!!

NIOD Institute for War, Holocaust and Genocide Studies (NL)

 CEGES-SOMA Centre for Historical Research

and Documentation on War and Contemporary Society (BE)

 Jewish Museum in Prague (CZ)

 Institute of Contemporary History Munich – Berlin

(DE) 

YAD VASHEM The Holocaust Martyrs’ and Heroes’ Remembrance Authority (IL)

 The Wiener Library – Institute of Contemporary

History (UK) 

Holocaust Memorial Center (HU) 

HL-senteret Center for Studies of Holocaust and Religious Minorities (NO)

 NAF National Archives of Finland (FI)

 

The Emanuel Ringelblum Jewish Historical Institute (PL)

King’s College London (UK) Georg-August-Universität Göttingen – Göttingen State and University Library (DE) Athena RC/IMIS (GR) DANS Data Archiving and Networked Services (NL) Shoah Memorial, Museum, Center for Contemporary Jewish Documentation (FR) ITS International Tracing Service (DE) Memorial to the Murdered Jews of Europe (DE) Terezín Memorial (CZ) Beit Theresienstadt (IL) VWI Vienna Wiesenthal Institute for Holocaust Studies (AT)

CONNECTING KNOWLEDGE

Page 17: Use case: data edited as a book !!!

CONNECTING COLLECTIONS

What if you don't have ways to share the data?

02-12-2014