AIT, 2013 C3.5.0 v1.0 p. 1

COMPONENT REPORT

Project Acronym: OpenUp!

Grant Agreement No: 270890

Project Title: Opening up the Natural History Heritage for Europeana

C3.5.0 Integration with BHL Portal

Revision: Version 1.0

Authors (in alphabetical order):

Benda Odo AIT Forschungsgesellschaft mbH

Höller Astrid AIT Forschungsgesellschaft mbH

Koch Gerda AIT Forschungsgesellschaft mbH

Koch Walter AIT Forschungsgesellschaft mbH

Project co-funded by the European Commission within the ICT Policy Support Programme

Dissemination Level

P Public

C Confidential, only for members of the consortium and the Commission Services x


Revision History

Revision      Date         Author      Organisation   Description
Draft         2013-03-22   O. Benda    AIT            Draft and implementation
Draft         2013-04-03   A. Höller   AIT            Draft and concept of document
Draft         2013-04-04   A. Höller   AIT            Integration of BHL in Pentaho
Draft         2013-04-05   A. Höller   AIT            Examples
Draft         2013-04-22   G. Koch     AIT            Revision and Examples
Draft         2013-04-22   A. Höller   AIT            References, Screenshots
Draft         2013-04-22   W. Koch     AIT            Revision
Version 1.0   2013-04-22   G. Koch     AIT            Finalization

Distribution

Recipient             Date         Version   Accepted YES/NO
TMG                   2013-04-22   1.0
Project Coordinator   2013-04-22   1.0

TMG (AIT, BGBM, GBIF, IBSAS, MFN, MRAC, NHM, NHMW, RBGK, UH)

Project Coordinator (W.Berendsohn, BGBM)

Statement of Originality

This deliverable contains original unpublished work except where clearly indicated otherwise. Acknowledgement of previously published material and of the work of others has been made through appropriate citation, quotation or both.


Table of Contents

1 DESCRIPTION OF WORK

2 THE BHL WEB PORTAL

2.1 Access to bibliographic information in the BHL portal

2.2 Contextualization with bibliographic information by “Bibliography by URL”

3 INTEGRATING “BIBLIOGRAPHY BY URL” INTO PENTAHO

3.1 The Stylesheet

4 EXAMPLE

5 LIST OF FIGURES

6 LIST OF REFERENCES


1 DESCRIPTION OF WORK

This document describes the contextualization of OpenUp! metadata with bibliographic information from BHL. Figure 1 shows the overall workflow for ingesting records into Europeana; the step covered in this document is marked in red.

Figure 1 Ingesting records into Europeana (overall workflow)

For this report, the options for integrating OpenUp! metadata with the BHL-Europe web portal were evaluated. The investigation showed that the BHL-Europe web portal offers no API for data linkage. However, BHL-Europe is part of the international BHL (Biodiversity Heritage Library) consortium (http://biodivlib.wikispaces.com/BHL+Consortium+Membership), so the API “Bibliography by URL” provided by the Biodiversity Heritage Library (http://www.biodiversitylibrary.org/) can be used to link the OpenUp! data to biodiversity heritage literature. For a better understanding, this document starts with an overview of the BHL portal and its search functions. It then introduces the BHL API¹ called “Bibliography by URL”. This API is integrated into the Pentaho Transformation to create a new dc:relation element (see chapter 3).

1 Application Programming Interface, http://en.wikipedia.org/wiki/Application_programming_interface 22 April 2013


2 THE BHL WEB PORTAL

BHL, the Biodiversity Heritage Library, maintains a human-accessible web portal (http://www.biodiversitylibrary.org) that gives access to the biodiversity literature held in its storage system (see Figure 2).

Figure 2 The BHL web portal

The Biodiversity Heritage Library introduced a new user interface on March 18, 2013 (Figure 3 shows the new features of the portal).


Figure 3 The new BHL-Europe web portal²

2.1 Access to bibliographic information in the BHL portal

To search the BHL collections, use the General Search by typing a term into the simple search box (Figure 4 shows the example “Parnassius apollo”).

2 https://docs.google.com/document/d/1XXFm4MsAEX-3iZq7UlZDGvxkZHvFztf0gcINEfDU27s/pub# 22 April 2013.


Figure 4 General Search for “Parnassius apollo”

Clicking on the magnifier symbol or pressing Enter displays the results (see Figure 5).

Figure 5 Result of general search for “Parnassius apollo”

The simple search looks for matches in the indexed fields of five different result types. The result overview lists each result type with the number of records found in brackets.

In our example:

Books/Journals (0)

Articles/Chapters/Treatments (4)

Authors (0)

Subjects (0)

Scientific Names (87)

Clicking on the result type “Scientific Names” lists the 87 scientific names found, each with its number of occurrences (see Figure 6).


Figure 6 Scientific Names found for “Parnassius apollo”

Clicking on one of the names (in this example “Parnassius apollo”) displays the complete bibliography for that name, with Type, Title, Authors, Volume, Date and Page number (see Figure 7).


Figure 7 Bibliography for “Parnassius apollo”

Clicking on the magnifier symbol on the right opens the bibliography result in a new window (see Figure 8).

Figure 8 One of the bibliography results containing the scientific name “Parnassius apollo”


2.2 Contextualization with bibliographic information by “Bibliography by URL”

BHL is building APIs (Application Programming Interfaces) to allow individual users and data providers to remix and reuse BHL content³, and it makes its metadata available for public use under the CC0 1.0 Universal (CC0 1.0) Public Domain Dedication license⁴.

One of these APIs is “Bibliography by URL”. To easily link into a list of all pages containing a given scientific name, use the following URL:

http://www.biodiversitylibrary.org/name/Scientific_name

where Scientific_name is any uninomial, binomial, or trinomial. Replace spaces with the underscore ( _ ) character⁵.

In the previous example the link would be

http://www.biodiversitylibrary.org/name/Parnassius_apollo

Entering this link in the address bar of a browser returns the result shown in Figure 7 (Bibliography for “Parnassius apollo”).
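The URL rule described above can be illustrated with a short Python sketch. It is not part of the OpenUp! implementation: the function name bhl_name_url and the constant BHL_NAME_BASE are hypothetical, and the code only applies the documented rule of replacing spaces with underscores.

```python
# Illustrative sketch only; names below are assumptions, not part of BHL or OpenUp!.
BHL_NAME_BASE = "http://www.biodiversitylibrary.org/name/"


def bhl_name_url(scientific_name: str) -> str:
    """Build a "Bibliography by URL" link for a uninomial, binomial or trinomial."""
    # split() drops leading/trailing whitespace and collapses repeated spaces
    parts = scientific_name.split()
    if not parts:
        raise ValueError("scientific name must not be empty")
    # Documented rule: spaces are replaced with the underscore character
    return BHL_NAME_BASE + "_".join(parts)


if __name__ == "__main__":
    print(bhl_name_url("Parnassius apollo"))
```

For the name used in the previous example, the function returns the link http://www.biodiversitylibrary.org/name/Parnassius_apollo.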

3 INTEGRATING “BIBLIOGRAPHY BY URL” INTO PENTAHO

The transformation of the OpenUp! metadata from the ABCD format to the ESE/EDM format is processed with the Pentaho Kettle PDI tool.

In the Transformation “ABCD206_to_ESE34” (see Figure 9), the step “ABCD to ESE XSL” (marked in red) contains the stylesheet that is modified to integrate the BHL web portal data.

3 http://biodivlib.wikispaces.com/Developer+Tools+and+API 22 April 2013.

4 http://creativecommons.org/publicdomain/zero/1.0/ 22 April 2013.

5 http://biodivlib.wikispaces.com/Developer+Tools+and+API 22 April 2013.


Figure 9 Pentaho Transformation ABCD206_to_ESE34

3.1 The Stylesheet

To integrate the BHL web portal into the Transformation in Pentaho, the XSL Stylesheet used in the Add Constants step “ABCD to ESE XSL” is extended (see Figure 10).


Figure 10 The XSL Stylesheet used in the Pentaho Transformation

The following <xsl:template> element is added:

    <!-- Emits one dc:relation with a BHL "Bibliography by URL" link per scientific name -->
    <xsl:template name="BHLLink">
      <xsl:for-each select=".//abcd:ScientificName/abcd:FullScientificNameString">
        <dc:relation>
          <xsl:text>http://www.biodiversitylibrary.org/name/</xsl:text>
          <!-- Strip periods from the name, then pass it to the url-encode template -->
          <xsl:call-template name="url-encode">
            <xsl:with-param name="str"><xsl:value-of select="translate(., '.', '')"/></xsl:with-param>
          </xsl:call-template>
        </dc:relation>
      </xsl:for-each>
    </xsl:template>

Thus, for every “FullScientificNameString” that appears in a record, the link http://www.biodiversitylibrary.org/name/{FullScientificNameString} is created (with periods removed and the name URL-encoded) and written to a dc:relation element, which links to the available bibliography in the Biodiversity Heritage Library.
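The logic of the template can be mirrored in a short Python sketch for experimentation outside Pentaho. Several points are assumptions rather than facts from this report: the ABCD 2.06 namespace URI, the simplified sample record, the function name bhl_relations, and the use of urllib.parse.quote in place of the stylesheet's “url-encode” template, whose exact output is not shown here.

```python
import xml.etree.ElementTree as ET
from typing import List
from urllib.parse import quote

# Assumed namespace URI for ABCD 2.06; adjust to the schema version in use.
ABCD_NS = {"abcd": "http://www.tdwg.org/schemas/abcd/2.06"}
BHL_NAME_BASE = "http://www.biodiversitylibrary.org/name/"


def bhl_relations(record_xml: str) -> List[str]:
    """Return one BHL link per FullScientificNameString found in the record."""
    root = ET.fromstring(record_xml)
    path = ".//abcd:ScientificName/abcd:FullScientificNameString"
    links = []
    for node in root.iterfind(path, ABCD_NS):
        # Same effect as translate(., '.', '') in the stylesheet: drop all periods
        name = (node.text or "").replace(".", "")
        # quote() is a stand-in for the "url-encode" template (an assumption)
        links.append(BHL_NAME_BASE + quote(name, safe=""))
    return links


# Simplified, hypothetical record; real ABCD units nest these elements more deeply.
SAMPLE = """<abcd:Unit xmlns:abcd="http://www.tdwg.org/schemas/abcd/2.06">
  <abcd:ScientificName>
    <abcd:FullScientificNameString>Parnassius apollo L.</abcd:FullScientificNameString>
  </abcd:ScientificName>
</abcd:Unit>"""

if __name__ == "__main__":
    for link in bhl_relations(SAMPLE):
        print(link)
```

With percent-encoding, spaces become %20 rather than the underscore described in section 2.2; whether the stylesheet's own “url-encode” template behaves the same way cannot be determined from this report.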


4 EXAMPLE

After a data source has been transformed with Pentaho, the result can be viewed on the OAI-Provider platform. Figure 11 shows a record for the butterfly “Parnassius apollo”, including the BHL link in the element “Relation”.

Figure 11 Record on the OAI platform with BHL-link

When clicking on the link the bibliography shown in Figure 12 (compare Figure 7) opens in a new window.


Figure 12 Bibliographic information for “Parnassius apollo”

The contextualization of OpenUp! data with BHL content in the Europeana data display is demonstrated with the butterfly “Sericinus montela Gray, 1852” in Figure 13.


Figure 13 Contextualization with BHL content

The top left-hand side of the figure shows the record in the Europeana portal, with the BHL link in the metadata field “Relation”. Clicking on this link displays the bibliographic results from BHL (compare Figure 12). The result list contains all literature, and the pages within it, where the scientific name is quoted. Finally, clicking on one of the bibliographic titles opens the original book content.


5 LIST OF FIGURES

Figure 1 Ingesting records into Europeana (overall workflow)

Figure 2 The BHL web portal

Figure 3 The new BHL-Europe web portal

Figure 4 General Search for “Parnassius apollo”

Figure 5 Result of general search for “Parnassius apollo”

Figure 6 Scientific Names found for “Parnassius apollo”

Figure 7 Bibliography for “Parnassius apollo”

Figure 8 One of the bibliography results containing the scientific name “Parnassius apollo”

Figure 9 Pentaho Transformation ABCD206_to_ESE34

Figure 10 The XSL Stylesheet used in the Pentaho Transformation

Figure 11 Record on the OAI platform with BHL-link

Figure 12 Bibliographic information for “Parnassius apollo”

Figure 13 Contextualization with BHL content

6 LIST OF REFERENCES

ABCD - Access to Biological Collection Data. http://wiki.tdwg.org/ABCD 22 April 2013.

ABCDEFG - Access to Biological Collection Databases Extended for Geosciences. http://wiki.tdwg.org/twiki/bin/view/ABCD/DesignAbcdExtensions 22 April 2013.

BHL - Biodiversity Heritage Library, Developer Tools and API. http://biodivlib.wikispaces.com/Developer+Tools+and+API 22 April 2013.

C3.2.1 Domain specific vocabularies for EUROPEANA – interim, OpenUp! component

Europeana Data Model Mapping Guidelines. http://pro.europeana.eu/web/guest/edm-documentation 22 April 2013.

Europeana Data Model Definition v5.2.3. http://pro.europeana.eu/web/guest/edm-documentation 22 April 2013.

Europeana Semantic Elements (ESE) documentation. http://pro.europeana.eu/web/guest/technical-requirements 22 April 2013.