renewal of eurogeonames central serviceinstance with its own service database •the production...

22
Renewal of EuroGeoNames Central Service Pekka Latvala Antti Jakobsson Lassi Lehto Saulius Urbanas Jaakko Kähkönen Finnish Geodetic Institute EuroGeographics INSPIRE 2012 27.06.2012

Upload: others

Post on 10-Aug-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Renewal of EuroGeoNames Central Serviceinstance with its own service database •The production database will be kept as a separate instance 8. Architecture of the renewed service

Renewal of EuroGeoNames

Central Service

Pekka Latvala Antti Jakobsson

Lassi Lehto Saulius Urbanas

Jaakko Kähkönen

Finnish Geodetic Institute EuroGeographics

INSPIRE 2012

27.06.2012

Page 2: Renewal of EuroGeoNames Central Serviceinstance with its own service database •The production database will be kept as a separate instance 8. Architecture of the renewed service

Overview

• Background

• Reasons for the service renewal

• Original EGN service

• Renewed EGN service

• Schema mappings

• Current status

• Future work

2

Page 3: Renewal of EuroGeoNames Central Serviceinstance with its own service database •The production database will be kept as a separate instance 8. Architecture of the renewed service

Background

• The EuroGeoNames (EGN) project (2006 – 2009) was

implemented by EuroGeographics as a part of the

eContentPlus project

• EGN service is a Gazetteer service that serves geographical

names data from several European countries

• EGN service consists of:

• Several national EGN services (hosted by NMCAs)

• Central EGN service (hosted by BKG Germany)

3

Page 4: Renewal of EuroGeoNames Central Serviceinstance with its own service database •The production database will be kept as a separate instance 8. Architecture of the renewed service

Background

• Currently EGN service contains geographical names data from 14 countries. 4

countries are ready to connect to the service

4

• Italian data also available

• Norway hasn’t joined yet

• Greece is connected with no data

• Estonia hasn’t connected but they

have provided data

• The short term target is to expand

the EGN service into all 27 EU

Member States

Page 5: Renewal of EuroGeoNames Central Serviceinstance with its own service database •The production database will be kept as a separate instance 8. Architecture of the renewed service

Reasons for the service renewal

• The original schema was developed before INSPIRE specification were in place. It was ambitious but most of the countries couldn’t provide data for it

• New countries have been difficult to convince to join to the EGN service

• Need to simplify the schema in order to get countries more interested

• Simpler schema suits also better for web application use

• Simpler schema makes database easier to understand and manage

• Old database schema was quite complex and hard to manage.

• Data update procedures were difficult to handle that ultimately resulted in data corruption in the central service

5

Page 6: Renewal of EuroGeoNames Central Serviceinstance with its own service database •The production database will be kept as a separate instance 8. Architecture of the renewed service

Original EGN service - architecture

• Central EGN service and national EGN services• Database: PostgreSQL + PostGIS

• Web Feature Service interface that is developed with the deegree WFS

application

• Updates are handled periodically through WFS-T requests

together with additional database scripting

• The whole dataset of a certain country is updated at the same

time by retrieving the data from the national EGN service and

replacing fully the corresponding data in the central service

6

Page 7: Renewal of EuroGeoNames Central Serviceinstance with its own service database •The production database will be kept as a separate instance 8. Architecture of the renewed service

Original database schema

7

Page 8: Renewal of EuroGeoNames Central Serviceinstance with its own service database •The production database will be kept as a separate instance 8. Architecture of the renewed service

Renewed EGN Service

• The EGN central service is to be renewed and the national EGN services are left untouched

• The renewed service is based on the ISO19112 schema from the OGC Gazetteer Service AP Best Practice document• Simpler schema in comparison to the original schema

• Will be used as the main input and output schema

• Database schema is derived from this schema

• Data is loaded from the existing National EGN services via a schema transformation process• Exonym database will be merged into the central service

• In future, the INSPIRE Geographical Names services are seen as source for the EGN data

• The EGN central service is established as a Cloud Service-based instance with its own service database• The production database will be kept as a separate instance

8

Page 9: Renewal of EuroGeoNames Central Serviceinstance with its own service database •The production database will be kept as a separate instance 8. Architecture of the renewed service

Architecture of the renewed service

• EGN central service• Production database (PostgrsSQL + PostGIS)

• WFS application (deegree WFS)

• National EGN services• These are left untouched

• Possibly INSPIRE GN nodes in the future

• Amazon cloud service• Service database (PostgreSQL + PostGIS)

• WFS application (deegree WFS)

• Custom-made Java-based WFS front-end• Java servlet

• Custom language parameter for WFS requests

• Finalizing the XML formatting

• Uses SAX (Simple API for XML) parser

9

Page 10: Renewal of EuroGeoNames Central Serviceinstance with its own service database •The production database will be kept as a separate instance 8. Architecture of the renewed service

New output / input Schema

10

OGC Gazetteer AP

Page 11: Renewal of EuroGeoNames Central Serviceinstance with its own service database •The production database will be kept as a separate instance 8. Architecture of the renewed service

New Database Schema

11

Page 12: Renewal of EuroGeoNames Central Serviceinstance with its own service database •The production database will be kept as a separate instance 8. Architecture of the renewed service

Schema mapping

12

Original schema

New schema

Page 13: Renewal of EuroGeoNames Central Serviceinstance with its own service database •The production database will be kept as a separate instance 8. Architecture of the renewed service

Schema mapping

• Left out

• complexGeometry

• classificationScheme

• script (as part of xml:lang ’ell-Latn’)

• status (to be indicated by type/primary)

13

status type primary

official_endonym official TRUE

other_endonym official FALSE

standardized_exonym variant TRUE

other variant FALSE

Page 14: Renewal of EuroGeoNames Central Serviceinstance with its own service database •The production database will be kept as a separate instance 8. Architecture of the renewed service

INSPIRE GN

14

Page 15: Renewal of EuroGeoNames Central Serviceinstance with its own service database •The production database will be kept as a separate instance 8. Architecture of the renewed service

INSPIRE GN OGC Gazetteer

NamedPlace SI_LocationInstance

boundedBy geographicExtent

beginLifespanVersion dateModified

geometry* position

text + inspireId nameID

language xml:lang of name

sourceOfName administrator

text name

relatedSpatialObject spatialObject

relatedSpatialObject* geographicIdentifer

*) Can be arbitrary, target always gml:Point

Page 16: Renewal of EuroGeoNames Central Serviceinstance with its own service database •The production database will be kept as a separate instance 8. Architecture of the renewed service

16

INSPIRE GN

nativeness

INSPIRE GN

nameStatus

OGC Gazetteer

type

OGC Gazetteer

primary

endonym official official true

endonym standardised official true

endonym historical official false

endonym other official false

exonym official variant true

exonym standardised variant true

exonym historical variant false

exonym other variant false

Page 17: Renewal of EuroGeoNames Central Serviceinstance with its own service database •The production database will be kept as a separate instance 8. Architecture of the renewed service

17

INSPIRE EGN

adminstrativeUnit Administrative units

building Houses ?

hydrography HYDROGRAPHIC FEATURES

landcover Natural terrain areas or regions ?

landform Elevation ?

populatedPlace POPULATED PLACES

protectedSite CONSERVATION AREAS

transportNetwork TRANSPORT AND

TELECOMMUNICATION

FEATURES

other MISCELLANEOUS

Page 18: Renewal of EuroGeoNames Central Serviceinstance with its own service database •The production database will be kept as a separate instance 8. Architecture of the renewed service

18

INSPIRE OGC Gazetteer

endLifespanVersion -

leastDetailedViewingResolution -

mostDetailedViewingResolution -

localType -

pronunciation -

script -

transliterationScheme -

grammaticalGender -

grammaticalNumber -

- partner

- (nameID)

- dateCommitted

- citationCode

- dateOfCreation

- designation

- history

- description

- parent/child

Page 19: Renewal of EuroGeoNames Central Serviceinstance with its own service database •The production database will be kept as a separate instance 8. Architecture of the renewed service

Mapping to INSPIRE GN

• All the mandatory elements can be populated

• Missing name-specific ID in INSPIRE GN

• Combine together the name string and the corresponding spatial object

ID (nameID)

• Feature type mapping partially not satisfactory

• Some important data items missing in Gazetteer AP schema

Page 20: Renewal of EuroGeoNames Central Serviceinstance with its own service database •The production database will be kept as a separate instance 8. Architecture of the renewed service

Current status

• Data has been loaded into the EGN Central Database

• Schema mapping has been made to the ISO 19112 OGC

Gazetteer AP schema

• The custom-made front-end WFS for final XML editing is under

construction

• The EGN service has been installed to the Amazon Cloud

Service and tested initially

20

Page 21: Renewal of EuroGeoNames Central Serviceinstance with its own service database •The production database will be kept as a separate instance 8. Architecture of the renewed service

Future work

• Integrate the exonym database into the central database

• Finalize the front-end WFS application

• Copy the data from the central database into the cloud service

• The data update procedures are to be specified

• Output in INSPIRE GN

• Other possible interfaces• REST-interfaces for geocoding and reverse geocoding

• JSON / JavaScript API

• etc...

21

Page 22: Renewal of EuroGeoNames Central Serviceinstance with its own service database •The production database will be kept as a separate instance 8. Architecture of the renewed service

Thank you!

22