19/10/20151 semantic web scientific data integration vladimir serebryakov computing centre of the...
TRANSCRIPT
21/04/23 1
Semantic WEB Scientific Data Integration
Vladimir SerebryakovComputing Centre
of the Russian Academy of Science
ProposalProposal: : SkTech.RC/IT/MadnickSkTech.RC/IT/Madnick
21/04/23 2
Table of Contents• What is the Computing Centre of the Russian
Academy of Science (CCRAS)• Unified Information space of the Russian
Academy of Science• Information system “Research institution”• Its extensions
– LibMeta– GeoMeta– Linked Open Data
21/04/23 3
What is the Computing Centre of the Russian Academy of Science (CCRAS)
Directions of study– Numerical methods– Informatics
• Pattern recognition
• Information systems
• Computer algebra
21/04/23 4
The Unified Research Information Space (URIS)
The Unified Research Information Space (URIS) of the Russian Academy of Sciences (RAS) is an integrated information space of distributed and local digital resources of RAS organizations and hardware and software tools that support its functionality and control.
21/04/23 5
The main tasks of URIS RAS• Development of a unified metadata model on the basis of modern
technologies and implementation of a global search mechanisms on its;
• Active scientific communications;
• Building of distributed catalogs of scientific information;
• Information support of research study;
• Development and publication of corporate standards;
• Development and implementation of a software package “The RAS Basic Institution”;
• Construction of points for access these information (portals);
• Implementation of access and integration to information resources of RAS organizations;
• Security support;
• Interconnection with other information systems.
21/04/23 6
URIS architecture
Institution RAS division
Access Access Access
RAS integrated system of information resources
Access control
Other information systems National
o Science o Education o
Foreing information systems
Regional information systems
Application field information systems
Physics Geology Economy
Division node: Metainformation Search indices
Institition node: Metainformation Search indices
Scientific information resources
Library
Library node: Metainformation Search indices
Institution
Institition node: Metainformation Search indices
Institution
Institition node: Metainformation Search indices
Institution
Institition node: Metainformation Search indices
21/04/23 7
RAS Institution Library Publishing Digital library Administrative departments Scientific secretary Data basis of scientific results Publications Scientific reports Innovations Conferences Learning
RAS Institution RAS Institution
RAS dividion
Administratuve information
Administratuve information
RAS Presidium
Access Access Access
RAS Integrated system of information resources
Access control
Other information systems National
o Science o Education o
Foreing information systems
RAS regional information systems
Application information systems
21/04/23 8
URIS Information Bus
The basis of the URIS RAS is an Information Bus that is a set of hardware, software and administrative tools that support:
• Resources and services supplement• Security• Metadata actualization• Data integration• Global search.
21/04/23 9
URIS Information bus: architecture
Metadata Metadata
RAS Devision
RAS Institution
Regional IS
External IS
Devision IS
Search Security
RAS URIS Information Bus
Services Information resorces
Regional IS
External IS
Devision IS
21/04/23 10
•Organizatios
•Persons
•Publications
•Projects
•Spatial data
•Application data
URIS Information bus: resources
21/04/23 11
Technologies
The information model of URIS RAS is based on Semantic Web – RDF, RDFS, OWL ontology of scientific information. This includes:
• Scientific activity, in particular projects as a process, conferences, seminars etc.
• Participants of Scientific activity, like persons, working groups, organizations etc.
• Results of Scientific activity, like data bases, software projects, innovations etc.
• Documents and publications, like papers, dissertations etc
21/04/23 14
URIS Metadata requirements
•Include basic resource types•Provide access to resources•Provide extensibility•Provide data integration•Provide identification•Provide searching in distributed environment•Use Semantic Web approach.•Provide interoperobility
21/04/23 15
URIS Metadata standards
Semantic Web – RDF, RDFS, OWL DCMI - Dublin Core Metadata Initiative (dublincore.org)
PRISM - Publishing Requirements for Industry Standard Metadata (Adobe,…)
AGLS Metadata Standard
vCard – “visit card” in RDF.
FOAF open initiative Friend Of A Friend (personal information) BIBLINK, bibTeX, Math-Net, UKOLN CLD …
CERIF 2000, MARC и RUSMARC, CIDOC …
21/04/23 16
The software package“RAS Institution”
21/04/23 17
RAS institutions information tasks
• Inner administrative tasks• Institution as a research RAS
organization• Support of a research process• Public representation
21/04/23 18
The software package“RAS Institution”
The software package “The RAS Institution” is intended to supply institutions with a modern information system that supports internal requirements (publication of scientific information, administrative processes and information etc.) from one hand, and external ones (representation of the information in URIS RAS and Internet) from another hand. It includes:
Infrastructure services that supports
•Data storage
•Global identification
•Data exchange and replication
•Security
•Indexing and search.
21/04/23 19
The software package“RAS Institution”
Base components include subsystems
•Administrative directory
•Publications
•Projects.
•Interaction components
•News
•Forums
•Private communication
•Application components
•Publishing department
•Library
•Electronic library
•Library of dissertations.
21/04/23 20
“RAS institution” applications
• Portal RAS• RAS organizations information
systems• Thematic information systems • Bridges
21/04/23 21URIS Portal
21/04/23 22RAS Portal
21/04/23 23Division’s of Mathematics portal
21/04/23 24Moscow State University dept’s of applied mathematics portal
21/04/23 25RAS Institution’s of America and Canada portal
21/04/23 26
LibMeta• Requirements
– Integration into URIS– Distributed environment– International standards
• OAIS
• Dublin Core
• CIDOC-CRM
• OAI-PMH
21/04/23 27
LibMeta Fuctionality• User
– Searching• Full text• Attribute• Directories
– Navigation– Accessing
• Administrator– Content control– Rights control– Directories management
• OAI PMH metadata exchange
21/04/23 28
LibMeta Profile
URIS basic Profile -Kernel -Person -Project -Organization -Publication
LibMeta Profile -Full texts (scanned) -Contents -Multimedia objects -Museum objects -Collections
URIS Library extension -Resumed publication -Publication collections -Bibliography -Series
21/04/23 29RAS scientific heritage digital library portal
21/04/23 30
GeoMeta portal
21/04/23 31
Purposes– Metadata support (keeping and
editing)– Metadata harvesting– Integration of scientific spatial data– Searching of spatial data and
services– Spatial data visualisation (maps,
pictuers etc)
21/04/23 32
Architecture
• Implementation is based on RAS Institution
• Based on ISO 19115/19139 standards
21/04/23 33
GeoMeta Functionality• In addition to main URIS resources
(person, publication, organization, project) the system supports spatial data
• Main functions:– Resource cataloging, harvesting, loading,
searching;– Keeping spatial data in a repository and
access to these data;– Access via standard protocols (WFS, WMS);– Data (maps) visualization;– Directories management.
21/04/23 35
21/04/23 36
Protected Sites Information System
21/04/23 37
Information system on protected sites
• Is based on GeoMeta
• Functionality– Data model is based on ISO ISO
19101 Reference model, 9109 Rules for Application Schema, INSPIRE
– Loading data– Navigation, searching– Queries
21/04/23 38
ProposalsThe integration of scientific data in the
common scientific information space, the integration of this space with a distributed system of scientific digital libraries.
The challenge is to develop formalisms, methods of implementation and a pilot implementation, particularly:
21/04/23 39
Proposals
• Ontologies for some scientific domains. Science is big, so the claim to universal coverage is not realistic. Therefore, we should focus on specific subject areas, such as spatial data, for which there are well-developed standards for describing and organizing data.
21/04/23 40
Proposals
The formal means of data integration based on domain ontologies. The integration in particular, should include data binding, i.e. linkages based on the data identification.
21/04/23 41
Proposals
• Creating key information (metadata) in storing it in special (data) centers, in particular, information about the relationships between data.
21/04/23 42
Proposals
• Establishment of protocols that work with distributed information, in particular, searching.
21/04/23 43
Proposals
• Development of means for extracting information from sources and loading appropriate meta information into the global environment (storage centers).
21/04/23 44
Proposals
• Development of user interfaces in the format of digital libraries, ie, digital libraries, working with the metadata of the global environment and having the ability to extract data from sources.
21/04/23 45
Thank you!