openaire metrics service: usage statistics (webinar for repository managers)

33
@openaire_eu OpenAIRE Metrics Service: Usage Statistics Webinar, 07 Dec 2017 Dimitris Pierrakos, Athena Research Center Jochen Schirrwagen, Bielefeld University

Upload: openaire

Post on 22-Jan-2018

285 views

Category:

Internet


4 download

TRANSCRIPT

Page 1: OpenAIRE Metrics Service: Usage Statistics (webinar for repository managers)

@openaire_euOpenAIRE Metrics Service: Usage StatisticsWebinar, 07 Dec 2017

Dimitris Pierrakos, Athena Research CenterJochen Schirrwagen, Bielefeld University

Page 2: OpenAIRE Metrics Service: Usage Statistics (webinar for repository managers)

● OpenAIRE infrastructure and Usage Statistics Service.

● Usage Data Collection strategies.

● Using Piwik for tracking and analytics.

● Applying COUNTER rules.

● Metrics in the Repository Manager Dashboard.

● Relation to Open Metrics and Next Generation Metrics.

Overview

Page 3: OpenAIRE Metrics Service: Usage Statistics (webinar for repository managers)

• A pan-European Research Information platform to

monitor OA research outcomes from EC and other

national funders.

• Research analytics tools to promote new scientific

metrics & support evidence-based decision-making.

• Implementation of an OpenAIRE usage statistics

service for usage data collected from data providers.

OpenAIRE 2020

Page 4: OpenAIRE Metrics Service: Usage Statistics (webinar for repository managers)

● Task in OpenAIRE2020 covers:○ aligning policies and standards for gathering and sharing of usage data

-> guidelines○ considering legal aspects (data protection / data privacy)○ relating usage statistics to other kinds of metrics○ collecting and processing of usage data and producing consolidated,

standards-based usage statistics

● Task team: Athena Research Center, University of Bielefeld, University of Minho, Jisc IRUS-UK, Couperin + NOADs

Usage Statistics in OpenAIRE

Page 5: OpenAIRE Metrics Service: Usage Statistics (webinar for repository managers)

● OpenAIRE collects from 980 compatible data providers

~23 Mio documents

● currently 32 active data providers participating in

Usage statistics + IRUS-UK

● Usage statistics deployment under cc-0.

○ in OpenAIRE dashboard, portal and API.

Usage Statistics in the OpenAIRE Infrastructure

Page 6: OpenAIRE Metrics Service: Usage Statistics (webinar for repository managers)

● Tracking of views and downloads / collecting COUNTER reports

○ Push or Pull collection workflows.

● Anonymisation of IP-addresses.● Metadata de-duplication enables accumulation of

views and downloads for same documents ● COUNTER Code of Practice compatibility.○ standards based usage statistics.○ enables comparability with statistics from other data sources.

Usage Statistics Service Features

Page 7: OpenAIRE Metrics Service: Usage Statistics (webinar for repository managers)

• World's leading open-source analytics platform.• Valuable insights into website traffic and visitors activity. • Piwik collects and stores PII (personally identifiable

information).• Keeps full data ownership and can control who has access. • Robot filtering plugin.• Compliant with EU regulations.• Recommended by privacy organizations such as ULD

(Germany) and CNIL (France).

Piwik Analytics platform

Page 8: OpenAIRE Metrics Service: Usage Statistics (webinar for repository managers)

Piwik Facts

Page 9: OpenAIRE Metrics Service: Usage Statistics (webinar for repository managers)

Metadata-Index

UsageStatistics-DB

● Repository

● CRIS

● eJournal

● National

Statistics Node

● Publisher

PULLCOUNTER

Report

PUSHtracked

event

IP-Anonym.

processing script

processing script

2-Tiers Collection Workflows for Usage Statistics

Page 10: OpenAIRE Metrics Service: Usage Statistics (webinar for repository managers)

• An institutional repository is registered in Piwik.• Server side tracking: Plugins (Dspace) or patches

(Eprints) using Piwik’s HTTP API.• Usage Activity is tracked and logged at Piwik

platform in real time.• Ιnformation is transferred offline, using Piwik’s API,

to OpenAIRE’s DBs for statistical analysis.• Statistics are deployed via OpenAIRE’s Portal or

Sushi-Lite API.

Tier-1: Push Usage Statistics Tracking Workflow

Page 11: OpenAIRE Metrics Service: Usage Statistics (webinar for repository managers)

Tier-1: Piwik Tracking Parameters

Page 12: OpenAIRE Metrics Service: Usage Statistics (webinar for repository managers)

● Usage events can be considered privacy-sensitive information (user-agent, ip-address, ...)

● Usage statistics services must comply with data protection laws and regulations for both usage data- and service-providers○ but legal situation differs between the countries○ OpenAIRE must comply with the EU-General Data Protection

Regulation

● Tracking plugins issued by OpenAIRE anonymize usage data already on the client-side

Data Protection Aspects

Page 13: OpenAIRE Metrics Service: Usage Statistics (webinar for repository managers)

Usage Activity in real time

Page 14: OpenAIRE Metrics Service: Usage Statistics (webinar for repository managers)

Real time Visitor Map

Page 15: OpenAIRE Metrics Service: Usage Statistics (webinar for repository managers)

• Applying data processing rules according to COUNTER Code of Practice:• ie. counting requests depending on session duration, tracing double-

clicks

• Bot filtering• Piwik Bot Plugin• COUNTER Robots Working Group

• Link of usage event with metadata record in OpenAIRE

• Accumulate views and counts of de-duplicated records

Cleaning and Consolidation

Page 16: OpenAIRE Metrics Service: Usage Statistics (webinar for repository managers)

Repository Pilot Statistics

Page 17: OpenAIRE Metrics Service: Usage Statistics (webinar for repository managers)

• Gathering of consolidated statistics reports from aggregation services, such as IRUS-UK, using protocols such as SUSHI-Lite.

• Statistics are stored to OpenAIRE’s DB for statistical analysis.

• Statistics are deployed via OpenAIRE’s Portal or Sushi-Lite API.

Tier-2: Collecting (Pull) Consolidated Usage Statistics Reports

Page 18: OpenAIRE Metrics Service: Usage Statistics (webinar for repository managers)

entityId/orid

entityId/orid

entityId/orid

entityId/orid

source

source

OpenAIRE Usage Statistics DB

Page 19: OpenAIRE Metrics Service: Usage Statistics (webinar for repository managers)

● Four steps to join OpenAIRE Usage Statistics1. Download. 2. Configure. 3. Deploy. 4. Validate (by OpenAIRE).

● Or enter SUSHI endpoint to let OpenAIRE collect COUNTER reports

OpenAIRE Repository Manager Dashboard

Page 20: OpenAIRE Metrics Service: Usage Statistics (webinar for repository managers)

Content Provider Dashboard - Start Page

Page 21: OpenAIRE Metrics Service: Usage Statistics (webinar for repository managers)

Content Manager’s Datasource selection for Metrics

Page 22: OpenAIRE Metrics Service: Usage Statistics (webinar for repository managers)

Enable Metrics for selected Datasource

Page 23: OpenAIRE Metrics Service: Usage Statistics (webinar for repository managers)

Configure Metrics for selected Datasource

000

01233456

Page 24: OpenAIRE Metrics Service: Usage Statistics (webinar for repository managers)

Summarized Usage Statistics on the content provider level

Page 25: OpenAIRE Metrics Service: Usage Statistics (webinar for repository managers)

Usage Statistics on the Article Level

Page 26: OpenAIRE Metrics Service: Usage Statistics (webinar for repository managers)

● Available as beta with the help of IRUS-UK○ http://beta.services.openaire.eu/usagestats/sushilite/

● Supports COUNTER R4 compatible reports:○ Article Reports (AR) and Book Reports (BR) using identifiers like

openaire, doi, oai-record-id○ Item Reports (IR)○ Repository Reports (RR) using identifiers issued by OpenAIRE or

OpenDOAR○ Journal Reports (JR) using identifiers like ISSN

SUSHI-Lite Interface

Page 27: OpenAIRE Metrics Service: Usage Statistics (webinar for repository managers)

Repository Report Item Report

SUSHI response example (JSON)

Page 28: OpenAIRE Metrics Service: Usage Statistics (webinar for repository managers)

• Quantitative indicators for research

• Governance

• Management

• Assessment

• Dimensions

• Robust metrics in terms of accuracy and scope;

• Humble metrics recognizing that quantitative evaluation should support qualitative,

expert assessment;

• Open and Transparent metrics;

• Diverse metrics by field in order to support the plurality of research and researcher career

paths across the system;

• Reflexible metrics for recognising, anticipating and updating the systemic and potential

effects of indicators.

OpenAIRE: A Usage Statistics Hub for Responsible Metrics

Page 29: OpenAIRE Metrics Service: Usage Statistics (webinar for repository managers)

• Standardization: following COUNTER Code of Practice• by update to COUNTER R5• by contribution to COUNTER Robots Working Group

• Put usage statistics into context with conventional and alternative metrics and (open) peer review

Considering the HLEG Altmetrics Recommendations

Page 30: OpenAIRE Metrics Service: Usage Statistics (webinar for repository managers)

● Develop Piwik plugins for other Repository platforms (eg. Fedora, Samvera)

● Promote the service to content provider managers● Support national usage statistics initiatives to

become a node in OpenAIRE Usage Statistics● Contribute to the Open Metrics concept and vision● Activities in OpenAIRE-Advance starting in 2018:○ support LA Referencia to set up a regional usage statistics network and

interlink○ working towards Open Metrics

Next Steps

Page 31: OpenAIRE Metrics Service: Usage Statistics (webinar for repository managers)

● Standardize usage statistics to enable assessment of research impact

○ Standardize usage statistic metrics across OpenAIRE and EOSC-hub

○ Collaborate with RDA (e.g. Make Data Count BoF working group)

○ Promote common guidelines to and across communities

○ Take EC rules and GDPR regulations into account

● Enable the collection/aggregation of usage stats from content providers

○ Adopt OpenAIRE and EOSC-hub services for collecting user statistics, services in scope:

■ EGI: Accounting System, AppDB

■ EUDAT: DPMT, B2SHARE, B2FIND, B2SAFE

○ Adopt OpenAIRE Usage Statistics Services to collect user stats for all products of

science

■ e.g. literature, datasets, software, research objects

■ Integrating with EOSC-hub services for usage statistics/metrics

Collaboration with EOSC-hub

Page 32: OpenAIRE Metrics Service: Usage Statistics (webinar for repository managers)

● OpenAIRE Usage Statistics Deliverable Report○ https://doi.org/10.5281/zenodo.1034163

● Repository Tracking Plugins (github)○ https://github.com/openaire/OpenAIRE-Piwik-DSpace○ https://github.com/openaire/EPrints-OAPiwik

● SUSHI-Lite API (beta)○ http://beta.services.openaire.eu/usagestats/sushilite/

References