metrics standardization. wikimedia research & data showcase, march 2014

21
Metrics standardization Dario Taraborelli Aaron Halfaker Wikimedia Research and Data showcase March 2014

Upload: dario-taraborelli

Post on 08-May-2015

593 views

Category:

Technology


0 download

DESCRIPTION

Slides from the talk on metrics standardization from the March 2014 Wikimedia Research & Data Showcase.

TRANSCRIPT

Page 1: Metrics standardization.  Wikimedia Research & Data Showcase, March 2014

Metrics standardizationDario Taraborelli • Aaron Halfaker

Wikimedia Research and Data showcaseMarch 2014

Page 2: Metrics standardization.  Wikimedia Research & Data Showcase, March 2014

summer 2013

Page 3: Metrics standardization.  Wikimedia Research & Data Showcase, March 2014

cohort-level metrics

Page 4: Metrics standardization.  Wikimedia Research & Data Showcase, March 2014

cohort-level metrics project-level metrics

Page 5: Metrics standardization.  Wikimedia Research & Data Showcase, March 2014

project-level metrics

Page 6: Metrics standardization.  Wikimedia Research & Data Showcase, March 2014

project-level metrics

Page 7: Metrics standardization.  Wikimedia Research & Data Showcase, March 2014

ENWIKI New Editors / day 1D: 21% 30D: 18% YTD: 20%

Editor engagement vital signs

key performance indicators for user engagement, community and content growth aggregated daily / weekly / monthlyfor every single Wikimedia project

https://www.mediawiki.org/wiki/Analytics/Epics/Editor_Engagement_Vital_Signs

02/01: 1240•

summer 2014

Page 8: Metrics standardization.  Wikimedia Research & Data Showcase, March 2014

Key metrics

New users Community Content Curation

Newly registered users

New editors

Productive new editors

Surviving new editors

...

Editors

Active editors

Very active editors

IP editors

Bots

Page creators

...

Edits

Bot edits

Uploads

Pages

...

Page deletions

Reverts

...

https://meta.wikimedia.org/wiki/Research:Metrics_standardization

Page 9: Metrics standardization.  Wikimedia Research & Data Showcase, March 2014

RelevantMeasure quantities that describe important phenomena

ReplicableMake research easily reproducible and verifiable

Transparent Provide formal specifications, remove ambiguity

ConsistentReplace proprietary, ad-hoc metric definitions; compare apples to apples

RobustMake metrics replicable via multiple data sources at any point in time

GranularComputable at different time scales

Rationale

Page 10: Metrics standardization.  Wikimedia Research & Data Showcase, March 2014

Anatomy of a metric 1. specification

Page 11: Metrics standardization.  Wikimedia Research & Data Showcase, March 2014

Anatomy of a metric 2. visualizations

registration

time

Activation Trial Survival

Page 12: Metrics standardization.  Wikimedia Research & Data Showcase, March 2014

Anatomy of a metric 2. visualizations

New editor

Productive new editor

Surviving new editor

Page 13: Metrics standardization.  Wikimedia Research & Data Showcase, March 2014

Anatomy of a metric 3. discussion

Page 14: Metrics standardization.  Wikimedia Research & Data Showcase, March 2014

Anatomy of a metric 4. sensitivity analysis

Page 15: Metrics standardization.  Wikimedia Research & Data Showcase, March 2014

Sensitivity analysis

https://meta.wikimedia.org/wiki/Research:Productive_new_editor

Does new editor productivity vary when we measure it over the first day or the first week?

Page 16: Metrics standardization.  Wikimedia Research & Data Showcase, March 2014

Sensitivity analysis

https://meta.wikimedia.org/wiki/Research:New_editor

Does it really matter to limit new editor activation to main namespace edits only?

Page 17: Metrics standardization.  Wikimedia Research & Data Showcase, March 2014

Sensitivity analysis

https://meta.wikimedia.org/wiki/Research:Surviving_new_editor

Does the length of the trial and survival period affect the measurement of new editor survival?

Page 18: Metrics standardization.  Wikimedia Research & Data Showcase, March 2014

Why does this matter at all?

1. Data exploration“Newly registered users on German and Dutch Wikipedia have a higher activation rate than newbies on English Wikipedia”

“Spanish Wikipedia adds every day twice as many new editors than German Wikipedia, despite having only half its new user activation rate”

2. Natural experiments

“A change in abuse filter rules on the Italian Wikipedia significantly increased new editor survival”

Page 19: Metrics standardization.  Wikimedia Research & Data Showcase, March 2014

Metric specification

Sensitivity analysis

Parameter recommendation

Release

Evaluation

Evaluation

Page 20: Metrics standardization.  Wikimedia Research & Data Showcase, March 2014

Metric specification

Sensitivity analysis

Parameter recommendation

Release

Evaluation

Evaluation

feedback