anatomy of large-scale human-computation engine

The Anatomy of a Large-Scale Human-Computation Engine

Shailesh Kochhar, Stefano Mazzocchi, Praveen Paritosh

freebase.com, google.com

HCOMP'10

July 25, 2010 HCOMP'10

1: Freebase & Human Computation

2: Example – Stanford Library

3: RABJ

4: Lessons

Freebase

Structured database

12 MM entites, 300 MM triples/facts


Where does the data come from?


Community contributions

Mass Data Loads


Human Judgments Improve Both


Community

Simplifying contribution through games


Mass Data Loads

Precision: QA for >99% accuracy

Recall: increase coverage




3: RABJ

4: Lessons


Reconcile Stanford Library Catalog with freebase.com


Stanford Library Catalog

4.4MM book editions

1.3MM English book editions

1.2MM English books

600K authors


For freebase, identity is key

match books, match authors


Automatic matching insufficient

Trained judges needed to decide hard cases


How to get this?


RABJRedundant Array of Brains in a Jar


Abstraction

Powers human judgment applications

1.8MM judgments in 16 months of

operation


Provides primitives for more sophisticated

HJ applications


Questions

Judgments

Queues

Agents


Design Constraints


Content-agnostic

Dynamic data

Low latency


Architecture


Questions contain pointers to data, pushed

to a store

Questions added to queues

Metadata allows slicing and dicing


JS applications pull questions from broker

Broker matches judge to work

Apps render question, collect judgment

Broker writes judgments back to store


Declarative consensus

Yes: 3, No: 3, Skip: 2, Bad: 2, Max: 4

Broker notifies agents of consensus


Applications


matchmaker

http://matchmaker2.freebaseapps.com/

http://matchmaker2.freebaseapps.com/


Book Edition QA


typewriter

http://typewriter.freebaseapps.com/

http://typewriter.freebaseapps.com/


Scale


1.8 MM questions

2.8MM judgments

500 queues

20+ applications




3: RABJ

4: Lessons


Relationships, relationships, relationships

This is not controversial


Spam, collusion, gaming: $0

Skill development

Communication, documentation


Don't have to pay per-judgment

Yes, this is controversial


There are always leftovers


Working on formalizing workflows


More in the paper

RABJ Architecture

Learning through feedback loops


http://rabj.freebaseapps.com/

http://wiki.freebase.com/wiki/RABJ_API/

Questions?

http://rabj.freebaseapps.com/

http://wiki.freebase.com/wiki/RABJ_API/

anatomy of large-scale human-computation engine

Documents