1 efficiently learning the accuracy of labeling sources for selective sampling by pinar donmez,...

Efficiently Learning the Accuracy of

Labeling Sources for Selective Sampling

by Pinar Donmez, Jaime Carbonell, Jeff Schneider

School of Computer Science, Carnegie Mellon University

KDD ’09

June 30th 2009

Paris, France

Problem Illustration

instances

oracles

Interval Estimate Threshold (IEThresh) Goal: find the labeler(s) with the highest expected accuracy Our work builds upon Interval Estimation [L. P. Kaelbling]

1. Estimate the reward of each labeler (more on next slide)2. Compute upper confidence interval for the labelers

3. Select labelers with upper interval higher than a threshold

4. Observe the output of the chosen oracles to estimate their reward

5. Repeat to step 1

filter out unreliable labelers reduce labeling cost

Reward of the labelers The reward of each labeler is unknown => need to be estimated

reward of a labeler eliciting true label

true label is also unknown => estimated by the majority vote

We propose the below reward function

reward=1 if the labeler agrees with the majority label reward=0 otherwise

IEThresh at the Beginning

Oracles

Expect

IEThresh Oracle Selection

Oracles

Expect

Threshold

1 2 3 4 5

IE Learning Snapshot IIExpect

Oracles

Threshold

1 2 3 4 5

IEThresh Instance Selection1

Uniform Expert Accuracy є (0.5,1]

Repeated Labeling [Sheng et al, 2008]: querying all experts for labeling

# Oracle Queries vs. Accuracy

: First 10 iterations

: Next 40 iterations

: Next 100 iterations

# Oracle queries to reach a target accuracy

skew increases

Results on AMT Data with Human Annotators

IEThresh reaches the best performance with similar effort to Repeated labeling

Repeated baseline needs 840 queries total to reach 0.95 accuracy

Dataset at http://nlpannotations.googlepages.com/ made available by [Snow et al., 2008]

5 annotators

6 annotators

Conclusions and Future Work Conclusions

IEThresh is effective in balancing exploration vs. exploitation tradeoff

Early filtering of unreliable labelers boosts performance Utilizing labeler accuracy estimates is more effective

than asking all or randomly

Future Work

from consistent to time-variant labeler quality label noise conditioned on the data instance correlated labeling errors

THANK YOU!

1 efficiently learning the accuracy of labeling sources for selective sampling by pinar donmez,...

reward function reward

better slide

france slide

reward exploration

labeling cost slide

highest expected reward

observed reward

exploitation tradeoff

Documents

ficha descargable ruta pinar

jaime carbonell, pinar donmez, jingui he & vamshi ambati...

carbonell federalismo

carbonell, miguel

pinar donmez - kabbage at the chief analytics officer forum,...

pinar ozen bilecik cimento.doc

cosmos ej01 pinar

0534-pinar undeger - gaia

e carbonell riesgos psicosociales

pinar de rio

carbonell v. ca

carbonell. edad media

donmez a. - wideband pll system as a clock multiplier (2009)

miguel carbonell, capítulo ix.pdf

es pinar - new apartments

antecedentes de la historia - carbonell

optimizing estimated loss reduction for active sampling in...

jaime carbonell (cs.cmu/~jgc) with vamshi ambati and...

jaime carbonell (jgc) with pinar donmez, jingui he, vamshi...

b/n, xavi carbonell