assessing the effect of visualizations on bayesian reasoning through crowdsourcing

93
Assessing the Effect of Visualizations on Bayesian Reasoning through Crowdsourcing Jean-Daniel Fekete Pierre Dragicevic Luana Micallef

Upload: tulia

Post on 23-Mar-2016

39 views

Category:

Documents


0 download

DESCRIPTION

Assessing the Effect of Visualizations on Bayesian Reasoning through Crowdsourcing. Luana Micallef. Pierre Dragicevic. Jean-Daniel Fekete. The probability that a woman at age 40 has breast cancer is 1%. The probability that the disease is detected by a mammography is 80%. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

Assessing the Effect of Visualizations on Bayesian Reasoning through Crowdsourcing

Jean-Daniel Fekete

Pierre Dragicevic

Luana Micallef

Page 2: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

0% - 30% 30% - 60% 60% - 100%

The probability that a woman at age 40 has breast cancer is 1%.

The probability that the disease is detected by a mammography is 80%.

The probability that the test misdetects the disease although the patient does not have it is 9.6%.

If a woman at age 40 is tested as positive, what is the probability that she indeed has breast cancer?

Page 3: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

ATTENTION

Page 4: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

The probability that a woman at age 40 has breast cancer is 1%.

The probability that the disease is detected by a mammography is 80%.

The probability that the test misdetects the disease although the patient does not have it is 9.6%.

If a woman at age 40 is tested as positive, what is the probability that she indeed has breast cancer?

0% - 30% 30% - 60% 60% - 100%

Page 5: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

The probability that a woman at age 40 has breast cancer is 1%.

The probability that the disease is detected by a mammography is 80%.

The probability that the test misdetects the disease although the patient does not have it is 9.6%.

If a woman at age 40 is tested as positive, what is the probability that she indeed has breast cancer?

0% - 30% 30% - 60% 60% - 100%

Page 6: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

7.8%P ( Cancer | Positive Mammography ) =

Page 7: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing
Page 8: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

95 doctors out of 100

said the answer is between 70% to 80%

Page 9: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

Why the correct answer is so low

Page 10: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

P ( cancer | +ve mammography )

=

P ( +ve mammography | cancer)

P (+ve mammography | cancer) + P (+ve mammography | cancer)

Bayes’ Theorem

Page 11: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

women without cancer

women with cancer

The probability that a woman at age 40 has breast cancer is 1%.

Page 12: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

women without cancer

women with cancer

The probability that the disease is detected by a mammography is 80%.

The probability that the test misdetects the disease although the patient does not have it is 9.6%.

If a woman at age 40 is tested as positive, what is the probability that she indeed has breast cancer?

7.8%

Page 13: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

Can such visualizations facilitate Bayesian reasoning

Page 14: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

Proposed Visualizations

Page 15: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

contingency table

bar-grain boxes Bayesian boxes trees

signal detection curves

Page 16: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

Euler diagram frequency grid

+

Page 17: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

Euler diagram + glyphs

Page 18: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

Previous Studies

Mainly in Psychology

Claim that

Bayesian problem representation impacts comprehension

Page 19: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

but …

Inconsistent findings

Most effective Bayesian problem representation? UNCLEAR

Inconsistent and sometimes inappropriate diagram designs

Diagrams do not match textual information

Page 20: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

(Sloman et al., 2003)

Page 21: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

Area-Proportional Not Area-Proportional

Page 22: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

and the subjects …

Specific background usually highly-focused university students

Specific age group

Sometimes,

specific department

carried out as part of their course

Page 23: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

so … cannot generalize their findings to

a more diverse population of laypeople

Page 24: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

Our Work

Assessing the Effect of Visualizations on Bayesian Reasoning through Crowdsourcing

Page 25: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing
Page 26: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

to identify…

- the most effective visualization for the crowd

- whether hybrid visualizations are helpful

- the link between the visualizations and different spatial and numeracy abilities

Page 27: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing
Page 28: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

but…

how appropriate is

Page 29: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

Amazon MTurk

Used and evaluated for research and InfoVis

Demographics of workers are well-understood

Captures aspects of real-world problem solving better

- a large diverse population with different backgrounds, education, occupations, age, gender

- workers carry out tasks rapidly but accurately to improve their rating

- reduces experimental biases, as demand characteristics

Page 30: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

http://www.eulerdiagrams.org/eulerGlyphs

Page 31: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing
Page 32: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing
Page 33: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing
Page 34: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing
Page 35: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing
Page 36: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing
Page 37: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing
Page 38: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

Experiment

Page 39: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

168 workers with MTurk approval rate ≥ 95%

Page 40: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

Demographics

Page 41: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

25 min

$1

Page 42: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

3 Bayesian problemsclassics in Psychology

in natural frequencies format

Page 43: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing
Page 44: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing
Page 45: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing
Page 46: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing
Page 47: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing
Page 48: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing
Page 49: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing
Page 50: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

followed by

objective and subjective numeracy tests

paper folding spatial abilities test

brief questionnaire

Page 51: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

Results

Page 52: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

We failed to replicate previous findings

subjects’ accuracy was remarkably lower

visualizations exhibited no measurable benefit

Page 53: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

even though …

reasonably confident with their answer

Page 54: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

overall

12% exact answers

Page 55: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

6%

no visualization

Page 56: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

14% 11% 11%

21% 7% 14% 21%

Page 57: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

no vis V0

V1

V2

V3

V4

V5

V6Answer errors for all three Bayesian problems combined

per visualization type (N = 24 each)

21% exact

6% exact

Page 58: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

12% 40% - 80%our study

exact answers

previous studies

Page 59: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

Thus

we failed to demonstrate measurable

benefits from visualizations to

facilitate Bayesian reasoning.

Page 60: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

Qualitative Feedback

Page 61: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

53 out of the 168 subjects

participated

Page 62: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

89% ‘somehow’ used the diagram

Most found the diagram very useful

BUT

Various did not understand the diagram

Some doubted the diagram’s credibility

Page 63: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

However

must understand and trust the diagram

the answer is in the visualization

Page 64: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

women without cancer

women with cancer

The probability that the disease is detected by a mammography is 80%.The probability that the test misdetects the disease although the patient does not have it is 9.6%.If a woman at age 40 is tested as positive, what is the probability that she indeed has breast cancer?

7.8%

Page 65: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

How

Page 66: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

either

help them understand and relate the diagram to the text

or

force them to get the answer from the diagram

Page 67: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

change the text

Page 68: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

Another Experiment

Page 69: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

480 workers with MTurk approval rate ≥ 95%

did not participate in experiment 1

Page 70: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

1 Bayesian problemthe Mammography problem

Page 71: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

10 out of every women at age forty who participate in routine screening have breast cancer.

8 of every 10 women with breast cancer will get a positive mammography.

95 out of every 990 women without breast cancer will also get a positive mammography.

classic

Page 72: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

10 out of every women at age forty who participate in routine screening have breast cancer (compare the red dots in the diagram below with the total number of dots).

8 of every 10 women with breast cancer will get a positive mammography (compare the red dots that have a black border with the total number of red dots).

95 out of every 990 women without breast cancer will also get a positive mammography (compare the blue dots that have a black border with the total number of blue dots).

with instructions

Page 73: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

10 out of every women at age forty who participate in routine screening have breast cancer.

8 of every 10 women with breast cancer will get a positive mammography.

95 out of every 990 women without breast cancer will also get a positive mammography.

without numbers

Page 74: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

A small minority of women at age forty who participate in routine screening have breast cancer.

A large proportion of women with breast cancer will get a positive mammography.

A small proportion of women without breast cancer will also get a positive mammography.

without numbers

Page 75: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing
Page 76: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

10 out of every women at age forty who participate in routine screening have breast cancer.

8 of every 10 women with breast cancer will get a positive mammography.

95 out of every 990 women without breast cancer will also get a positive mammography.

classic

Page 77: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

Results

Page 78: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

The Most Effective Textual Representation

Page 79: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

A small minority of women at age forty who participate in routine screening have breast cancer.

A large proportion of women with breast cancer will get a positive mammography.

A small proportion of women without breast cancer will also get a positive mammography.

without numbers

Page 80: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

exact answers

Page 81: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

+no visualization

3.3% exact answers

classic text

Page 82: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

+5% exact answers

classic text

Page 83: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

5% exact answers

+

text with instructions

Page 84: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

1 exact answer (N=120)

+

text without numbers

Page 85: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

Answer errors for the Mammography Bayesian problemper presentation type (N = 120 each)

classic + no vis

classic + vis

with instructions + vis

without numbers + vis

Page 86: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

Conclusion

Page 87: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

Using crowdsourcing, we assessed

6 visualizations and text alone for

3 classic Bayesian problems

Page 88: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

We failed to replicate previous findings

subjects’ accuracy was remarkably lower

visualizations exhibited no measurable benefit

Page 89: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

A follow-up experiment confirmed …

simply adding a visualization to a textual Bayesian

problem does not help

diagrams can help but numerical values have to be removed and the text should be used to merely set the scene

Page 90: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

We need …

novel visualization that holistically combine

text and visualization and promote the use of estimation rather than calculation

more studies in settings that better capture real-life rapid decision making

Page 91: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

To …

facilitate reasoning of statistical information

for both layman and professionals

Page 92: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

ThanksJean-Daniel

FeketePierre

DragicevicLuana Micallef

Page 93: Assessing the Effect of  Visualizations on  Bayesian Reasoning  through Crowdsourcing

error = log10answergiven

answerexpected