Download - Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

Page 1: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Formulating optimal decision making process.Utility of Bayesian Model

Understanding Human Sequential Decision MakingUnder Uncertainty Using Ideal Observer Analysis

Brian J. Stankiewicz, Ph.D.

University of Texas, AustinDepartment Of Psychology & Center for Perceptual Systems

June 17, 2005

Brian J. Stankiewicz, Ph.D. Human sequential decision making under uncertainty

Page 2: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Collaborators

University of Texas, Austin

Matthew deBrechtChris GoodsonKyler EastmanMatthew McCabeAnthony Cassandra

University of Minnesota

Gordon E. LeggeErik Schlicht

SUNY Plattsburgh

J. Stephan Mansfield

Army Research Lab

Sam Middlebrooks

University XXI / Army Research Labs

National Institute of Health

Air Force Office of ScientificResearch

Page 3: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

1 IntroductionFormat of sequential decision making with uncertainty tasks.Problem StructureRe-orientation taskSeek & Destroy problem

2 Formulating optimal decision making process.Tiger Problem

3 Utility of Bayesian ModelNavigation with Uncertainty: IntroductionExperiment 1: Effect of Layout Complexity

Page 4: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Format of sequential decision making with uncertainty tasks.Problem StructureRe-orientation taskSeek & Destroy problem

Examples Sequential Decision Making with Uncertainty

Medical Diagnosis

Scientific Exploration

Re-orientation after becoming lost

Seek & Destroy

Tiger Problem

Etc.

Page 5: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Examples Sequential Decision Making with Uncertainty

Medical Diagnosis

Scientific Exploration

Re-orientation after becoming lost

Seek & Destroy

Tiger Problem

Etc.

Page 6: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Sequential Decision Making: Structure

Structure of Sequential Decision Making with UncertaintyTasks.

1 States (S)True state is hidden

2 Actions (A)3 Observations (O)4 Transition Matrix (T)5 Rewards (R)6 Beliefs (B)

Page 7: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Medical Diagnosis: States

Patient can be in one of a set of states. (S)

InfluenzaPancreatic cancerAppendicitisetc.

Unable to directly observe the current state.

True state is Hidden.

Page 8: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 9: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 10: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 11: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Medical Diagnosis: Actions

A set of Actions available to decision maker. (A)

Take temperatureBlood workupMRI scanSurgeryetc.

Page 12: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Medical Diagnosis: Actions

A set of Actions available to decision maker. (A)

Take temperatureBlood workupMRI scanSurgeryetc.

Page 13: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Medical Diagnosis: Rewards

Actions have costs and rewards.

$5,000 for surgery$0.32 for taking temperature

Rewards/Costs may be dependent upon the state of thepatient

E.g. Surgery given patient has Appendicitis

Reward(Surgery |Appendicitis) =-$5,000+$4,000=-$1,000Reward(Surgery |NotAppendicitis) =-$5,000

Page 14: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 15: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 16: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 17: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 18: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Medical Diagnosis: Observations

Actions result in Observations. (O)

E.g., Temperature = NormalNo elevated white blood cells

Observations help to determine the true state of the system(Hidden state).

E.g. p(Influenza) goes down if temperature is normal.

Observations can be noisy (i.e., non-deterministic)

E.g., Nurse may write down wrong temperature on chartThermometer might be broken or used incorrectly1.0> p(Observation|State,Action) >0.0

Page 19: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 20: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 21: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 22: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 23: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 24: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Medical Diagnosis: Beliefs

We can represent the decision maker’s current belief aboutthe true state of the system in terms of likelihoods.

E.g., < p(Influenza), p(Pancreaticcancer), . . . , p(Healthy) >

Observations and actions modify this belief (i.e., informationgathering).

E.g. p(Influenza) goes down if temperature is normal.p(Pancreatic cancer) will go up slightly given that thetemperature is normal.

Page 25: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 26: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 27: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 28: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Medical Diagnosis: Transitions

Executing an action may move the system from one state toanother. (T)

E.g., Surgery may move the patient from Appendicitis toHealthy.

Results of actions may be non-deterministic(1.0> p(s ′|s, a) >0.0)

p(Healthy |Appendicitis,Surgery)=0.97p(Dead |Appendicitis,Surgery)=0.03

Page 29: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 30: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 31: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 32: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Sequential Decision Making: Structure

Medical diagnosis can be described with the followingstructures.

1 States (S)True state is hidden

2 Actions (A)3 Observations (O)4 Transition Matrix (T)5 Rewards (R)6 Beliefs (B)

Page 33: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Medical Diagnosis: Summary

Description of Medical Diagnosis problem.

Table: General description of decision making under uncertainty task in

medical diagnosis.

States Set of disorders (Cancer, Influenza, etc.)

Hidden State Patient’s current disorder(s)

Actions MRI scan, blood work, surgery, etc.

Observations Elevated white blood cells, temperature, etc.

Transitions Pancreatic CancerSurgery→ No cancer

Rewards Money, time, personnel, etc.

Beliefs Hypothesis about the patient’s current disorder

Page 34: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Re-orientation task

Other sequential decision making processes have the samebasic structure.

Table: Re-orientation after becoming lost.

States Position & Orientation in environment

Hidden State Current position & orientation

Actions Move forward, rotate-left, rotate-right, etc.

Observations Visual Landmarks, Proprioception, Compass, etc.

Transitions < X ,Y ,Θ >Translation→ < X ′,Y ′,Θ >

Rewards Energy expended, time, etc.

Beliefs Hypothesis about your current position

Page 35: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Seek & Destroy task

Other sequential decision making processes have the samebasic structure.

Table: Structure of seek & destroy problem.

States Possible locations of enemy

Hidden State Current position of enemy

Actions Reconnaissance, Artillery, Declare “Finished”, etc.

Observations Enemy sighted, Enemy not sighted, etc.

Transitions Enemy:X,YArtillery→ Enemy: Destroyed

Rewards Cost of Unmanned Air Vehicle assignment, time, etc.

Beliefs Hypothesis about the enemy’s current position

Page 36: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Beyond the structure of the problem

We have provided a description of the structure of asequential decision making with uncertainty task.

Does not provide with a description of what one should do ifthey are optimal/rational.

One can generate the optimal behavior using PartiallyObservable Markov Decision Processes (POMDP)

Bayesian statistics method for computing the optimal decisiongiven the types of specifications above.

To formulate a problem we need to fully specify problem.

Describe a simple example to illustrate the mathematics.

Page 37: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 38: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 39: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 40: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 41: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Tiger Problem

1 Tiger Problem

Simple example of Sequential Decision Making underUncertainty task.Illustration to provide intuitive understanding of POMDParchitecture.

Page 42: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Tiger Problem

Tiger Problem: States

Two doors:

Behind one door is TigerBehind other door is “pot of gold”

Page 43: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Tiger Problem

Tiger Problem: Actions

Three Actions:

1 Listen2 Open Left-Door3 Open Right-Door

Page 44: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Tiger Problem

Tiger Problem: Observations

Two Observations:

1 Hear Tiger Left (HearLeft)2 Hear Tiger Right (HearRight)

Observation Structure

p(HearLeft |TigerLeft , Listen) = 0.85

p(HearRight |TigerRight , Listen) = 0.85

p(HearRight |TigerLeft , Listen) = 0.15

p(HearLeft |TigerRight , Listen) = 0.15

Page 45: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Tiger Problem

Tiger Problem: Observations

Two Observations:

1 Hear Tiger Left (HearLeft)2 Hear Tiger Right (HearRight)

Observation Structure

p(HearLeft |TigerLeft , Listen) = 0.85

p(HearRight |TigerRight , Listen) = 0.85

p(HearRight |TigerLeft , Listen) = 0.15

p(HearLeft |TigerRight , Listen) = 0.15

Page 46: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Tiger Problem

Tiger Problem: Rewards

Table: Reward Structure for Tiger Problem

Tiger=Left Tiger=Right

Listen -1 -1

Open-Left -100 10

Open-Right 10 -100

Page 47: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Tiger Problem

Formalizing a SDMUU Task

1 Belief Vector (p(s ′|b, o) = p(o|s′,b)p(s′|b))p(o|b) )

The decision maker’s current belief about the true state of thesystem.Represented as the probability of being in each state.

2 Hidden state (strue)

The position of the TigerNot directly observable

3 Goal

Maximize the expected reward.

Page 48: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Tiger Problem

Tiger Problem: Belief Updating

Belief Updating

p(s ′|b, o) =p(o|s ′, b)p(s ′|b))

p(o|b)

Table: Belief Updating for Tiger Problem

Act. Num Action Observation p(TigerLeft)

0 —- —- 0.5

1

Listen HearLeft 0.85

2

3

Listen HearRight 0.85

4

5

But what about action selection?

Page 49: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Tiger Problem

Belief Updating

p(s ′|b, o) =p(o|s ′, b)p(s ′|b))

p(o|b)

0 —- —- 0.5

1 Listen

HearLeft 0.85

2

3

4

5

Page 50: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Tiger Problem

Belief Updating

p(s ′|b, o) =p(o|s ′, b)p(s ′|b))

p(o|b)

0 —- —- 0.5

1 Listen HearLeft

0.85

2

3

4

5

Page 51: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Tiger Problem

Belief Updating

p(s ′|b, o) =p(o|s ′, b)p(s ′|b))

p(o|b)

0 —- —- 0.5

1 Listen HearLeft 0.85

2

3

4

5

Page 52: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Tiger Problem

Belief Updating

p(s ′|b, o) =p(o|s ′, b)p(s ′|b))

p(o|b)

0 —- —- 0.5

2 Listen HearLeft

0.9698

3

4

5

Page 53: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Tiger Problem

Belief Updating

p(s ′|b, o) =p(o|s ′, b)p(s ′|b))

p(o|b)

0 —- —- 0.5

3

4

5

Page 54: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Tiger Problem

Belief Updating

p(s ′|b, o) =p(o|s ′, b)p(s ′|b))

p(o|b)

0 —- —- 0.5

3 Listen HearRight

0.85

4

5

Page 55: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Tiger Problem

Belief Updating

p(s ′|b, o) =p(o|s ′, b)p(s ′|b))

p(o|b)

0 —- —- 0.5

3 Listen HearRight 0.85

4

5

Page 56: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Tiger Problem

Belief Updating

p(s ′|b, o) =p(o|s ′, b)p(s ′|b))

p(o|b)

0 —- —- 0.5

4 Listen HearLeft

0.9698

5

Page 57: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Tiger Problem

Belief Updating

p(s ′|b, o) =p(o|s ′, b)p(s ′|b))

p(o|b)

0 —- —- 0.5

5

Page 58: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Tiger Problem

Belief Updating

p(s ′|b, o) =p(o|s ′, b)p(s ′|b))

p(o|b)

0 —- —- 0.5

5 Listen HearLeft

0.9945

Page 59: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Tiger Problem

Belief Updating

p(s ′|b, o) =p(o|s ′, b)p(s ′|b))

p(o|b)

0 —- —- 0.5

Page 60: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Tiger Problem

Belief Updating

p(s ′|b, o) =p(o|s ′, b)p(s ′|b))

p(o|b)

0 —- —- 0.5

But what about action selection?Brian J. Stankiewicz, Ph.D. Human sequential decision making under uncertainty

Page 61: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Tiger Problem

Tiger Problem: Action Selection

Table: Reward Structure for Tiger Prob-

lem

Tiger=Left Tiger=Right

Listen -1 -1

Open-Left -100 10

Open-Right 10 -100

Page 62: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Tiger Problem

Computing Expected Value

ρ(b, a) =∑

s∈S R(s, a)b(s)

Immediate reward

R = r(0) +∑∞

t=1 r(t)

Consider immediate and future rewards

V (b) = maxa∈A

[ρ(b, a) +

∑b′∈B τ(b, a, b′)V (b′)

]Expected Value Function

Page 63: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Tiger Problem

ρ(b, a) =∑

s∈S R(s, a)b(s)

Immediate reward

R = r(0) +∑∞

t=1 r(t)

V (b) = maxa∈A

[ρ(b, a) +

∑b′∈B τ(b, a, b′)V (b′)

Page 64: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Tiger Problem

ρ(b, a) =∑

s∈S R(s, a)b(s)

Immediate reward

R = r(0) +∑∞

t=1 r(t)

V (b) = maxa∈A

[ρ(b, a) +

∑b′∈B τ(b, a, b′)V (b′)

Page 65: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Tiger Problem

Tiger Problem: Expected Reward

ρ(b, a) =∑

s∈S R(s, a)b(s)Immediate reward

-120.0

-100.0

-80.00

-60.00

-40.00

-20.00

0.000

20.00

0 0.2 0.4 0.6 0.8 1

Expected Reward1 Action

ListenOpen-RightOpen-Left

Expe

cted

Val

ue

p(TigerLeft

)

Need to calculate theexpected reward.

Reward(Action,State)

Page 66: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Tiger Problem

R = r(0) +∑∞

t=1 r(t)

Immediate and future rewards

V (b) = maxa∈A

[ρ(b, a) +

∑b′∈B τ(b, a, b′)V (b′)

Expected rewardfunctions for multiplefuture actions with aninfinite horizon.

Page 67: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Tiger Problem

R = r(0) +∑∞

t=1 r(t)

Immediate and future rewards

V (b) = maxa∈A

[ρ(b, a) +

∑b′∈B τ(b, a, b′)V (b′)

Expected rewardfunctions for multiplefuture actions with aninfinite horizon.

Page 68: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Tiger Problem

From value function(Expected Rewards)we can generate apolicy based upon ourcurrent belief (beliefvector).

Page 69: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Tiger Problem

0 —- —- 0.5

1

2

3

Open-Right Reward 0.5

Page 70: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Tiger Problem

0 —- —- 0.5

1 Listen

HearLeft 0.85

2

3

Page 71: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Tiger Problem

0 —- —- 0.5

1 Listen HearLeft

0.85

2

3

Page 72: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Tiger Problem

0 —- —- 0.5

2

3

Page 73: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Tiger Problem

0 —- —- 0.5

2

3

Page 74: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Tiger Problem

0 —- —- 0.5

2 Listen HearLeft

0.9698

3

Page 75: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Tiger Problem

0 —- —- 0.5

3

Page 76: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Tiger Problem

0 —- —- 0.5

3

Page 77: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Tiger Problem

0 —- —- 0.5

3 Open-Right Reward

0.5

Page 78: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Tiger Problem

0 —- —- 0.5

3 Open-Right Reward 0.5

Page 79: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Tiger Problem

POMDP: Computing Expected Value

1 Using a POMDP we can generate a policy graph for aSequential Decision Making Under Uncertainty Task.

Policy graph provides us with the optimal action given a beliefabout the true state of the system.

2 Using a POMDP we can compute the Expected Rewardgiven the initial belief state and optimal action selection.

Using the optimal expected reward structure we can comparehuman performance to the optimal performance.By comparing human behavior to the optimal ExpectedReward we can get a measure of efficiency.

Page 80: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Tiger Problem

POMDP: Computing Expected Value

1 Using a POMDP we can generate a policy graph for aSequential Decision Making Under Uncertainty Task.

Policy graph provides us with the optimal action given a beliefabout the true state of the system.

2 Using a POMDP we can compute the Expected Rewardgiven the initial belief state and optimal action selection.

Using the optimal expected reward structure we can comparehuman performance to the optimal performance.By comparing human behavior to the optimal ExpectedReward we can get a measure of efficiency.

Page 81: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Navigation with Uncertainty: IntroductionExperiment 1: Effect of Layout Complexity

Utility of Optimal Observer

What is the utility of computing the optimal solution?

Provides a way to normalize for task difficulty.

For example, Stankiewicz, Legge, Mansfield & Schlicht (UnderReview)

Page 82: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 83: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 84: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Navigating with uncertainty: General

Needed to carefully control the environment (system) so thatwe can provide the optimal navigator (POMDP) with thesame information as the human.

Used virtual reality indoor environments.Environments were randomly generated on a Cartesian grid.Environments were visually sparse.

No object landmarksAble to quantify Observations (p(o|s))Observations are not unique. More than one state cangenerate the same observation (p(o|s) ≤ 1.0)

Specific set of simple actions.

1 Translate forward 1 hallway unit (to be described)2 Rotate right 90◦

3 Rotate left 90◦

Page 85: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Navigating with uncertainty: Environment

Sample map of environment

Randomly generated environments.

Generated on Cartesian grid.

Translations move from one red square to another.

Page 86: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Navigating with uncertainty: Environment

Sample of navigating through virtual environment

Page 87: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Experiment 1: Procedure

Training:1 Allow free exploration for 3 minutes.

One goal position.Indicated by an auditory signal when subject “walks” over theposition.

2 Have subjects draw environment on cartesian grid.3 Check if map is accurately drawn.4 Return to exploration.5 Repeat until environment is drawn correctly twice in a

row.

Page 88: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Experiment 1: Procedure

Testing:

Subject starts from a random state in the environmentSubject’s task is to reach the goal state in the fewest numberof actions possible (minimize cost).

Every action (translation and rotation) has an equal cost.Subjects have to be certain that they are there and indicatethat they are at the goal with a button press.

Actions are deterministic (p(s ′|a, s) = 1.0 or p(s ′|a, s) = 0.0)Observations are deterministic (p(o|s) = 1.0 or p(o|s) = 0.0Measure the number of actions to reach the goal state and becertain that they are there.

Page 89: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Experiment 1: Manipulation

Manipulated the number of hallway units that composed theenvironment.

10, 20, 40, and 80 hallway units.

Increasing the layout size increased the memory/processing inthe four areas that we are interested in understanding.

Inefficient processing of observations (O)Inefficient access to their cognitive map (T)Inefficient belief updating (B)Inefficient decision process (POMDP)

Page 90: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Experiment 1: Environments

10 Hallway Environment

Page 91: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Experiment 1: Environments

80 Hallway Environment

Page 92: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Experiment 1: Results

Effect of layout complexity on navigationperformance

Increasing layout size increasednumber of actions to reach goal state.

Not too surprising . . . need to travelfarther.

Need to control for task difficulty indifferent environments.

Ideal navigator will be sensitive to taskdifficulty.

Page 93: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 94: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 95: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 96: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 97: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Using ideal navigator we can computethe optimal performance.

POMDP provides optimal performancewith no cognitive limitations.

Use performance to get a measure ofefficiency

Efficiency = IdealPerformanceHumanPerformance

Page 98: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 99: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 100: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 101: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Increasing layout complexity decreasednavigation efficiency.

Controlled for task difficulty usingideal navigator.

Page 102: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 103: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 104: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Experiment 1: Summary

Why does increasing layout complexity reduce humanefficiency when navigating with uncertainty?

Inefficient visual processing? (Experiment 2)

Longer hallways in larger environments.Difficult to process all the way to the end.

Limited access to cognitive map? (Experiment 3)

More to remember in larger environments.Perhaps difficult to access entire map when localizing.

Belief updating? (Experiment 3)

More uncertainty in larger environment.Have to consider more places in larger environments.

Suboptimal Decision Process? (Experiment 3)

More decision needed to be made in larger environment.

Page 105: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 106: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 107: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 108: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 109: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 110: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Experiment 3: Introduction

Provide subject with explicit knowledge that should beavailable internally.

Information provided is already available in the task.

Present information on computer screen using a “heads-up”display.

Three conditions:

1 No Map2 Map3 Map + Belief Vector

Page 111: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

No Map

Map Map + Belief Vector

Page 112: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

No Map Map

Map + Belief Vector

Page 113: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

No Map Map Map + Belief Vector

Page 114: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 115: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Experiment 3: Predictions

Accessing Cognitive Map

Effic

ienc

y

Belief Updating Map + Belief Vector

Page 116: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Effic

ienc

y

Belief Updating

Effic

ienc

y

Map + Belief Vector

Page 117: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Effic

ienc

y

Belief Updating

Effic

ienc

y

Map + Belief Vector

Effic

ienc

y

Page 118: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Experiment 3: Design

Training: Same as Experiment 1

Test: Same as Experiment 1 with an additional “heads-up”map display

Three map conditions:

1 No Map2 Map3 Map + Belief Vector

Page 119: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Improved performancewhen belief vector ismade explicit.

Suggests that the limitingcognitive factor innavigating withuncertainty is aninefficient belief updatingprocess.

Page 120: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 121: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 122: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 123: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Page 124: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Summary

Investigated human sequential decision making underuncertainty.

Used Ideal Observer analysis (Bayesian; POMDP) to studycognitive limitations in two different tasks.

Localized cognitive bottleneck in generating, updating andmaintaining an accurate belief vector.

Currently studying if effect generalizes to non-navigationaltask.

Seek & Destroy problemResults suggest belief updating is also the bottleneck.

Page 125: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Summary

Page 126: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Summary

Page 127: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Summary

Page 128: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Summary

Page 129: Understanding Human Sequential Decision Making Under ......Understanding Human Sequential Decision Making Under Uncertainty Using Ideal Observer Analysis Brian J. Stankiewicz, Ph.D

OutlineIntroduction

Summary

Interesting note:

Computationally, updating and generating an accurate beliefvector is fast and easy for the computer.However for human, it is difficultSelecting the optimal action given the belief vector is difficultcomputationally.However, data suggests that it is relatively easy for the human.Suggests a symbiotic relationship between computer andhuman.