Why ain’t you rich? Why our current understanding of “rational choice” isn't good enough for superintelligence

Post on 11-Jul-2020

TRANSCRIPT

Page 1: Why ain’t you rich? - Artificial Intelligence @ MIRI · Why ain’t you rich? Why our current understanding of “rational choice” isn't good enough for superintelligence. Nate

Why ain’t you rich? Why our current understanding of “rational choice” isn't good enough for superintelligence

Page 2

Nate Soares

We do foundational mathematical research to ensure smarter-than-human artificial intelligence has a positive impact.

Page 3

Questions?

Text M5642 + your question to 765-560-4177

Q: Text M5642 to 765-560-4177

Page 4

"City Lights of the United States 2012" by NASA Earth Observatory. http://goo.gl/7rvKLr for GeoTIFF original. http://goo.gl/pKdQwM for quicker access to the jpeg. Licensed under Public domain via Wikimedia Commons. http://goo.gl/W9Xjdx

Page 5

● Tiling agent theory

● Logical uncertainty

● Decision Theory

● Corrigibility

● Value learning

Page 7

Why decision theory?

We need some way to reason counterfactually.

Page 8

"Les lignes de la main Artlibre". Licensed under Free Art License via Wikimedia Commons. http://goo.gl/5dl6RK

Page 10

Causal Decision Theory (CDT)

1. Identify your action node A
2. Identify the available actions Acts
3. Identify your payoff node U
4. For each action a in Acts
   ○ Set A=a by overwriting A with a function that always returns a
   ○ Evaluate the expectation of U given that A=a
5. Take the action a with the highest associated value of U
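The CDT steps above can be sketched in Python. This is a minimal illustration, not the speaker's implementation: the names `WorldModel`, `expected_utility`, `cdt_choose`, and the toy `umbrella_world` problem are all hypothetical, and representing the world as a function from a forced action to a list of (probability, payoff) pairs is an assumption made for brevity.

```python
from typing import Callable, Dict, Iterable, List, Tuple

# Assumption: a world model maps a forced action to a distribution over
# payoffs, given as (probability, payoff) pairs. The slides do not
# specify a representation; this one is chosen for illustration only.
Outcome = Tuple[float, float]
WorldModel = Callable[[str], List[Outcome]]


def expected_utility(world: WorldModel, action: str) -> float:
    """Expectation of U after overwriting the action node A with `action`."""
    return sum(prob * payoff for prob, payoff in world(action))


def cdt_choose(world: WorldModel, acts: Iterable[str]) -> str:
    """Steps 4 and 5: score each intervention A=a, return the best action."""
    values: Dict[str, float] = {a: expected_utility(world, a) for a in acts}
    return max(values, key=values.get)


def umbrella_world(action: str) -> List[Outcome]:
    # Hypothetical toy problem: rain has probability 0.3 and does not
    # depend on the action, so intervening on A leaves it untouched.
    p_rain = 0.3
    if action == "umbrella":
        return [(p_rain, 5.0), (1 - p_rain, 8.0)]
    return [(p_rain, 0.0), (1 - p_rain, 10.0)]


best = cdt_choose(umbrella_world, ["umbrella", "no_umbrella"])
```

The key design point is that `world(action)` treats the action as an outside intervention: nothing else in the model is allowed to change in response to which action is being considered.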

Page 11

How do you construct counterfactuals?

CDT evaluates an action a by asking what would happen if, instead of being you, you were a simple function that always chose a.

Page 13

Token Trade

Payoffs are listed as (row player, column player).

          Give            Keep
Give      ($200, $200)    ($0, $300)
Keep      ($300, $0)      ($100, $100)

Page 14

Mirror Token Trade

Page 15

CDT loses

Given some probability p that TheirDecision=give

1. The action node is Give?
2. The actions are give and keep.
3. The payoff node is $.
4. If Give?=give then $=200p
5. If Give?=keep then $=300p + 100(1-p)
6. Take the action keep

Because 300p + 100(1-p) = 200p + 100 > 200p regardless of the value of p
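The calculation above can be checked with a short Python sketch. The payoffs come from the Token Trade table; the function names are hypothetical, and `mirror_payoff` rests on an assumption read off the slide title "Mirror Token Trade": that the other player is a copy of you and therefore ends up making the same choice.

```python
# Payoffs to "me" in Token Trade, indexed by (my_action, their_action),
# taken from the Token Trade table.
PAYOFF = {
    ("give", "give"): 200,
    ("give", "keep"): 0,
    ("keep", "give"): 300,
    ("keep", "keep"): 100,
}
ACTIONS = ("give", "keep")


def cdt_value(my_action: str, p_give: float) -> float:
    """Expected payoff with TheirDecision=give held fixed at probability p_give."""
    return (p_give * PAYOFF[(my_action, "give")]
            + (1 - p_give) * PAYOFF[(my_action, "keep")])


def cdt_choice(p_give: float) -> str:
    """CDT's pick: the action with the higher expected payoff for this p."""
    return max(ACTIONS, key=lambda a: cdt_value(a, p_give))


def mirror_payoff(my_action: str) -> int:
    """Assumption: in the mirror game the other player's action equals mine."""
    return PAYOFF[(my_action, my_action)]


# keep beats give by 100 for every p, matching 300p + 100(1-p) > 200p.
gaps = [cdt_value("keep", p / 10) - cdt_value("give", p / 10) for p in range(11)]

# What CDT actually walks away with in the mirror game, and what was available.
cdt_result = mirror_payoff(cdt_choice(0.5))
best_mirror_action = max(ACTIONS, key=mirror_payoff)
```

Under that mirror assumption, the CDT agent keeps its token and collects $100, while an agent that gives collects $200. The dominance argument is valid for any fixed p; the loss comes from treating p as independent of the agent's own choice.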

Page 18

Unfair game?

"Money Cash" by 2bgr8. http://goo.gl/iYHPxZ. Licensed under Creative Commons Attribution 3.0 via Wikimedia Commons. http://goo.gl/oDZyuU

Why ain’t you rich?

Fair enough for me.

Page 20

Leaky scenarios

"Water drops by Ximeg 24.12.12-02" by Ximeg. Own work. Licensed under Creative Commons Attribution-Share Alike 3.0 via Wikimedia Commons. http://goo.gl/ru7DKo

Known as “Newcomblike problems”

These scenarios are the norm.

Page 22

CDT agents would stop using CDT

Page 23

"Madrid may day375". Licensed under Creative Commons Attribution-Share Alike 2.5 via Wikimedia Commons. http://goo.gl/BtSY7U

Page 26

No self-correction

Page 27

Strange Blackmail

"USSR Blank Envelope 1989 - back" by Wesha. Own work. Licensed under Public domain via Wikimedia Commons. http://goo.gl/oIOC9Y

Page 30

Unrealistic?

Yes.

But...

Page 34

We don’t know what we’re doing

● How do you reason as if your action is connected to the reasoning of others?
   ○ We actually do have a way to solve this particular problem, but the solution is imperfect.

● What does good counterfactual reasoning look like?
   ○ And how does this affect your reasoning about how others are reasoning about you?

● We don’t yet understand an algorithm that knowably converges on a good decision making procedure.

Page 35

Doom by default

"Apocalypse-Albert Goodwin" by Albert Goodwin. http://goo.gl/tHwcLq. Licensed under Public domain via Wikimedia Commons. http://goo.gl/el27yq

● Tiling agent theory

● Logical uncertainty

● Decision Theory

● Corrigibility

● Value learning

Page 36

Doom by default

What formal reasoning system could an intelligent agent use to gain very high confidence in similar systems?

Page 37

Doom by default

Probability theory assumes we know all consequences of everything we know. How could an agent reason reliably under logical uncertainty?

Page 38

Doom by default

Intelligent agents have, by default, strong instrumental incentives to preserve their goals, by manipulation or deception if necessary. How do we avoid these?

Page 39

Doom by default

It is not enough to build something that understands what we want. We must build something that wants what we want.

Page 40

Doom by default

We won’t get good behavior for free.

Page 41

Dawn or doom?

It depends entirely upon whether we can figure out how to build a beneficial superintelligent system before we figure out how to build an arbitrary one.

"City Lights of the United States 2012" by NASA Earth Observatory. http://goo.gl/7rvKLr for GeoTIFF original. http://goo.gl/pKdQwM for quicker access to the jpeg. Licensed under Public domain via Wikimedia Commons. http://goo.gl/W9Xjdx

Page 42

Nate Soares
