beyond nash: raising the flag of rebellion
DESCRIPTION
Beyond Nash: Raising the Flag of Rebellion. Yisrael Aumann University of Haifa, 11 Kislev 5771 (18.11.10). Based on: Rational Expectations in Games by Robert Aumann and Jacques Dreze American Economic Review, March 08 http://www.ma.huji.ac.il/raumann/pdf/86.pdf. - PowerPoint PPT PresentationTRANSCRIPT
Beyond Nash:Raising the Flag of Rebellion
Yisrael Aumann
University of Haifa, 11 Kislev 5771 (18.11.10)
Based on:
Rational Expectations in Gamesby
Robert Aumann and Jacques Dreze
American Economic Review, March 08 http://www.ma.huji.ac.il/raumann/pdf/86.pdf
• The usual justification for Nash equilibrium is that if game theory is to recommend strategies to the players in a game, then the resulting strategy profile must be known, so each strategy must be a best reply to the others, so the strategies must be in equilibrium.
• What’s wrong with that?• Let’s first backtrack and ask three questions: 1. Why should decision making in games be different
from ordinary (one-person) decision making? Why not just maximize, given our belief about what the others do?
2. Isn’t something vital missing in the description of a game – namely, it’s context?
(Examples: – Coalition government formation – Driving on a one-lane road) 3. What about multiple equilibria? (Is Harsanyi-Selten equilibrium selection the answer?)
• Re 1: Suggested by Kadane & Larkey (Man. Sci., 1982).
K&L ignored the interactive nature of games,
but they didn’t have to.
We’ll show how to incorporate it.• Re 2: This suggests looking at game situations –
games with a context – rather than just games.• Re 3: The answer might be Question 2: Different
equilibria are associated with different contexts.
Formal Definition:
Game Situation :=
Game with belief hierarchies
Assumptions:
1. Common Knowledge of Rationality (CKR)
2. Common Priors (CP)
So now let’s return to our discussion. We said that • The usual justification for Nash equilibrium is that if game
theory is to recommend strategies to the players in a game, then the resulting strategy profile must be known, so each strategy must be a best reply to the others, so the strategies must be in equilibrium.
and we asked• What’s wrong with that?
The answer is that • Game Theory need not recommend any particular
strategy. It can—indeed should—recommend to each player simply to maximize given his private information.
Note that• Nash equilibrium results only in the special case when
the private information is commonly known—in particular, when each one knows what all believe.
Formal Definition:Game Situation :=
Game with belief hierarchies
Assumptions:
1. Common Knowledge of Rationality (CKR)
2. Common Priors (CP)
Definition:
A rational expectation of a player in a game G is
her expectation in some game situation based on G,
with CKR and CP.
Theorem A:
Every rational expectation in a two-person
zero-sum game is that game’s value.
Theorem B:
The rational expectations of a player in a game are
precisely her conditional payoffs (expected payoffs
to her individual pure strategies) when a correlated
equilibrium is played in the “doubled” game: that in
which each of her pure strategies is written twice.
Belief Hierarchies and Belief Systems
Definition: A belief system for a game consists of a set of
types for each player, where a type of player determines
i. his strategy, and
ii. his beliefs: probabilities on the other players’ types.
CKR obtains if all types of all players maximize given their
beliefs. CP obtains if the beliefs have a common prior.
Thm (Harsanyi, 1967). Every belief hierarchy is derived
from some belief system.
Example 1:
Conditional payoff to T = 4
Conditional payoff to B = 7
0,07,2
2,76,6
0⅓
⅓⅓
01
½½
L
L L
R
R R
T
TT
B
B
B
6,62,7
7,20,0
½½
⅞⅛
½⅞
½⅛
T
B
L R
T
B
7/227/22
7/221/22
T
B
L R
Note: Rational Expectations of different players may be mutually inconsistent.
Here the expectations for (B, R) are (6⅛, 6⅛), which is infeasible.
Example 2:
Original Game:
Doubled Game:
0,05,44,5
4,5
5,4
0,05,4
4,50,0
L M R
T
C
B
4,5
5,44,5
4,5
5,4
0,0
0,0
4,50,0
0,05,44,5
0,0
5,4
0,05,4
5,4
4,5
T1
T2
C1
C2
B1
B2
Note 1: The conditional payoffs change when the
game is doubled; there are then more such
payoffs. Thus in Example 2, in the original game
5 is not a conditional payoff, whereas in the
doubled game, it is.
4,5
5,44,5
4,5
5,4
0,0
0,0
4,50,0
0,05,44,5
0,0
5,4
0,05,4
5,4
000
0
1/12
1/6 1/6
1/61/6
1/12
0
00
4,5 1/6
Indeed, consider this correlated equilibrium of thedoubled game:
Here, 5 is the conditional payoff to T1.
0
0
0 0
T2
T1
B1
C1
C2
B2
L M R
Proof Outline for Theorem B:
Suppose there are just two players. A belief
hierarchy of a player can be represented by a type
of that player, a la Harsanyi; each type of each
player is characterized by a pure strategy of that
player, and probabilities for the other player’s
types. Having a CP (common prior) means that
these probabilities are conditionals that derive from
a single distribution on pairs of types.
In Example 2, the situation might look like this:
The rows and columns are types; the entries in the matrix are probabilities that add to 1 overall—the CP. Requiring CKR means that it is optimal for each type to play the pure strategy that that type specifies.
L1L2M1M2M3M4M5R1
T1
T2
T3
C1
C2
C3
C4
B1
B2
Hence, this is a correlated equilibrium of the game
;
the rows and columns are now pure strategies, whose conditional payoffs are the expectations of the corresponding types. “Amalgamating” the copies of each column (adding the corresponding probabilities)
L1L2M1M2M3M4M5R1
T10,00,04,54,54,54,54,5 5,4
T20,00,04,54,54,54,54,55,4
T30,00,04,54,54,54,54,55,4
C15,45,40,00,00,00,00,04,5
C25,45,40,00,00,00,00,04,5
C35,45,40,00,00,00,00,04,5
C45,45,40,00,00,00,00,04,5
B14,54,55,45,45,45,45,40,0
B24,54,55,45,45,45,45,40,0
yields a correlated equilibrium of the game
; note that the conditional payoffs to the row player remain unchanged. Amalgamating
LMR
T10,04,5 5,4
T20,04,55,4
T30,04,55,4
C15,40,04,5
C25,40,04,5
C35,40,04,5
C45,40,04,5
B14,55,40,0
B24,55,40,0
LMR
T10,04,55,4
T2 ,T30,04,55,4
C5,40,04,5
B4,55,40,0
rows as indicated yields a correlated equilibrium of
. The conditional payoff to strategy T1 is the same as the expectation of type T1 in the original type space.
By doubling C and B, and assigning 0 probabilities
to the new rows, we conclude that the expectation
of type T1 is a conditional payoff to a correlated
equilibrium in the doubled game. Similarly for all
types. But the expectations of the types are
precisely all the rational expectations in the given
game. QED
☺
In Economics, “a rational expectation is one that is the sameas the prediction of the relevant economic theory” (Muth, 1961).
Slightly rephrased: the players know the relevant theory (andof course, that it applies to the situation at hand).
In games, the relevant theory takes all players to be rational.
So all players know that all are rational.
So all know that
So all know that…So, CKR.______________________________________________
Next, the “relevant” theory may be thoughtof as yielding a probability distribution p on profiles of beliefs of the players.
But each player knows her own beliefs.
So her beliefs are the conditional of p givenher knowledge.
That is CP.
Discussion of Theorem A
Traditional arguments for the minmax value v of a 2-person 0-sum game:
Guaranteed Value: In expectation, the row playercan guarantee at least v, and the column playercan guarantee paying at most v. “So” --- rationalplayers must end up expecting precisely v.
Equilibrium
Rational Expectations as Benchmarks
!תודה