polymatrix games: algorithms and applications · rahul savani department of computer science...

Polymatrix Games:Algorithms and Applications

Rahul Savani

Department of Computer ScienceUniversity of Liverpool

Tutorial at theConference on Web and Internet Economics

WINE 2015

Some of talk relates to joint work with Argyrios Deligkas, John Fearnley,Paul Goldberg, Paul Spirakis, and Bernhard von Stengel

What is a polymatrix game?

Polymatrix games are many-player games

For us, they are graphical games:player interactions are captured by an interaction graph(though sometimes this graph is assumed to be complete)

They model pairwise interactions

Nodes correspond to players

Edges correspond to bimatrix games

Each player chooses a single strategy for all hisbimatrix games and receives the sum of the payoffsfrom his bimatrix games

History of polymatrix games

Introduced in:

Janovskaya (1968)Equilibrium points in polymatrix games (in Russian)Latvian Mathematical Collection

We will touch on the following papers here:

Both classical:

Eaves 1973 [9]

Howson 1972 [15]

Howson & Rosenthal 1974 [16]

Miller & Zucker 1991 [19]

And more recent:

Cai et al 2015 [4]

Fearnley et al 2015 [8]

Mehta 2012 [18]

Govindan & Wilson 2004 [14]

Rubinstein 2015 [21]

Polymatrix game

n players i = 1, . . . , n

finite pure strategy sets Si

payoff matrices for every player i and j , i

A ij∈ R

|Si |×|Sj |

For mixed profile (x1, . . . , xn), the payoff to player i is

ui(x1, . . . , xn) =∑i,j

(xi)>A ijx j

Polymatrix game

n players i = 1, . . . , n

finite pure strategy sets Si

payoff matrices for every player i and j , i

A ij∈ R

|Si |×|Sj |

For mixed profile (x1, . . . , xn), the payoff to player i is

ui(x1, . . . , xn) =∑i,j

(xi)>A ijx j

Example polymatrix game

a ba 0,0 2,2b 2,2 0,0

a ba 0,0 1,1b 1,1 0,0

Equilibria:

(0.5, 0.5) (0.5, 0.5) (0.5, 0.5)

Example polymatrix game

a ba 0,0 2,2b 2,2 0,0

a ba 0,0 1,1b 1,1 0,0

Equilibria:

(0.5, 0.5) (0.5, 0.5) (0.5, 0.5)

Advantage: succinctness

In terms of the number of players, the size of a

strategic-form game is exponential

polymatrix game is polynomial (quadratic)

# players # actions(per player)

# payoffentries

strategic-formn k

n × k n

polymatrix 2k 2 × (n2)

Applications

Polymatrix games are general modelling tool for multi-playergames via pairwise interactions

We will also discuss some other applications from theliterature:

1 Relaxation Labelling Problems for Artificial Neural Networks [19]

2 Graph Transduction in Machine Learning [10]

3 To model 2-player Bayesian Games [16]

4 As a sub-routine for solving general multi-player games [14]

Take-home message

Many things carry over from bimatrix to polymatrix games:

Rational equilibria

Formulation as a Linear Complementarity Problem

Applicability of complementary pivoting algorithms (e.g.Lemke-Howson, Lemke)

Descent methods using Linear Programming for findingApproximate Equilibria

There are also important differences. For polymatrix games:

PPAD-hard to find ε-Nash equilibrium for constant ε

Finding a pure equilibrium is PLS-hard

Take-home message

Many things carry over from bimatrix to polymatrix games:

Rational equilibria

Formulation as a Linear Complementarity Problem

Applicability of complementary pivoting algorithms (e.g.Lemke-Howson, Lemke)

Descent methods using Linear Programming for findingApproximate Equilibria

There are also important differences. For polymatrix games:

PPAD-hard to find ε-Nash equilibrium for constant ε

Finding a pure equilibrium is PLS-hard

Outline

1 Nash equilibria of bimatrix games

2 Linear Complementarity Problems (LCPs)

3 The Lemke–Howson Algorithm and the class PPAD

4 Lemke’s algorithm

5 PLS-hardness of pure equilibria, Graph Transduction

6 Reduction from Polymatrix Game to LCP

7 Descent method for ε-Nash equilibria of polymatrix games

8 Other recent work on polymatrix games

Nash equilibria of bimatrix games

3 31 0

2 50 2

0 64 3

Nash equilibria of bimatrix games

3 31 0

2 50 2

0 64 3

Nash equilibrium =

pair of strategies x, y with

x best response to y andy best response to x

Mixed equilibria

3 31 0

2 50 2

0 64 3

3 32 50 6

( 1/3 2/3)T

xT B =

01/32/3

0 24 3

8/3 8/3)

only only pure best responses canhave

probability > 0

Outline

Linear Complementarity Problem

Given: q ∈ Rn, M ∈ Rn×n Find: z, w ∈ Rn so that

z ≥ 0 ⊥ w = q + Mz ≥ 0

⊥ means orthogonal:

zT w = 0⇔ ziwi = 0 all i = 1, . . . , n

If q ≥ 0, the LCP has trivial solution w = q , z = 0.

Linear Complementarity Problem

z ≥ 0 ⊥ w = q + Mz ≥ 0

⊥ means orthogonal:

zT w = 0⇔ ziwi = 0 all i = 1, . . . , n

If q ≥ 0, the LCP has trivial solution w = q , z = 0.

LP in inequality form

primal : max cT xsubject to Ax ≤ b

x ≥ 0

dual : min yT b

subject to yT A ≥ cT

y ≥ 0

x ≥ 0

dual : min yT b

y ≥ 0

Weak duality: x, y feasible (fulfilling constraints)

⇒ cT x ≤ yT Ax ≤ yT b

x ≥ 0

dual : min yT b

y ≥ 0

Strong duality: primal and dual feasible

⇒ ∃ feasible x, y : cT x = yT b (x, y optimal)

LCP generalizes LP

LCP encodes complementary slackness of strong duality:

cT x = yT Ax = yT b

⇔ (yT A − cT )x = 0, yT (b − Ax) = 0.

≥ 0 ≥ 0 ≥ 0 ≥ 0

LP⇔ LCP

)︸︷︷︸

≥ 0 ⊥(−c

)︸︷︷︸

−A 0

)︸︷︷︸

LCP generalizes LP

LCP encodes complementary slackness of strong duality:

cT x = yT Ax = yT b

⇔ (yT A − cT )x = 0, yT (b − Ax) = 0.

≥ 0 ≥ 0 ≥ 0 ≥ 0

LP⇔ LCP

)︸︷︷︸

≥ 0 ⊥(−c

)︸︷︷︸

−A 0

)︸︷︷︸

Outline

Symmetric equilibria of symmetric games

Given: n n payoff matrix A for row player AT for column player

mixed strategy x = probability distribution on {1,...,n} x 0 , 1Tx = 1

equilibrium (x, x) x best response to x

Remark: As general as m n games (A, B).

Best responses

Given: n n payoff matrix A, mixed strategy y of column player

Ay = vector of expected payoffs against y, components (Ay)i

x best response to y

x maximizes expected payoff xTAy

best response condition:

∀i : xi > 0 (Ay)i = u = maxk (Ay)k

Symmetric equilibria as LCP solutions

equilibrium (x, x) of game with payoff matrix A x best response to x

1Tx = 1,

x 0 Ax ≤ 1u

w.l.o.g. A > 0 u > 0,

equilibrium (x, x)

z = (1/u) x ( 1/u = 1Tz ),

z 0 Az ≤ 1 "equilibrium z"

Best response polyhedron

2 0A =

u<>x 0,{ ( , ) |x u }1Tx= 1, x uA 1

2 0A =

u<>x 0,{ ( , ) |x u }1Tx= 1, x uA 1

2 0A =

u<>x 0,{ ( , ) |x u }1Tx= 1, x uA 1

(2/3, 1/3)

(completely labeled)equilibrium

Projective transformation

2 0A =

u<>x 0,{ ( , ) |x u }1Tx= 1, x uA 1

>x 0, <xA 1{ ( , ) |1x }1

>z 0, <zA 1

Best response polytope

{ |z }

2 0A =

Symmetric Lemke−Howson algorithm

(bottom)

(back)

1missing label

(bottom)

(back)

1missing label

(bottom)

(back)

1missing label

(bottom)

(back)

1missing label

(bottom)

(back)

1missing label

(bottom)

(back)

found label 1

(bottom)

(back)

Why Lemke-Howson works

LH finds at least one Nash equilibrium because

• finitely many "vertices"

for nondegenerate (generic) games:

• unique starting edge given missing label

• unique continuation

precludes "coming back" like here:

END OF LINE (Papadimitriou 1991)

Given a graph G ofindegree/outdegree at most 1,and a start vertex of indegree 0and outdegree 1,find another vertex of degree1

start0000

Catch:graph is exponentially largedefined by two boolean circuitsS , P that take a vertex in {0, 1}n

and output its successor andpredecessor

S(0000) = 0101

P(0101) = 0000

A problem belongs to PPAD if itis reducible in poly-time to ENDOF LINE; and PPAD-completeif END OF LINE is reducible toit.

Not to be confused with

OTHER END OF THIS LINE

output unique vertex endfound by “following the line”from the start – this isPSPACE-hard

PPAD-hardness for bimatrix games

Theorem (DGP06, CDT06 [5, 6])

It is PPAD-complete to compute an exact Nash equilibrium of abimatrix game.

Later we will see PPAD-hardness for approximate equilibriaof bimatrix and polymatrix games

Outline

Costs instead of payoffs

1 2 2 1

2 0 1 3

aik 3 − aik

payoff cost

with new cost matrix A > 0 :

equilibrium z z 0 Az 1

Polyhedral view

1z ≥ 0

2z1z ≥ 1

2z ≥ 0

Lemke's algorithm

given LCP

z 0 w = q + Mz 0

Lemke's algorithm

augmented LCP

z 0 w = q + Mz + dz0 0 z0 0

Lemke's algorithm

augmented LCP

z 0 w = q + Mz + dz0 0 z0 0

d > 0 covering vectorz0 extra variable

z0 = 0 z w solves original LCP

Lemke's algorithm

augmented LCP

z 0 w = q + Mz + dz0 0 z0 0

Initialization:

z 0 w = q + dz0 0

z0 0 minimal wi = 0 for some i

pivot z0 in, wi out,

can increase zi while maintaining z w .

Lemke's algorithm for

M = 2 1 , d = 2 1 3 1

w1 −1 2 1 2= + z1 + z2 + z0

w2 −1 1 3 1

w1 1 0 −5 −2= + z1 + z2 + w2

z0 1 −1 −3 −1

w1 −1 2 1 2= + z1 + z2 + z0

w2 −1 1 3 1

w1 1 0 −5 −2= + z1 + z2 + w2

z0 1 −1 −3 −1

z2 0.2 0 −0.2 −0.4= + z1 + w1 + w2

z0 0.4 −1 0.6 0.2

w1 1 0 −5 −2= + z1 + z2 + w2

z0 1 −1 −3 −1

z2 0.2 0 −0.2 −0.4= + z1 + w1 + w2

z0 0.4 −1 0.6 0.2

z2 0.2 0 −0.2 −0.4= + z0 + w1 + w2

z1 0.4 −1 0.6 0.2

Polyhedral view of Lemke

0z = 0

Outline

The class PLS (Polynomial Local Search)

s Given a starting solutions ∈ S = Σn

a P-time algorithm thatcomputes the cost c(s)

a P-time function that computesa neighbouring solutions′ ∈ N(s) with lower cost, i.e.s.t. c(s′) < c(s), or reportsthat no such neighbour exists:

find a local optimum of thecost function c

“every DAG has a sink”

Local Max Cut

Find local optimum ofMax Cut with the FLIP-neighbourhood (exactly onenode can change sides)

Schaffer and Yannakakis [22] showed that Local Max Cutis PLS-complete (via an extremely involved reduction)

Local Max Cut is to PLS what 3-SAT is to NP

Local Max Cut

Solutions:

{{1, 3, 4}, {2}} (actual Max Cut)

Local Max Cut

Solutions:

{{1, 3, 4}, {2}} (actual Max Cut){{3}, {1, 2, 4}}

Pure Equilibrium in Polymatrix Game

−1 2

a ba 0,0 2,2b 2,2 0,0

a ba 0,0 -1,-1b -1,-1 0,0

a ba 0,0 2,2b 2,2 0,0

a ba 0,0 -1,-1b -1,-1 0,0

The bimatrix games (A ,B) we used are examples of teamgames because A = B; also called coordination games

Proof that the reduction is correct

Define potential function for “team” polymatrix games

Φ(S) =12

This is an exact potential function:when i changes strategy then the potential functionchanges by exactly i’s change in utilityFact: in exact potential games,pure equilibria↔ local optima of exact potentialfunctionOur exact potential function value equals value of the cutfor all strategy profiles

Summary on PLS and polymatrix games

In contrast to bimatrix games, computing a pureequilibrium in polymatrix games is PLS-hard

Next, an application of team polymatrix games

Application: Graph Transduction

semi-supervised learning: estimate a classificationfunction defined over graph of labeled and unlabeled nodes

ie. propagate labels to unlabelled nodes in consistent way

INPUT: Weighted graph, where some nodes are labelled;

edge weights represent similarities

one approach is to use global optimization

an alternative approach is to use a polymatrix game

Note: without the labelled examples, this is a clusteringproblem; also see e.g., “Hedonic Clustering Games” [12, 2]

a ba 2,2 0,0b 0,0 2,2

a ba -1,-1 0,0b 0, 0 -1,-1

Note: asymmetric similarity measures have also beenconsidered. Then we may no longer have pure equilibria, butmixed equilibria are still considered meaningful

a ba 2,2 0,0b 0,0 2,2

a ba -1,-1 0,0b 0, 0 -1,-1

Note: asymmetric similarity measures have also beenconsidered. Then we may no longer have pure equilibria, butmixed equilibria are still considered meaningful

Open question for team polymatrix games

Can we compute a mixed Nash equilibrium of a teampolymatrix game in polynomial-time? [7]

Note that this problem lies in PPAD ∩ PLS so is unlikely to behard for either of them

Question:

Can anyone think of an easy mixed equilibrium for thelocal max cut game?

polymatrix games: algorithms and applications · rahul savani department of computer science...

Documents

alapgiri rahul

evolution of coronal mass ejection morphology with...

student school of hand book - p p savani university...

beyond local nash equilibria for adversarial...

rahul jain

rahul bhargava acedemic conference 2017 rahul

ards rahul

rahul dravid

on the approximation performance of fictitious play in...

rahul bajaj

savani financials limitedsavani financials limited savani...

rahul report

african banking corporation limited annual report … · 1...

rahul sharma

rahul enterprises

rahul raj.pptx

savani financials...

the radial width of a coronal mass ejection between 0.1...

rahul entrepreneur

rahul bhati.pptx