simple search methods for finding a nash equilibrium ryan porter, eugene nudelman, & yoav shoham...

21

Simple Search Methods for Finding a Nash Equilibrium Ryan Porter, Eugene Nudelman, & Yoav Shoham Computer Science Department Stanford University

Upload: lucas-bowman

Post on 26-Mar-2015

216 views

Category:

Documents

1 download

Report

Download

Tags:

Embed Size (px):

TRANSCRIPT

Page 1: Simple Search Methods for Finding a Nash Equilibrium Ryan Porter, Eugene Nudelman, & Yoav Shoham Computer Science Department Stanford University

Simple Search Methods for Finding a Nash Equilibrium

Ryan Porter, Eugene Nudelman, & Yoav Shoham

Computer Science Department

Stanford University

Page 2: Simple Search Methods for Finding a Nash Equilibrium Ryan Porter, Eugene Nudelman, & Yoav Shoham Computer Science Department Stanford University

Finding a Sample Nash Equilibrium

Nash equilibrium (NE) Arguably the most important concept in game theory One always exists [N51]

Finding a sample NE in a normal form game: Considered hard, but unknown whether it is NP-hard State of the art among existing algorithms:

Lemke-Howson [LH64] Simplicial Subdivision [VV87] Govindan-Wilson [GW03] & [BSK03]

Our algorithms: simple Artificial Intelligence methods that perform well in practice

2-player games

N-player games

Page 3: Simple Search Methods for Finding a Nash Equilibrium Ryan Porter, Eugene Nudelman, & Yoav Shoham Computer Science Department Stanford University

Notation

Normal Form Game G = h N,(Ai),(ui) i:

N = {1,…,n}: set of players

Ai: set of available actions for player i

ui: A1 x …x An ! <

Player i selects a mixed strategy:

pi: Ai ! [0,1], s.t. ai 2 Ai pi(ai) = 1

Utility function extended to take p=(p1,…,pn):

ui(p) = a 2 A ui(a) i 2 N pi(ai)

A strategy profile p* is a NE if:

8 i 2 N, ai 2 Ai: ui(ai,p*-i) ≤ ui(p*

i,p*-i)

1,-1 -1,1

-1,1 1,-1

1/2

1/2

0

0

0 0

1/2 1/2

Page 4: Simple Search Methods for Finding a Nash Equilibrium Ryan Porter, Eugene Nudelman, & Yoav Shoham Computer Science Department Stanford University

A Harder Game

2,3 -1,4 2,4 5,2 1,-1

2,2 3,0 4,1 -2,4 1,3

4,6 7,2 2,-2 4,9 2,1

9,0 -2,6 6,3 7,0 0,5

3,2 6,1 2,5 5,3 1,05/11

2/11

0

0

4/11

3/7 2/72/70 0

Page 5: Simple Search Methods for Finding a Nash Equilibrium Ryan Porter, Eugene Nudelman, & Yoav Shoham Computer Science Department Stanford University

Searching Over Supports

Feasibility Problem: Input: S = (S1,...,SN), where 8 i 2 N, Si µ Ai Find: p=(p1,…,pn) and v=(v1,…,vn) Subject to:

8 i 2 N

8 ai 2 Si, pi(ai) ≥ 0

8 ai 2 Si, pi(ai) = 0

ai 2 Ai pi(ai) = 1

8 ai 2 Si, a-i 2 A-i ui(ai,a-i) ji p(aj) = vi

8 ai 2 Si, a-i 2 A-i ui(ai,a-i) ji p(aj) ≤ vi

2,3 -1,4 2,4 5,2 1,-1

2,2 3,0 4,1 -2,4 1,3

4,6 7,2 2,-2 4,9 2,1

9,0 -2,6 6,3 7,0 0,5

3,2 6,1 2,5 5,3 1,05/11

2/11

0

0

4/11

3/7 2/72/70 0

22 3 3 3

2

2

4

4

4

Page 6: Simple Search Methods for Finding a Nash Equilibrium Ryan Porter, Eugene Nudelman, & Yoav Shoham Computer Science Department Stanford University

Features of Algorithm

1) Prefer balanced supports

2) Prefer small supports Motivated by existing theoretical results for particular

distributions (e.g., [MB02])

3) Separately instantiate supports, and remove conditionally dominated actions: An ai is conditionally dominated, given R-i µ A-i if:

9 ai' 2 Ai, 8 a-i 2 R-i, ui(ai,a-i) < ui(ai',a-i) Especially useful in conjunction with (2)

Page 7: Simple Search Methods for Finding a Nash Equilibrium Ryan Porter, Eugene Nudelman, & Yoav Shoham Computer Science Department Stanford University

Two-Player Algorithm

FOR ALL x = (x1,x2), sorted in increasing order of

|x1 – x2| and (x1 + x2)

FOR ALL S1 µ A1 s.t. |S1| = x1

A2' ← {a2 2 A2 not conditionally dominated, given S1}

IF @ a1 2 S1 conditionally dominated, given A2'

FOR ALL S2 µ A2' s.t. |S2| = x2

IF @ a1 2 S1 conditionally dominated, given S2

IF Feasibility Problem satisfied for (S1,S2)

Return found NE p

Page 8: Simple Search Methods for Finding a Nash Equilibrium Ryan Porter, Eugene Nudelman, & Yoav Shoham Computer Science Department Stanford University

N-Player Algorithm

Constraint Satisfaction Problem (CSP) for each support size profile x=(x1,x2): Variables: Si

Domain: all subsets of Ai of size xi

Constraint: support profile S is consistent with a NE 2-player algorithm:

Backtracking, enforcing arc consistency w.r.t. weaker constraints that no conditionally dominated actions in S

N-player algorithm: Generalizes the 2-player algorithm Ordering of size and balance reversed

Page 9: Simple Search Methods for Finding a Nash Equilibrium Ryan Porter, Eugene Nudelman, & Yoav Shoham Computer Science Department Stanford University

Experimental Results

Most previous empirical tests only on “random” games: Each payoff drawn independently from uniform distribution

GAMUT [NWSL04] Based on extensive literature search Generates games from a wide variety of distributions Available at http://gamut.stanford.edu

D1 Bertrand Oligopoly D2 Bidirectional LEG, Complete Graph

D3 Bidirectional LEG, Random Graph D4 Bidirectional LEG, Star Graph

D5 Covariance Game: = 0.9 D6 Covariance Game: = 0

D7 Covariance Game: Random 2 [-1/(N-1),1] D8 Dispersion Game

D9 Graphical Game, Random Graph D10 Graphical Game, Road Graph

D11 Graphical Game, Star Graph D12 Location Game

D13 Minimum Effort Game D14 Polymatrix Game, Random Graph

D15 Polymatrix Game, Road Graph D16 Polymatrix Game, Small-World Graph

D17 Random Game D18 Traveler’s Dilemma

D19 Uniform LEG, Complete Graph D20 Uniform LEG, Random Graph

D21 Uniform LEG, Star Graph D22 War Of Attrition

Page 10: Simple Search Methods for Finding a Nash Equilibrium Ryan Porter, Eugene Nudelman, & Yoav Shoham Computer Science Department Stanford University

2-player Games

Tested on 100 2-player, 300-action games for each of 22 distributions Capped all runs at 1800s

0.01

0.1

1

10

100

1000

10000

Distribution

Tim

e (

s)

Algorithm 1 Lemke-Howson

Page 11: Simple Search Methods for Finding a Nash Equilibrium Ryan Porter, Eugene Nudelman, & Yoav Shoham Computer Science Department Stanford University

2-player Games: Scaling

1

10

100

1000

10000

400 500 600 700 800 900 1000

Actions

Tim

e (

s)

Algorithm 1 Lemke-Howson

Page 12: Simple Search Methods for Finding a Nash Equilibrium Ryan Porter, Eugene Nudelman, & Yoav Shoham Computer Science Department Stanford University

2-player Games: Covariance Games

Covariance Games: For each action profile, payoffs of all players drawn from a multivariate normal distribution, with identical covariance between any two players

0.01

0.1

1

10

100

1000

10000

-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6 0.8 1

Covariance

Tim

e (

s)

Page 13: Simple Search Methods for Finding a Nash Equilibrium Ryan Porter, Eugene Nudelman, & Yoav Shoham Computer Science Department Stanford University

N-player Games

Tested on 100 6-player, 5-action games for each distribution

0.001

0.01

0.1

1

10

100

1000

10000

Distribution

Tim

e (

s)

Algorithm 2 Simplicial Subdivision Govindan-Wilson

Page 14: Simple Search Methods for Finding a Nash Equilibrium Ryan Porter, Eugene Nudelman, & Yoav Shoham Computer Science Department Stanford University

N-player Games: Scaling

6-Action, Random Games

0.01

0.1

1

10

100

1000

10000

3 4 5 6 7 8

Players

Tim

e (

s)

5-Player, Random Games

0.1

1

10

100

1000

10000

3 4 5 6 7 8

Actions

Tim

e (

s)

Algorithm 2

Simplicial Subdivision

Govindan-Wilson

Page 15: Simple Search Methods for Finding a Nash Equilibrium Ryan Porter, Eugene Nudelman, & Yoav Shoham Computer Science Department Stanford University

N-player Games: Covariance Games

0.01

0.1

1

10

100

1000

10000

-0.2 0 0.2 0.4 0.6 0.8 1

Covariance

Tim

e (

s)

Page 16: Simple Search Methods for Finding a Nash Equilibrium Ryan Porter, Eugene Nudelman, & Yoav Shoham Computer Science Department Stanford University

BFS Lemke-Howson

Lemke-Howson algorithm: Pivoting method to solve LCP for a 2-player game First pivot is an arbitrary selection of a1 2 A1

Afterwards, a deterministic path to a NE Idea: favor “simple” solutions Breadth-First Search:

FOR ALL a1 2 A1

Initialize Lemke-Howson(a1)

REPEAT

FOR ALL a1 2 A1

Pivot Lemke-Howson(a1)

IF found a NE, THEN return p

Page 17: Simple Search Methods for Finding a Nash Equilibrium Ryan Porter, Eugene Nudelman, & Yoav Shoham Computer Science Department Stanford University

2-player “Random” Games

1.18

208

1.18

0.1

1

10

100

1000

Algorithm 1 Lemke-Howson BFS Lemke-Howson

Tim

e (

s)

Page 18: Simple Search Methods for Finding a Nash Equilibrium Ryan Porter, Eugene Nudelman, & Yoav Shoham Computer Science Department Stanford University

2-player Games: Covariance Games

0.01

0.1

1

10

100

1000

10000

-1 -0.5 0 0.5 1

Covariance

Tim

e (

s)

Page 19: Simple Search Methods for Finding a Nash Equilibrium Ryan Porter, Eugene Nudelman, & Yoav Shoham Computer Science Department Stanford University

Summary

CSP-based algorithms Heuristics:

Favor balanced and small supports Eliminate conditionally dominated strategies

Perform well in practice BFS Lemke-Howson

In preliminary results, performs even better than our 2-player algorithm

Commentary on problem: Games researchers care about tend to have at least

one “simple” solution

Page 20: Simple Search Methods for Finding a Nash Equilibrium Ryan Porter, Eugene Nudelman, & Yoav Shoham Computer Science Department Stanford University

Future Work

Coming to Gambit Focus on “Covariance” Games, with low covariance Other techniques from Artificial Intelligence

Local Search: State: support profile Operators: add or delete an action Score: based on relaxation of the feasibility problem

Page 21: Simple Search Methods for Finding a Nash Equilibrium Ryan Porter, Eugene Nudelman, & Yoav Shoham Computer Science Department Stanford University

Simple Search Methods for Finding a Nash Equilibrium

Ryan Porter, Eugene Nudelman, & Yoav Shoham

Computer Science Department

Stanford University

Lehman-O'Callaghan-Shoham JACM-02

NUDELMAN RARE BOOKS

Nudelman Couture Sample

CRM and Ecommerce. Yoav Kutner

shoham/www papers/Yoav … · Web viewAUTHOR = "H. Yanco and L.A. Stein", TITLE ="{An Adaptive Communication Protocol for Cooperating. Mobile Robots}",

WCL328 - Windows Intune for the Enterprise David Nudelman Senior Consultant – Microsoft MVP OCSL - UK

CATALOG 29 - Nudelman Rare Books

Mastering the Art of Rewarded Video | Tal Shoham

Varda Shoham, Ph.D. Senior Advisor for Translational Research NIMH

Shlomo Giora Shoham · Shlomo Giora Shoham : œuvres (16 ressources dans data.bnf.fr) Œuvres textuelles (14) HaʿEd šekašal (2013) The myth of Tantalus (2005) Riyq lloʾ śwbaʿ

CONTROL COORDINATION OF MULTIPLE AGENTS ...CONTROL COORDINATION OF MULTIPLE AGENTS THROUGH DECISION THEORETIC AND ECONOMIC METHODS 6. AUTHOR(S) Yoav Shoham 5. FUNDING NUMBERS C - F30602-98-C-0214

Yoav Lerman Thesis

Yoav Liberman's Portable Bench

Taming the Computational Complexity of Combinatorial Auctions Kevin Leyton-Brown Yoav Shoham

ai.stanford.eduai.stanford.edu/~shoham/www papers/Yoav Bibfiles/who… · Web viewauthor = {Khalil Sima'an and Alon Itai and Yoad Winter and Alon Altman and Noa Nativ},

Yoav Livneh [email protected]

Instant Dehazing of Images Using Polarizationwebee.technion.ac.il/~yoav/publications/hazecvpr.pdf · Instant Dehazing of Images Using Polarization Yoav Y. Schechner, Srinivasa G

ai.stanford.eduai.stanford.edu/users/shoham/www papers/Yoav Bibfiles… · Web viewAmerican Industrial Enterprize}", PUBLISHER = "MIT Press, Cambridge, Mass", YEAR = 1962} ... TITLE

JVM Memory Model - Yoav Abrahami, Wix

1 On the Emergence of Social Conventions: modeling, analysis and simulations Yoav Shoham & Moshe Tennenholtz Journal of Artificial Intelligence 94(1-2),

Yoav Benjamini, "In the world beyond p

Software Transactional Memory Yoav Cohen Seminar in Distributed Computing Spring 2007 Yoav Cohen Seminar in Distributed Computing Spring 2007

IP Expo 2013 - Migration strategies for end of life - David Nudelman

Attenuating Natural Flicker Patterns Yoav Y. Schechner Nir Karpel Support: Taub Foundation, Ollendorff Foundation (BMBF), ISF Ack: Yoav Fhiler, Naftali

Combinatorial Auction Glossary...1 Forthcoming in Peter Cramton, Yoav Shoham, and Richard Steinberg (eds.), Combinatorial Auctions, MIT Press, 2006.Combinatorial Auction Glossary additive

Simple search methods for finding a Nash equilibrium Ryan Porter, Eugene Nudelman, and Yoav Shoham Games and Economic Behavior, Vol. 63, Issue 2. pp. 642-661,

Approaches to Artificial Intelligence · PDF fileEconomic Approaches to Artificial Intelligence, Michael Wellman Massively Parallel AI, Dave Waltz Agent-OrientedProgramming, Yoav Shoham

E-commerce Berlin Expo - Yoav Kutner - OroCRM

The Uberflip Experience 2016: Yoav Schwartz

COMSOC’08, Liverpool, UK On the Agenda Control Problem for Knockout Tournaments Thuc Vu, Alon Altman, Yoav Shoham {thucvu, epsalon, shoham}@stanford.edu

1 On the Agenda(s) of Research on Multi-Agent Learning by Yoav Shoham and Rob Powers and Trond Grenager Learning against opponents with bounded memory

Triangulation in Random Refractive Distortionswebee.technion.ac.il/~yoav/publications/RandomStereo25_final.pdf · Triangulation in Random Refractive Distortions Marina Alterman, Yoav

Hossam Haick , Yoav Y. Broza