Noncooperative Games COMP4418 Knowledge Representation and Reasoning

Similar documents
Mixed Strategies; Maxmin

Analyzing Games: Mixed Strategies

Computing Nash Equilibrium; Maxmin

Multiagent Systems: Intro to Game Theory. CS 486/686: Introduction to Artificial Intelligence

Multiagent Systems: Intro to Game Theory. CS 486/686: Introduction to Artificial Intelligence

CMU Lecture 22: Game Theory I. Teachers: Gianni A. Di Caro

Minmax and Dominance

NORMAL FORM (SIMULTANEOUS MOVE) GAMES

ESSENTIALS OF GAME THEORY

Multi-player, non-zero-sum games

Multiagent Systems: Intro to Game Theory. CS 486/686: Introduction to Artificial Intelligence

Domination Rationalizability Correlated Equilibrium Computing CE Computational problems in domination. Game Theory Week 3. Kevin Leyton-Brown

UPenn NETS 412: Algorithmic Game Theory Game Theory Practice. Clyde Silent Confess Silent 1, 1 10, 0 Confess 0, 10 5, 5

Design of intelligent surveillance systems: a game theoretic case. Nicola Basilico Department of Computer Science University of Milan

CSCI 699: Topics in Learning and Game Theory Fall 2017 Lecture 3: Intro to Game Theory. Instructor: Shaddin Dughmi

Game Theory Intro. Lecture 3. Game Theory Intro Lecture 3, Slide 1

Self-interested agents What is Game Theory? Example Matrix Games. Game Theory Intro. Lecture 3. Game Theory Intro Lecture 3, Slide 1

CS510 \ Lecture Ariel Stolerman

CSC304 Lecture 2. Game Theory (Basic Concepts) CSC304 - Nisarg Shah 1

Game Theory. Lecture Notes By Y. Narahari. Department of Computer Science and Automation Indian Institute of Science Bangalore, India August 2012

Game Theory Week 1. Game Theory Course: Jackson, Leyton-Brown & Shoham. Game Theory Course: Jackson, Leyton-Brown & Shoham Game Theory Week 1

Games of Perfect Information and Backward Induction

Reading Robert Gibbons, A Primer in Game Theory, Harvester Wheatsheaf 1992.

1. Introduction to Game Theory

Note: A player has, at most, one strictly dominant strategy. When a player has a dominant strategy, that strategy is a compelling choice.

Advanced Microeconomics (Economics 104) Spring 2011 Strategic games I

FIRST PART: (Nash) Equilibria

(a) Left Right (b) Left Right. Up Up 5-4. Row Down 0-5 Row Down 1 2. (c) B1 B2 (d) B1 B2 A1 4, 2-5, 6 A1 3, 2 0, 1

EC3224 Autumn Lecture #02 Nash Equilibrium

ECO 220 Game Theory. Objectives. Agenda. Simultaneous Move Games. Be able to structure a game in normal form Be able to identify a Nash equilibrium

Computational Methods for Non-Cooperative Game Theory

Game Theory. Wolfgang Frimmel. Dominance

Normal Form Games: A Brief Introduction

Multiple Agents. Why can t we all just get along? (Rodney King)

A short introduction to Security Games

Economics 201A - Section 5

Introduction to IO. Introduction to IO

Genetic Algorithms in MATLAB A Selection of Classic Repeated Games from Chicken to the Battle of the Sexes

Lecture #3: Networks. Kyumars Sheykh Esmaili

Game Theory: Normal Form Games

Introduction to Game Theory

Lecture 10: September 2

Mixed strategy Nash equilibrium

Game Theory. Vincent Kubala

Backward Induction and Stackelberg Competition

Game Theory. Vincent Kubala

ECON 282 Final Practice Problems

Homework 5 Answers PS 30 November 2013

LECTURE 26: GAME THEORY 1

Simultaneous-Move Games: Mixed Strategies. Games Of Strategy Chapter 7 Dixit, Skeath, and Reiley

Game Theory ( nd term) Dr. S. Farshad Fatemi. Graduate School of Management and Economics Sharif University of Technology.

Exercises for Introduction to Game Theory SOLUTIONS

Japanese. Sail North. Search Search Search Search

Prisoner 2 Confess Remain Silent Confess (-5, -5) (0, -20) Remain Silent (-20, 0) (-1, -1)

n-person Games in Normal Form

Games. Episode 6 Part III: Dynamics. Baochun Li Professor Department of Electrical and Computer Engineering University of Toronto

Lecture 13(ii) Announcements. Lecture on Game Theory. None. 1. The Simple Version of the Battle of the Sexes

Introduction to (Networked) Game Theory. Networked Life NETS 112 Fall 2016 Prof. Michael Kearns

DECISION MAKING GAME THEORY

Game Theory and MANETs: A Brief Tutorial

1. Simultaneous games All players move at same time. Represent with a game table. We ll stick to 2 players, generally A and B or Row and Col.

Microeconomics of Banking: Lecture 4

Chapter 3 Learning in Two-Player Matrix Games

Advanced Microeconomics: Game Theory

Game Theory: The Basics. Theory of Games and Economics Behavior John Von Neumann and Oskar Morgenstern (1943)

Game Theory Lecturer: Ji Liu Thanks for Jerry Zhu's slides

Adversarial Search and Game Theory. CS 510 Lecture 5 October 26, 2017

Introduction to Game Theory I

Introduction to (Networked) Game Theory. Networked Life NETS 112 Fall 2014 Prof. Michael Kearns

PARALLEL NASH EQUILIBRIA IN BIMATRIX GAMES ISAAC ELBAZ CSE633 FALL 2012 INSTRUCTOR: DR. RUSS MILLER

Game Theory and Randomized Algorithms

ECO 463. SimultaneousGames

On Strictly Competitive Multi-Player Games

The extensive form representation of a game

ECON 2100 Principles of Microeconomics (Summer 2016) Game Theory and Oligopoly

Repeated Games. ISCI 330 Lecture 16. March 13, Repeated Games ISCI 330 Lecture 16, Slide 1

Sequential games. We may play the dating game as a sequential game. In this case, one player, say Connie, makes a choice before the other.

CMU-Q Lecture 20:

ECO 199 B GAMES OF STRATEGY Spring Term 2004 B February 24 SEQUENTIAL AND SIMULTANEOUS GAMES. Representation Tree Matrix Equilibrium concept

Imperfect Information Extensive Form Games

INTRODUCTION TO GAME THEORY

Arpita Biswas. Speaker. PhD Student (Google Fellow) Game Theory Lab, Dept. of CSA, Indian Institute of Science, Bangalore

Problem 1 (15 points: Graded by Shahin) Recall the network structure of our in-class trading experiment shown in Figure 1

Repeated Games. Economics Microeconomic Theory II: Strategic Behavior. Shih En Lu. Simon Fraser University (with thanks to Anke Kessler)

Introduction to Game Theory

Distributed Optimization and Games

1\2 L m R M 2, 2 1, 1 0, 0 B 1, 0 0, 0 1, 1

CPS 570: Artificial Intelligence Game Theory

Dominance and Best Response. player 2

NORMAL FORM GAMES: invariance and refinements DYNAMIC GAMES: extensive form

International Economics B 2. Basics in noncooperative game theory

Backward Induction. ISCI 330 Lecture 14. March 1, Backward Induction ISCI 330 Lecture 14, Slide 1

CSC304 Lecture 3. Game Theory (More examples, PoA, PoS) CSC304 - Nisarg Shah 1

Math 464: Linear Optimization and Game

Spring 2014 Quiz: 10 points Answer Key 2/19/14 Time Limit: 53 Minutes (FAS students: Teaching Assistant. Total Point Value: 10 points.

ECO 5341 Strategic Behavior Lecture Notes 3

The Game Theory of Game Theory Ruben R. Puentedura, Ph.D.

Section Notes 6. Game Theory. Applied Math 121. Week of March 22, understand the difference between pure and mixed strategies.

EconS Game Theory - Part 1

Chapter 15: Game Theory: The Mathematics of Competition Lesson Plan

Transcription:

Noncooperative Games COMP4418 Knowledge Representation and Reasoning Abdallah Saffidine 1 1 abdallah.saffidine@gmail.com slides design: Haris Aziz Semester 2, 2017 Abdallah Saffidine (UNSW) Noncooperative Games Semester 2, 2017 1 / 36

Outline 1 Matrix Form Games 2 Best response and Nash equilibrium 3 Mixed Strategies 4 Further Reading Abdallah Saffidine (UNSW) Noncooperative Games Semester 2, 2017 2 / 36

Outline 1 Matrix Form Games 2 Best response and Nash equilibrium 3 Mixed Strategies 4 Further Reading Abdallah Saffidine (UNSW) Noncooperative Games Semester 2, 2017 3 / 36

Prisoner s Dilemma Both prisoners benefit if they cooperate. If one prisoner defects and the other does not, then the defecting prisoner gets out free! cooperate defect cooperate 2,2 0,3 defect 3,0 1,1 Abdallah Saffidine (UNSW) Noncooperative Games Semester 2, 2017 4 / 36

Setup An n-player game (N, A, u) consists of Set of players N = {1,..., n} A = A 1 A n where A i is the action set of player i a A is an action profile. u = (u 1,..., u n) specifies a utility function u i : A R for each player. Abdallah Saffidine (UNSW) Noncooperative Games Semester 2, 2017 5 / 36

Bimatrix (2-player) Games a 1 2 a 2 2 a 1 1 u 1 (a 1 1, a 1 2), u 2 (a 1 1, a 1 2) u 1 (a 1 1, a 2 2), u 2 (a 1 1, a 2 2) a 2 1 u 1 (a 2 1, a 1 2), u 2 (a 2 1, a 1 2) u 1 (a 2 1, a 2 2), u 2 (a 2 1, a 2 2) Actions of player 1=A 1 = {a 1 1, a 2 1}. Actions of player 2=A 2 = {a 1 2, a 2 2}. Abdallah Saffidine (UNSW) Noncooperative Games Semester 2, 2017 6 / 36

Prisoner s Dilemma Both prisoners benefit if they cooperate. If one prisoner defects and the other does not, then the defecting prisoner gets out free! cooperate defect cooperate 2,2 0,3 defect 3,0 1,1 Abdallah Saffidine (UNSW) Noncooperative Games Semester 2, 2017 7 / 36

Penalty Shootout Player 1 (Goal-keeper) wants to match; Player 2 (penalty taker) does not want to match. Left Right Left +1,-1-1,+1 Right -1,+1 +1,-1 Abdallah Saffidine (UNSW) Noncooperative Games Semester 2, 2017 8 / 36

Zero Sum Games In zero-sum games, there are two players and for all action profiles a A, u 1 (a) + u 2 (a) = 0. Example Left Right Left +1,-1-1,+1 Right -1,+1 +1,-1 Heads Tails Heads 1-1 Tails -1 1 Abdallah Saffidine (UNSW) Noncooperative Games Semester 2, 2017 9 / 36

Rock-Paper-Scissors Both players draw if they have the same action. Otherwise, playing Scissor wins against Paper, playing Paper wins against Rock, and playing Rock wins against Scissors. Rock Paper Scissors Rock 0-1 1 Paper 1 0-1 Scissors -1 1 0 Abdallah Saffidine (UNSW) Noncooperative Games Semester 2, 2017 10 / 36

Battle of the Sexes Player 1 (wife) prefers Ballet over Football. Player 2 (husband) prefers Football over Ballet. Both prefer being together than going alone. Ballet Football Ballet 2,1 0,0 Football 0,0 1,2 Abdallah Saffidine (UNSW) Noncooperative Games Semester 2, 2017 11 / 36

Pareto Optimality One outcome o Pareto dominates another outcome o if o all players prefer o at least as much as o and at least one player strictly prefers o to o. Each game admits at least one Pareto optimal outcome. Abdallah Saffidine (UNSW) Noncooperative Games Semester 2, 2017 12 / 36

Outline 1 Matrix Form Games 2 Best response and Nash equilibrium 3 Mixed Strategies 4 Further Reading Abdallah Saffidine (UNSW) Noncooperative Games Semester 2, 2017 13 / 36

Best Response Let a i = (a 1,..., a i 1, a i+1,..., a n ). Definition (Best Response) iff a i BR(a i ) a i A i, u i (a i, a i ) u i (a i, a i ) The best response of a player gives the player maximum possible utility. Abdallah Saffidine (UNSW) Noncooperative Games Semester 2, 2017 14 / 36

Nash Equilibrium Let a i = (a 1,..., a i 1, a i+1,..., a n ). Definition (Best Response) a = (a 1,..., a n ) is a (pure) Nash equilibrium iff i, a i BR(a i ). A Nash equilibrium is an action profile in which each player plays a best response. Abdallah Saffidine (UNSW) Noncooperative Games Semester 2, 2017 15 / 36

Battle of the Sexes: Pure Nash Equilibria Ballet Football Ballet 2,1 0,0 Football 0,0 1,2 What are the pure Nash equilibria of the game? Abdallah Saffidine (UNSW) Noncooperative Games Semester 2, 2017 16 / 36

Battle of the Sexes: Pure Nash Equilibria Ballet Football Ballet 2,1 0,0 Football 0,0 1,2 Pure Nash equilibria: (Ballet, Ballet) (Football, Football) Abdallah Saffidine (UNSW) Noncooperative Games Semester 2, 2017 17 / 36

Prisoner s Dilemma cooperate defect cooperate 2,2 0,3 defect 3,0 1,1 What are the pure Nash equilibria of the game? Abdallah Saffidine (UNSW) Noncooperative Games Semester 2, 2017 18 / 36

Prisoner s Dilemma cooperate defect cooperate 2,2 0,3 defect 3,0 1,1 The only Nash equilibrium is (defect, defect). The outcome of (defect,defect) is Pareto dominated by the outcome of (cooperate, cooperate). Abdallah Saffidine (UNSW) Noncooperative Games Semester 2, 2017 19 / 36

Penalty Shootout Left Right Left 1-1 Right -1 1 What are the pure Nash equilibria of the game? Abdallah Saffidine (UNSW) Noncooperative Games Semester 2, 2017 20 / 36

Penalty Shootout Left Right Left 1-1 Right -1 1 What are the pure Nash equilibria of the game? A pure Nash equilibrium may not exist. Abdallah Saffidine (UNSW) Noncooperative Games Semester 2, 2017 21 / 36

Complexity of a Computing a Pure Nash Equilibrium Let us assume there are n players and each player has m actions. for each of the m n possible action profiles, check whether some some player out of the n player has a different action among the m actions that gives more utility. Total number of steps: O(m n mn) = O(m n+1 n) Abdallah Saffidine (UNSW) Noncooperative Games Semester 2, 2017 22 / 36

Outline 1 Matrix Form Games 2 Best response and Nash equilibrium 3 Mixed Strategies 4 Further Reading Abdallah Saffidine (UNSW) Noncooperative Games Semester 2, 2017 23 / 36

Playing pure actions may not be a good idea Example (Penalty Shootout) Left Right Left 1-1 Right -1 1 Abdallah Saffidine (UNSW) Noncooperative Games Semester 2, 2017 24 / 36

Mixed Strategies Recall that the possible set of pure actions of each player i N is A i. A pure strategy is one in which exactly one action is played with probability one. A mixed strategy: more than one action is played with non-zero probability. The set of strategies for player i is S i = (A i ) where (A i ) is the set of probability distributions over A i. The set of all strategy profiles is S = S 1 S n. Abdallah Saffidine (UNSW) Noncooperative Games Semester 2, 2017 25 / 36

Mixed Strategies We want to analyze the payoff of players under a mixed strategy profile: u i = u i (a)p r(a s) a A P r(a s) = s j (a j ) j N Abdallah Saffidine (UNSW) Noncooperative Games Semester 2, 2017 26 / 36

Mixed Strategies We want to analyze the payoff of players under a mixed strategy profile: Example (Penalty Shootout) u i = a A u i (a)p r(a s) P r(a s) = s j (a j ) j N Left Right Left 1-1 Right -1 1 Consider the following strategy profile Player 1 plays Left with probability 0.1 and Right with probability 0.9. Player 2 players Left with probability 0.1 and Right with probability 0.9. Question: What is the utility of player 1 under the strategy profile? Abdallah Saffidine (UNSW) Noncooperative Games Semester 2, 2017 27 / 36

Mixed Strategies We want to analyze the payoff of players under a mixed strategy profile: Example (Penalty Shootout) u i = a A u i (a)p r(a s) P r(a s) = s j (a j ) j N Left Right Left 1-1 Right -1 1 Consider the following strategy profile Player 1 plays Left with probability 0.1 and Right with probability 0.9. Player 2 players Left with probability 0.1 and Right with probability 0.9. Then u 1 = (0.1 0.1)1 + (0.1 0.9)( 1) + (0.9 0.1)( 1) + (0.9 0.9)(1) = 0.01 0.09 0.09 + 0.81 = 0.64. Abdallah Saffidine (UNSW) Noncooperative Games Semester 2, 2017 28 / 36

Mixed Strategies Definition (Best Response) Best response: s i BR(s i) iff s i S i, u i (s i, s i) u i (s i, s i ). The best response of a player gives the player maximum possible utility. Definition (Nash equilibrium) s = (s 1,..., s n ) is a Nash equilibrium iff i N, s i BR(s i ). A Nash equilibrium is an action profile in which each player plays a best response. Abdallah Saffidine (UNSW) Noncooperative Games Semester 2, 2017 29 / 36

Nash s Theorem Theorem (Nash s Theorem) A mixed Nash equilibrium always exists. Abdallah Saffidine (UNSW) Noncooperative Games Semester 2, 2017 30 / 36

Battle of the Sexes Ballet Football Ballet 2,1 0,0 Football 0,0 1,2 Abdallah Saffidine (UNSW) Noncooperative Games Semester 2, 2017 31 / 36

Battle of the Sexes Ballet Football Ballet 2,1 0,0 Football 0,0 1,2 Let us assume that both players play their full support. Player 2 plays B with p and F with probability 1 p. Player 1 must be indifferent between the actions it plays. 2(p) + 0(1 p) = 0p + 1(1 p) p = 1/3. Player 1 plays B with q and F with probability 1 q Player 2 must be indifferent between the actions it plays. 1(q) + 0(1 q) = 0q + 2(1 q) q = 1/3. Thus the mixed strategies (2/3, 1/3), (1/3, 2/3) are in Nash equilibrium. Abdallah Saffidine (UNSW) Noncooperative Games Semester 2, 2017 32 / 36

Support Enumeration Algorithm For 2-player games, a support profile can be checked for Nash equilibria as follows: s i (a i )u i (a i, a i ) = U a i A i s i (a i )u i (a i, a i ) U a i A i s i (a i ) 0 s i (a i ) = 0 a i A i s i (a i ) = 1 i N, a i B i i N, a i / B i i N, a i B i i N, a i / B i When there are more than two players, the constraints are not linear. Abdallah Saffidine (UNSW) Noncooperative Games Semester 2, 2017 33 / 36

Complexity of Computing Nash Equilibrium PPAD (Polynomial Parity Arguments on Directed graphs) is a complexity class of computational problems for which a solution always exists because of a parity argument on directed graphs. The class PPAD introduced by Christos Papadimitriou in 1994. Representative PPAD problem: Given an exponential-size directed graph with no isolated nodes and with every node having in-degree and out-degree at most one described by a polynomial-time computable function f(v) that outputs the predecessor and successor of v, and a node s with degree 1, find a t s that is either a source or a sink. Theorem (Daskalakis et al., Chen & Deng; 2005) The problem of finding a Nash equilibrium is PPAD-complete. It is believed that P is not equivalent to PPAD. PPAD-hardness is viewed as evidence that the problem does not admit an efficient algorithm. Abdallah Saffidine (UNSW) Noncooperative Games Semester 2, 2017 34 / 36

Outline 1 Matrix Form Games 2 Best response and Nash equilibrium 3 Mixed Strategies 4 Further Reading Abdallah Saffidine (UNSW) Noncooperative Games Semester 2, 2017 35 / 36

Reading K. Leyton-Brown and Y. Shoham, Essentials of Game Theory: A Concise Multidisciplinary Introduction. Synthesis Lectures on Artificial Intelligence and Machine Learning. Morgan & Claypool Publishers, 2008. www.gtessentials.org Y. Shoham and K. Leyton-Brown. Multiagent Systems: Algorithmic, Game-Theoretic, and Logical Foundations. 2009. http://www.masfoundations.org Abdallah Saffidine (UNSW) Noncooperative Games Semester 2, 2017 36 / 36