Prisoner 2 Confess Remain Silent Confess (-5, -5) (0, -20) Remain Silent (-20, 0) (-1, -1)

Similar documents
UPenn NETS 412: Algorithmic Game Theory Game Theory Practice. Clyde Silent Confess Silent 1, 1 10, 0 Confess 0, 10 5, 5

Session Outline. Application of Game Theory in Economics. Prof. Trupti Mishra, School of Management, IIT Bombay

Chapter 15: Game Theory: The Mathematics of Competition Lesson Plan

Multi-player, non-zero-sum games

Distributed Optimization and Games

Static or simultaneous games. 1. Normal Form and the elements of the game

1\2 L m R M 2, 2 1, 1 0, 0 B 1, 0 0, 0 1, 1

Lect 15:Game Theory: the math of competition

Lecture #3: Networks. Kyumars Sheykh Esmaili

The book goes through a lot of this stuff in a more technical sense. I ll try to be plain and clear about it.

THEORY: NASH EQUILIBRIUM

CMU-Q Lecture 20:

Finance Solutions to Problem Set #8: Introduction to Game Theory

Distributed Optimization and Games

ESSENTIALS OF GAME THEORY

1. Simultaneous games All players move at same time. Represent with a game table. We ll stick to 2 players, generally A and B or Row and Col.

Econ 302: Microeconomics II - Strategic Behavior. Problem Set #5 June13, 2016

Section Notes 6. Game Theory. Applied Math 121. Week of March 22, understand the difference between pure and mixed strategies.

Chapter 3 Learning in Two-Player Matrix Games

Normal Form Games: A Brief Introduction

FIRST PART: (Nash) Equilibria

Math 152: Applicable Mathematics and Computing

Nash Equilibrium. Felix Munoz-Garcia School of Economic Sciences Washington State University. EconS 503

CPS 570: Artificial Intelligence Game Theory

NORMAL FORM (SIMULTANEOUS MOVE) GAMES

LECTURE 26: GAME THEORY 1

(a) Left Right (b) Left Right. Up Up 5-4. Row Down 0-5 Row Down 1 2. (c) B1 B2 (d) B1 B2 A1 4, 2-5, 6 A1 3, 2 0, 1

Lecture 6: Basics of Game Theory

Games. Episode 6 Part III: Dynamics. Baochun Li Professor Department of Electrical and Computer Engineering University of Toronto

16.410/413 Principles of Autonomy and Decision Making

CMU Lecture 22: Game Theory I. Teachers: Gianni A. Di Caro

Genetic Algorithms in MATLAB A Selection of Classic Repeated Games from Chicken to the Battle of the Sexes

Introduction to Game Theory I

ECON 2100 Principles of Microeconomics (Summer 2016) Game Theory and Oligopoly

Chapter 30: Game Theory

Introduction to (Networked) Game Theory. Networked Life NETS 112 Fall 2014 Prof. Michael Kearns

Introduction to (Networked) Game Theory. Networked Life NETS 112 Fall 2016 Prof. Michael Kearns

Game theory. Logic and Decision Making Unit 2

Topics in Applied Mathematics

1. Introduction to Game Theory

ECO 220 Game Theory. Objectives. Agenda. Simultaneous Move Games. Be able to structure a game in normal form Be able to identify a Nash equilibrium

Two-Person General-Sum Games GAME THEORY II. A two-person general sum game is represented by two matrices and. For instance: If:

Math 464: Linear Optimization and Game

Lecture 13(ii) Announcements. Lecture on Game Theory. None. 1. The Simple Version of the Battle of the Sexes

Spring 2014 Quiz: 10 points Answer Key 2/19/14 Time Limit: 53 Minutes (FAS students: Teaching Assistant. Total Point Value: 10 points.

RECITATION 8 INTRODUCTION

Reading Robert Gibbons, A Primer in Game Theory, Harvester Wheatsheaf 1992.

Arpita Biswas. Speaker. PhD Student (Google Fellow) Game Theory Lab, Dept. of CSA, Indian Institute of Science, Bangalore

Computational Aspects of Game Theory Bertinoro Spring School Lecture 2: Examples

1 Deterministic Solutions

[ Game Theory ] A short primer

Game Theory Refresher. Muriel Niederle. February 3, A set of players (here for simplicity only 2 players, all generalized to N players).

What is... Game Theory? By Megan Fava

Game Theory and the Environment. Game Theory and the Environment

Game Theory. Department of Electronics EL-766 Spring Hasan Mahmood

Lecture 3: Nash Equilibrium

DECISION MAKING GAME THEORY

Grade 7/8 Math Circles. February 14 th /15 th. Game Theory. If they both confess, they will both serve 5 hours of detention.

Exercises for Introduction to Game Theory SOLUTIONS

Dominant and Dominated Strategies

ECO 5341 Strategic Behavior Lecture Notes 3

Game Theory Lecturer: Ji Liu Thanks for Jerry Zhu's slides

Solution Concepts 4 Nash equilibrium in mixed strategies

Appendix A A Primer in Game Theory

Lab: Prisoner s Dilemma

CS510 \ Lecture Ariel Stolerman

Introduction to Game Theory

Strategic Form Games

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Game Theory

First Prev Next Last Go Back Full Screen Close Quit. Game Theory. Giorgio Fagiolo

Introduction to Game Theory

Introduction. Begin with basic ingredients of a game. optimisation equilibrium. Frank Cowell: Game Theory Basics. July

Game Theory: Basics MICROECONOMICS. Principles and Analysis Frank Cowell

Finite games: finite number of players, finite number of possible actions, finite number of moves. Canusegametreetodepicttheextensiveform.

Noncooperative Games COMP4418 Knowledge Representation and Reasoning

Lecture 10: September 2

ECON 282 Final Practice Problems

The extensive form representation of a game

Multiagent Systems: Intro to Game Theory. CS 486/686: Introduction to Artificial Intelligence

WHEN PRISONERS ENTER BATTLE: NATURAL CONNECTIONS IN 2 X 2 SYMMETRIC GAMES. by: Sarah Heilig

Game Theory: Introduction. Game Theory. Game Theory: Applications. Game Theory: Overview

Game Theory: The Basics. Theory of Games and Economics Behavior John Von Neumann and Oskar Morgenstern (1943)

n-person Games in Normal Form

Non-Cooperative Game Theory

PARALLEL NASH EQUILIBRIA IN BIMATRIX GAMES ISAAC ELBAZ CSE633 FALL 2012 INSTRUCTOR: DR. RUSS MILLER

Dominant and Dominated Strategies

Microeconomics of Banking: Lecture 4

Simultaneous-Move Games: Mixed Strategies. Games Of Strategy Chapter 7 Dixit, Skeath, and Reiley

ECO 199 B GAMES OF STRATEGY Spring Term 2004 B February 24 SEQUENTIAL AND SIMULTANEOUS GAMES. Representation Tree Matrix Equilibrium concept

Mixed Strategies; Maxmin

CSCI 699: Topics in Learning and Game Theory Fall 2017 Lecture 3: Intro to Game Theory. Instructor: Shaddin Dughmi

A Brief Introduction to Game Theory

Adversarial Search and Game Theory. CS 510 Lecture 5 October 26, 2017

EC3224 Autumn Lecture #02 Nash Equilibrium

Economics II: Micro Winter 2009 Exercise session 4 Aslanyan: VŠE

ECO 463. SimultaneousGames

BS2243 Lecture 3 Strategy and game theory

player 1 x>y>z x>z>y y>x>z y>z>x z>y>x z>x>y not aggressive aggressive most aggressive

Game Theory Week 1. Game Theory Course: Jackson, Leyton-Brown & Shoham. Game Theory Course: Jackson, Leyton-Brown & Shoham Game Theory Week 1

Game Theory ( nd term) Dr. S. Farshad Fatemi. Graduate School of Management and Economics Sharif University of Technology.

Transcription:

Session 14 Two-person non-zero-sum games of perfect information The analysis of zero-sum games is relatively straightforward because for a player to maximize its utility is equivalent to minimizing the winnings of the other player. We will now move away from these confrontational games and consider games where the interests of the players are at least partially aligned (i.e., they can benefit without necessarily making their opponents worse off). This type of games sometimes leads to unexpected conclusions; in particular, we might need to stop thinking about Nash equilibria as providing optimal solutions for the game. To motivate our move, consider the famous prisoner dilemma (do you ever watch Law & Order on TV?) Two men suspected of committing a crime together are arrested and placed in separate interrogation rooms. Each suspect my either confess or remain silent, and each one knows the consequences of his actions. If one suspect confesses and his partner does not, the one who confessed turns state s evidence and goes free while the other goes to jail for twenty years. On the other hand, if both suspects confess, the both of them go to jail for five years. Finally, if both suspects remain silent, they both go to jail for a year for a lesser charge. Assuming that each criminal only cares for his of her own well-being, the payoffs can be summarized in the following table: Prisoner 1 Prisoner 2 Confess Remain Silent Confess (-5, -5) (0, -20) Remain Silent (-20, 0) (-1, -1) Note that the sum of the payoffs is not constant (it is -10 if both confess but -2 if both remain silent). Hence, this is not a zero-sum game. From a cursory examination of the table, it would seem like remaining silent is the optimal solution for the game (at least from the point of view of the aggregate number of years that the criminals will spend in jail). However, can this solution is not an equilibrium, and it is unlikely that players will adopt such strategy. To see why, let s consider the set of best responses for each player.

If prisoner 2 confesses, remains silent, prisoner 1 should confess confess Hence, confessing is a dominant strategy for prisoner 1 (and, by the symmetry of the game, for prisoner 2 as well). This can seem somewhat contradictory at first sight. The outcome that involves both prisoners confessing is an equilibrium, as no player has a unilateral incentive to change his or her behavior if he/she knows that the other player will confess. However, this is clearly a stupid strategy for both players because both would be better off if they could coordinate their actions so that both remained silent. This type of paradoxes (where Nash equilibria are not necessarily good solutions to the game) cannot arise in zero-sum games, but are very common in non-zero sum games. They arise because in non-zero-sum games details such as the order of play and the ability of the players to communicate, make binding agreements or set side payments, can have a big effect on the outcome of the game. As a second example, consider the following game and assume that no communication, agreements, or wealth transfer are allowed: Player 1 Player 2 A B a (0, 0) (10, 5) b (5, 10) (0, 0) As with all other games, let s consider the set of best responses for each player. From player s 1 perspective:

If player 2 A, B, player 1 should b a Similarly, from the point of view of player 2: If player 1 a, b, player 2 should B A No strategy is dominant or dominated in this example. However, we can easily argue that the pairs (A, b) and (B, a) are Nash equilibria for this game. For the first pair, note that A is the best response for b, and that b is also the best response for A. Hence, there is no incentive for unilateral changes of strategies. A similar argument can be made for the pair (B, a). In addition to these pure strategy equilibria, the game admits a mixed strategy equilibrium. Let p be the probability that player 2 plays A (so that the probability that he/she will play B is 1 p). The expected payoffs for player 1 are: If Player 1 plays a, b, the expected value of the game for Player 1 is 0 p +10 (1 p) =10 10p 5 p + 0 (1 p) = 5p Hence, for player 1 to be indifferent among any action performed by player 2 we need to have 5p =10 10p 15 p =10 p = 10 15 = 2 3 and the expected payoff for player 1 is 10p = 10 3. Now, let q be the probability that player 1 plays a, then

If Player 2 plays A, B, the expected value of the game for Player 2 is 0 q +10 (1 q) =10 10q 5 q + 0 (1 q) = 5q Hence, using the same argument, q = 2 3, and the expected payoff for player 2 is also 10 3. Note that this is really a Nash equilibrium for the game. If player 1 plays a with probability 2/3, there is nothing that player 2 can do to improve his/her expected utility over the one he/she would get by playing A 2/3 of the time, and viceversa. Unlike the pure-strategy equilibria, the mixed-strategy equilibrium is fair (in the sense that the payoff for both players is the same, 10 3 ). However, the expected payoff of 10 3 for each player is still well below the payoffs that players could obtain by cooperating and moving to either (a, B) or (b, A). Indeed, the optimal outcome (if communication and transfers were allowed), would be for both to concentrate on one of those two options and the player with the highest payoff transferring 2.5 units to the other player so that both make a benefit of 7.5 units! Finally, let s consider another example, often called the game of chicken. The name has its origins in a game in which two drivers drive towards each other on a collision course: one must swerve, or both may die in the crash, but if one driver swerves and the other does not, the one who swerved will be called a "chicken" (ever seem Rebel without a cause?). A payoff matrix associated with this game would correspond to: Player 1 Player 2 Swerve Straight Swerve (0, 0) (-1, +1) Straight (+1, -1) (-100, -100) The payoff of -100 for each player in case of a collision is meant to represent a big loss (at least, when compared against the small profit/loss made when one player swerves and the other goes straight). The game of chicken has been used to model a number of

real-life situations, including the doctrine of mutually assured destruction (or, for that matter, of the current standoff between republicans and democrats in congress!!!). Let s consider the best responses from each player. From the perspective of player 1: If player 2 swerves, goes straight, player 1 should go straight. swerve (better chicken that dead!) Similarly, from the point of view of player 2: If player 1 swerves, goes straight, player 2 should go straight. swerve (better chicken that dead!). Hence, the pair of strategies corresponding to (swerve, go straight) and (go straight, swerve) are Nash equilibria. Indeed, if Player 1 knows that Player 2 will go straight, his optimal strategy is to swerve, and if Player 2 knows that Player 1 will swerve, then his best response if to go straight. However, these equilibria might not arise in real life because they imply that everybody will know who the chicken is! The game also admits a mixed-strategy equilibrium, which corresponds to the players swerving 99% of the time and going straight 1%. Indeed, if we let p be the probability that player 2 swerves, the game from the point of view of player 1 looks like If Player 1 swerves, goes straight, the expected value of the game for Player 1 is 0 p +( 1) (1 p) = p 1 1 p + ( 100) (1 p) =101p 100 Since the utilities need to be same for both options, p 1 =101p 100 99 =100 p p = 99 100

Because of the symmetry of the game, the other player must follow the same strategy. The expected value of the game is 1 p =1 99 100 = 1 100 for each of the players. These mixed strategy equilibrium seems like a most plausible solution to the problem.