Two-Person General-Sum Games GAME THEORY II. A two-person general sum game is represented by two matrices and. For instance: If:

Similar documents
Overview GAME THEORY. Basic notions

Multi-player, non-zero-sum games

Section Notes 6. Game Theory. Applied Math 121. Week of March 22, understand the difference between pure and mixed strategies.

Prisoner 2 Confess Remain Silent Confess (-5, -5) (0, -20) Remain Silent (-20, 0) (-1, -1)

Lecture #3: Networks. Kyumars Sheykh Esmaili

1\2 L m R M 2, 2 1, 1 0, 0 B 1, 0 0, 0 1, 1

UPenn NETS 412: Algorithmic Game Theory Game Theory Practice. Clyde Silent Confess Silent 1, 1 10, 0 Confess 0, 10 5, 5

Session Outline. Application of Game Theory in Economics. Prof. Trupti Mishra, School of Management, IIT Bombay

Spring 2014 Quiz: 10 points Answer Key 2/19/14 Time Limit: 53 Minutes (FAS students: Teaching Assistant. Total Point Value: 10 points.

Static or simultaneous games. 1. Normal Form and the elements of the game

Chapter 30: Game Theory

Game Theory. Vincent Kubala

CMU-Q Lecture 20:

Homework 5 Answers PS 30 November 2013

Distributed Optimization and Games

Game Theory. Vincent Kubala

Mixed Strategies; Maxmin

Chapter 3 Learning in Two-Player Matrix Games

n-person Games in Normal Form

Game theory. Logic and Decision Making Unit 2

ECON 2100 Principles of Microeconomics (Summer 2016) Game Theory and Oligopoly

Computing Nash Equilibrium; Maxmin

Reading Robert Gibbons, A Primer in Game Theory, Harvester Wheatsheaf 1992.

Dominant Strategies (From Last Time)

Computational Aspects of Game Theory Bertinoro Spring School Lecture 2: Examples

Introduction. Begin with basic ingredients of a game. optimisation equilibrium. Frank Cowell: Game Theory Basics. July

Minmax and Dominance

Introduction to Game Theory

Genetic Algorithms in MATLAB A Selection of Classic Repeated Games from Chicken to the Battle of the Sexes

NORMAL FORM (SIMULTANEOUS MOVE) GAMES

1. Introduction to Game Theory

Econ 302: Microeconomics II - Strategic Behavior. Problem Set #5 June13, 2016

BS2243 Lecture 3 Strategy and game theory

Game Theory and the Environment. Game Theory and the Environment

ECO 220 Game Theory. Objectives. Agenda. Simultaneous Move Games. Be able to structure a game in normal form Be able to identify a Nash equilibrium

LECTURE 26: GAME THEORY 1

Machine Learning in Iterated Prisoner s Dilemma using Evolutionary Algorithms

Math 152: Applicable Mathematics and Computing

Japanese. Sail North. Search Search Search Search

CSC304 Lecture 2. Game Theory (Basic Concepts) CSC304 - Nisarg Shah 1

CMU Lecture 22: Game Theory I. Teachers: Gianni A. Di Caro

Distributed Optimization and Games

Nash Equilibrium. Felix Munoz-Garcia School of Economic Sciences Washington State University. EconS 503

Multiagent Systems: Intro to Game Theory. CS 486/686: Introduction to Artificial Intelligence

EC3224 Autumn Lecture #02 Nash Equilibrium

Economics II: Micro Winter 2009 Exercise session 4 Aslanyan: VŠE

Game Theory Lecturer: Ji Liu Thanks for Jerry Zhu's slides

Introduction to Game Theory

Engineering Decisions

Game Theory: Basics MICROECONOMICS. Principles and Analysis Frank Cowell

Game Theory. Wolfgang Frimmel. Dominance

The book goes through a lot of this stuff in a more technical sense. I ll try to be plain and clear about it.

RECITATION 8 INTRODUCTION

DECISION MAKING GAME THEORY

ECON 282 Final Practice Problems

Chapter 15: Game Theory: The Mathematics of Competition Lesson Plan

Introduction to (Networked) Game Theory. Networked Life NETS 112 Fall 2014 Prof. Michael Kearns

Game Theory, Continued: From Zero-Sum to Non-Zero-Sum. Problem Set 3 due on FRIDAY!

Lecture 6: Basics of Game Theory

Decision Methods for Engineers

Games. Episode 6 Part III: Dynamics. Baochun Li Professor Department of Electrical and Computer Engineering University of Toronto

Lect 15:Game Theory: the math of competition

(a) Left Right (b) Left Right. Up Up 5-4. Row Down 0-5 Row Down 1 2. (c) B1 B2 (d) B1 B2 A1 4, 2-5, 6 A1 3, 2 0, 1

Game Theory. Department of Electronics EL-766 Spring Hasan Mahmood

Game Theory: From Zero-Sum to Non-Zero-Sum. CSCI 3202, Fall 2010

Introduction to (Networked) Game Theory. Networked Life NETS 112 Fall 2016 Prof. Michael Kearns

ECO 5341 Strategic Behavior Lecture Notes 3

Finance Solutions to Problem Set #8: Introduction to Game Theory

Note: A player has, at most, one strictly dominant strategy. When a player has a dominant strategy, that strategy is a compelling choice.

Game Theory and Randomized Algorithms

Lecture Notes on Game Theory (QTM)

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Game Theory

CSCI 699: Topics in Learning and Game Theory Fall 2017 Lecture 3: Intro to Game Theory. Instructor: Shaddin Dughmi

CS510 \ Lecture Ariel Stolerman

Robustness against Longer Memory Strategies in Evolutionary Games.

Metastrategies in the Colored Trails Game

Computing optimal strategy for finite two-player games. Simon Taylor

Analyzing Games: Mixed Strategies

ECO 463. SimultaneousGames

Lecture 10: September 2

Advanced Microeconomics (Economics 104) Spring 2011 Strategic games I

Microeconomics of Banking: Lecture 4

Domination Rationalizability Correlated Equilibrium Computing CE Computational problems in domination. Game Theory Week 3. Kevin Leyton-Brown

Introduction to Game Theory. František Kopřiva VŠE, Fall 2009

Problem 1 (15 points: Graded by Shahin) Recall the network structure of our in-class trading experiment shown in Figure 1

The Success of TIT FOR TAT in Computer Tournaments

FIRST PART: (Nash) Equilibria

Lecture 13(ii) Announcements. Lecture on Game Theory. None. 1. The Simple Version of the Battle of the Sexes

Bonus Maths 5: GTO, Multiplayer Games and the Three Player [0,1] Game

EconS Game Theory - Part 1

THEORY: NASH EQUILIBRIUM

Noncooperative Games COMP4418 Knowledge Representation and Reasoning

0.1 Battle of the Sexes. 0.2 Chicken. 0.3 Coordination Game

Chapter 2 Basics of Game Theory

Game Theory Week 1. Game Theory Course: Jackson, Leyton-Brown & Shoham. Game Theory Course: Jackson, Leyton-Brown & Shoham Game Theory Week 1

Topics in Applied Mathematics

Math 611: Game Theory Notes Chetan Prakash 2012

Lecture 11 Strategic Form Games

Game Theory ( nd term) Dr. S. Farshad Fatemi. Graduate School of Management and Economics Sharif University of Technology.

Arpita Biswas. Speaker. PhD Student (Google Fellow) Game Theory Lab, Dept. of CSA, Indian Institute of Science, Bangalore

Transcription:

Two-Person General-Sum Games GAME THEORY II A two-person general sum game is represented by two matrices and. For instance: If: is the payoff to P1 and is the payoff to P2. then we have a zero-sum game. cis716-fall2003-parsons-lect09 2 We often write the payoff matrix as: where each entry in the matrix is A strategy if: " is a Nash equilibrium solution to the game $ % $ % ' ' $ $ $&% $(% # #" If both players play a Nash equilibrium strategy, then neither can unilaterally move away from the strategy and profit. The Nash equilibrium is a generalisation of the idea of a saddle point in a zero-sum game. Note that not every game has a Nash equilibrium for pure strategies. Note also that A Nash solution: need not be the best solution; and need not be a resonable solution. All the Nash equilibrium guarantees is stability. This stability amounts to protection against exploitation by the other player. There may be more than one Nash equilibrium. cis716-fall2003-parsons-lect09 3 cis716-fall2003-parsons-lect09 4

# ' # ' In the game: there are two Nash equilibrium strategies: and Clearly both players would prefer the second to the first. A good thing about Nash equilibrium, is that every two-player general-sum game has a Nash equilibrium solution. However, these need not be pure strategy solutions. To ensure we can find a Nash equilibrium, we have to look for mixed strategies. For mixed strategies, as for zero sum games, each player is looking for a probability vector. P1 is looking for: P2 is looking for: cis716-fall2003-parsons-lect09 5 cis716-fall2003-parsons-lect09 6 For a game with payoff matrices and, a mixed strategy is a Nash equilibrium solution if: In other words, gives a higher expected value to P1 than any other strategy when P2 plays. Similarly, gives a higher expected value to P2 than any other strategy when P1 plays. The Prisoner s Dilemma Two suspects have been arrested by the police and are being questioned separately. If they both say nothing, they will be sentenced to 1 year on existing evidence. If either incriminates the other while the other remains silent, the incriminator will be released, and their co-suspect will be sent to prison for 4 years,. If both incriminate each other, they will both be sentenced to 3 years. cis716-fall2003-parsons-lect09 7 cis716-fall2003-parsons-lect09 8

This gives the game: Here the Nash equilibrium strategy is. It is stable because it is not risky for either player if you confess then there is nothing your co-suspect can do to make you worse off. However, this is a much worse outcome than if both players refused to say anything. Such a strategy, though, would be very risky and so unstable. In this case, the Nash equilibrium strategy contrasts with the Pareto efficient outcome. A solution is Pareto efficient if there is no other outcome which makes one play better off and doesn t make the second player worse off. Here the solution is Pareto efficient. One way to think about this is that Nash equilibrium strategies give the best outcome for an individual. In contrast, Pareto efficient solutions give the best outcome for all players together. cis716-fall2003-parsons-lect09 9 cis716-fall2003-parsons-lect09 10 These results seems to suggest that in looking for Nash equilibrium solutions: It is natural to defect; and Defection gives sub-optimal behvaiour. Both of these are true to some extent. Sub-optimal is true if we are looking to maximise utility. If we are looking to minimise risk, then the the Nash equilibrium solution is optimal. There are situations in which it is natural to co-operate. If the game is played several times, and players have memory of what happened on the previous round then: Defection can be punished; and The sucker s payoff can be amortised. Provided that the shadow of the future is big enough, these factors encourage co-operative behaviour. In the Iterated Prisoner s Dilemma strategies like Tit for tat do well. cis716-fall2003-parsons-lect09 11 cis716-fall2003-parsons-lect09 12

There are other some other scenarios, similar to the Prisoner s Dilemma, which are interesting to consider: These are: Co-operation dominates Defection dominates The stag hunt The game of chicken. All of these use a payoff matrix that is very similar to that above. If the payoff matrix is: Then co-operation dominates If the payoff matrix is: Then defection dominates. cis716-fall2003-parsons-lect09 13 cis716-fall2003-parsons-lect09 14 The stag hunt scenario can be described by: You and a friend decide it would be a great joke to show up on the last day of school with some ridiculous haircut. Egged on by your clique, you both swear you ll get the haircut. A night of indecision follows. As you anticipate your parents and teachers reactions [... ] you start wondering if your friend is really going to go through with the plan. Not that you don t want the plan to succeed: the best possible outcome would be for both of you to get the haircut. The trouble is, it would be awful to be the only one to show up with the haircut. That would be the worst possible outcome. You re not above enjoying your friend s embarrassment. If you didn t get the haircut, but the friend did, and looked like a real jerk, that would be almost as good as if you both got the haircut. cis716-fall2003-parsons-lect09 15 cis716-fall2003-parsons-lect09 16

A payoff matrix like: describes this scenario. The difference from the prisoner s dilemma is that now it is better if you both co-operate than if you defect while the other co-operates. There are two Nash equilibrium solutions: Both co-operate Both defect The game of chicken gets its name from a rather silly, macho game that was supposedly popular amongst juvenile delinquents in 1950s America; the game was immortalised by James Dean in the 1950s film Rebel without a Cause. The purpose of the game is to establish who is bravest of the two players. cis716-fall2003-parsons-lect09 17 cis716-fall2003-parsons-lect09 18 Chicken has a payoff matrix like: The game is played by both players driving their cars at high speed towards a cliff. The idea is that the least brave of the two (the chicken ) will be the first to drop out of the game by jumping out of the speeding car. The winner is the one who lasts longest in the car. Of course, if neither player jumps out of the car, then both cars fly off the cliff, taking their foolish passengers to a fiery death on the rocks that undoubtedly lie at the foot of the cliff. co-operation is taken to be jumping out of the car. This differs from the Prisoner s Dilemma in that both defecting is the wrost possible outcome. There are two Nash equilibrium solutions: P1 co-operates and P2 defects. P2 co-operates and P1 defects. cis716-fall2003-parsons-lect09 19 cis716-fall2003-parsons-lect09 20

Summary This lecture has introduced the idea of Nash equilibrium It has considered how Nash equilibrium may be used to analyse a number of canonical games. The notion of Pareto optimality was also discussed. cis716-fall2003-parsons-lect09 21