Minmax and Dominance

Similar documents
Mixed Strategies; Maxmin

Computing Nash Equilibrium; Maxmin

Domination Rationalizability Correlated Equilibrium Computing CE Computational problems in domination. Game Theory Week 3. Kevin Leyton-Brown

Analyzing Games: Mixed Strategies

CSC304 Lecture 2. Game Theory (Basic Concepts) CSC304 - Nisarg Shah 1

Normal Form Games: A Brief Introduction

1. Introduction to Game Theory

CS510 \ Lecture Ariel Stolerman

Section Notes 6. Game Theory. Applied Math 121. Week of March 22, understand the difference between pure and mixed strategies.

Chapter 3 Learning in Two-Player Matrix Games

Lecture 7: Dominance Concepts

Game Theory. Wolfgang Frimmel. Dominance

Game Theory Lecturer: Ji Liu Thanks for Jerry Zhu's slides

Advanced Microeconomics (Economics 104) Spring 2011 Strategic games I

Noncooperative Games COMP4418 Knowledge Representation and Reasoning

Multiagent Systems: Intro to Game Theory. CS 486/686: Introduction to Artificial Intelligence

Multiagent Systems: Intro to Game Theory. CS 486/686: Introduction to Artificial Intelligence

Multiagent Systems: Intro to Game Theory. CS 486/686: Introduction to Artificial Intelligence

EC3224 Autumn Lecture #02 Nash Equilibrium

SF2972 GAME THEORY Normal-form analysis II

LECTURE 26: GAME THEORY 1

UPenn NETS 412: Algorithmic Game Theory Game Theory Practice. Clyde Silent Confess Silent 1, 1 10, 0 Confess 0, 10 5, 5

Reading Robert Gibbons, A Primer in Game Theory, Harvester Wheatsheaf 1992.

Introduction to Game Theory

Note: A player has, at most, one strictly dominant strategy. When a player has a dominant strategy, that strategy is a compelling choice.

Lecture 6: Basics of Game Theory

Distributed Optimization and Games

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Game Theory

Design of intelligent surveillance systems: a game theoretic case. Nicola Basilico Department of Computer Science University of Milan

CMU Lecture 22: Game Theory I. Teachers: Gianni A. Di Caro

Extensive Form Games: Backward Induction and Imperfect Information Games

CMU-Q Lecture 20:

Lecture #3: Networks. Kyumars Sheykh Esmaili

ECO 463. SimultaneousGames

Repeated Games. Economics Microeconomic Theory II: Strategic Behavior. Shih En Lu. Simon Fraser University (with thanks to Anke Kessler)

ECON 282 Final Practice Problems

(a) Left Right (b) Left Right. Up Up 5-4. Row Down 0-5 Row Down 1 2. (c) B1 B2 (d) B1 B2 A1 4, 2-5, 6 A1 3, 2 0, 1

Math 464: Linear Optimization and Game

Repeated Games. ISCI 330 Lecture 16. March 13, Repeated Games ISCI 330 Lecture 16, Slide 1

NORMAL FORM (SIMULTANEOUS MOVE) GAMES

Game Theory and Randomized Algorithms

Game Theory. Lecture Notes By Y. Narahari. Department of Computer Science and Automation Indian Institute of Science Bangalore, India August 2012

Extensive Form Games: Backward Induction and Imperfect Information Games

Games. Episode 6 Part III: Dynamics. Baochun Li Professor Department of Electrical and Computer Engineering University of Toronto

Game Theory and Algorithms Lecture 3: Weak Dominance and Truthfulness

Econ 302: Microeconomics II - Strategic Behavior. Problem Set #5 June13, 2016

Game theory attempts to mathematically. capture behavior in strategic situations, or. games, in which an individual s success in

Homework 5 Answers PS 30 November 2013

Game Theory: The Basics. Theory of Games and Economics Behavior John Von Neumann and Oskar Morgenstern (1943)

CSCI 699: Topics in Learning and Game Theory Fall 2017 Lecture 3: Intro to Game Theory. Instructor: Shaddin Dughmi

Introduction to Algorithms / Algorithms I Lecturer: Michael Dinitz Topic: Algorithms and Game Theory Date: 12/4/14

Introduction to Game Theory I

Rationality and Common Knowledge

FIRST PART: (Nash) Equilibria

Asynchronous Best-Reply Dynamics

Finite games: finite number of players, finite number of possible actions, finite number of moves. Canusegametreetodepicttheextensiveform.

Advanced Microeconomics: Game Theory

The Game Theory of Game Theory Ruben R. Puentedura, Ph.D.

Imperfect Information Extensive Form Games

EconS Game Theory - Part 1

ECO 220 Game Theory. Objectives. Agenda. Simultaneous Move Games. Be able to structure a game in normal form Be able to identify a Nash equilibrium

Dominance and Best Response. player 2

Self-interested agents What is Game Theory? Example Matrix Games. Game Theory Intro. Lecture 3. Game Theory Intro Lecture 3, Slide 1

Topics in Applied Mathematics

Lecture 10: September 2

Game Theory. Vincent Kubala

Economics 201A - Section 5

Solution Concepts 4 Nash equilibrium in mixed strategies

Game Theory: Introduction. Game Theory. Game Theory: Applications. Game Theory: Overview

CPS 570: Artificial Intelligence Game Theory

1. Simultaneous games All players move at same time. Represent with a game table. We ll stick to 2 players, generally A and B or Row and Col.

PARALLEL NASH EQUILIBRIA IN BIMATRIX GAMES ISAAC ELBAZ CSE633 FALL 2012 INSTRUCTOR: DR. RUSS MILLER

Lecture 5: Subgame Perfect Equilibrium. November 1, 2006

Adversarial Search and Game Theory. CS 510 Lecture 5 October 26, 2017

Computational Methods for Non-Cooperative Game Theory

Distributed Optimization and Games

ESSENTIALS OF GAME THEORY

Games in Extensive Form

Topic 1: defining games and strategies. SF2972: Game theory. Not allowed: Extensive form game: formal definition

CSC304 Lecture 3. Game Theory (More examples, PoA, PoS) CSC304 - Nisarg Shah 1

ECON 2100 Principles of Microeconomics (Summer 2016) Game Theory and Oligopoly

Chapter 2 Basics of Game Theory

Extensive Form Games and Backward Induction

Game Theory and MANETs: A Brief Tutorial

/633 Introduction to Algorithms Lecturer: Michael Dinitz Topic: Algorithmic Game Theory Date: 12/6/18

Microeconomics of Banking: Lecture 4

Student Name. Student ID

Game Theory Week 1. Game Theory Course: Jackson, Leyton-Brown & Shoham. Game Theory Course: Jackson, Leyton-Brown & Shoham Game Theory Week 1

Math 152: Applicable Mathematics and Computing

17.5 DECISIONS WITH MULTIPLE AGENTS: GAME THEORY

CSC304: Algorithmic Game Theory and Mechanism Design Fall 2016

Instability of Scoring Heuristic In games with value exchange, the heuristics are very bumpy Make smoothing assumptions search for "quiesence"

Dominant and Dominated Strategies

Summary Overview of Topics in Econ 30200b: Decision theory: strong and weak domination by randomized strategies, domination theorem, expected utility

Session Outline. Application of Game Theory in Economics. Prof. Trupti Mishra, School of Management, IIT Bombay

Introduction to (Networked) Game Theory. Networked Life NETS 112 Fall 2016 Prof. Michael Kearns

14.12 Game Theory Lecture Notes Lectures 10-11

Introduction to Game Theory

Exercises for Introduction to Game Theory SOLUTIONS

Game theory Computational Models of Cognition

Transcription:

Minmax and Dominance CPSC 532A Lecture 6 September 28, 2006 Minmax and Dominance CPSC 532A Lecture 6, Slide 1

Lecture Overview Recap Maxmin and Minmax Linear Programming Computing Fun Game Domination Minmax and Dominance CPSC 532A Lecture 6, Slide 2

What are solution concepts? Solution concept: a subset of the outcomes in the game that are somehow interesting. There is an implicit computational problem of finding these outcomes given a particular game. Depending on the concept, existence can be an issue. Solution concepts we ve seen so far: Minmax and Dominance CPSC 532A Lecture 6, Slide 3

What are solution concepts? Solution concept: a subset of the outcomes in the game that are somehow interesting. There is an implicit computational problem of finding these outcomes given a particular game. Depending on the concept, existence can be an issue. Solution concepts we ve seen so far: Pareto-optimal outcome Pure-strategy Nash equilibrium Mixed-strategy Nash equilibrium Other Nash variants: weak Nash equilibrium strict Nash equilibrium Minmax and Dominance CPSC 532A Lecture 6, Slide 3

Mixed Strategies It would be a pretty bad idea to play any deterministic strategy in matching pennies Idea: confuse the opponent by playing randomly Define a strategy s i for agent i as any probability distribution over the actions A i. pure strategy: only one action is played with positive probability mixed strategy: more than one action is played with positive probability these actions are called the support of the mixed strategy Let the set of all strategies for i be S i Let the set of all strategy profiles be S = S 1... S n. Minmax and Dominance CPSC 532A Lecture 6, Slide 4

Best Response and Nash Equilibrium Our definitions of best response and Nash equilibrium generalize from actions to strategies. Best response: s i BR(s i ) iff s i S i, u i (s i, s i) u i (s i, s i ) Nash equilibrium: s = s1,..., s n is a Nash equilibrium iff i, s i BR(s i ) Every finite game has a Nash equilibrium! [Nash, 1950] e.g., matching pennies: both players play heads/tails 50%/50% Minmax and Dominance CPSC 532A Lecture 6, Slide 5

Lecture Overview Recap Maxmin and Minmax Linear Programming Computing Fun Game Domination Minmax and Dominance CPSC 532A Lecture 6, Slide 6

Max-Min Strategies Player i s maxmin strategy is a strategy that maximizes i s worst-case payoff, in the situation where all the other players (whom we denote i) happen to play the strategies which cause the greatest harm to i. The maxmin value (or safety level) of the game for player i is that minimum amount of payoff guaranteed by a maxmin strategy. Why would i want to play a maxmin strategy? Minmax and Dominance CPSC 532A Lecture 6, Slide 7

Max-Min Strategies Player i s maxmin strategy is a strategy that maximizes i s worst-case payoff, in the situation where all the other players (whom we denote i) happen to play the strategies which cause the greatest harm to i. The maxmin value (or safety level) of the game for player i is that minimum amount of payoff guaranteed by a maxmin strategy. Why would i want to play a maxmin strategy? a conservative agent maximizing worst-case payoff a paranoid agent who believes everyone is out to get him Definition The maxmin strategy for player i is arg max si min s i u i (s 1, s 2 ), and the maxmin value for player i is max si min s i u i (s 1, s 2 ). Minmax and Dominance CPSC 532A Lecture 6, Slide 7

Min-Max Strategies Player i s minmax strategy in a 2-player game is a strategy that minimizes the other player i s best-case payoff. The minmax value of the 2-player game for player i is that maximum amount of payoff that i could achieve under i s minmax strategy. Why would i want to play a minmax strategy? Minmax and Dominance CPSC 532A Lecture 6, Slide 8

Min-Max Strategies Player i s minmax strategy in a 2-player game is a strategy that minimizes the other player i s best-case payoff. The minmax value of the 2-player game for player i is that maximum amount of payoff that i could achieve under i s minmax strategy. Why would i want to play a minmax strategy? to punish the other agent as much as possible Definition The maxmin strategy for player i is arg max si min s i u i (s 1, s 2 ), and the maxmin value for player i is max si min s i u i (s 1, s 2 ). Definition In a two-player game, the minmax strategy for player i is arg min si max s i u i (s 1, s 2 ), and the minmax value for player i is min si max s i u i (s 1, s 2 ). Minmax and Dominance CPSC 532A Lecture 6, Slide 8

Minmax Theorem Theorem (Minmax theorem (von Neumann, 1928)) In any finite, two-player, zero-sum game it is the case that: 1. The maxmin value for one player is equal to the minmax value for the other player. By convention, the maxmin value for player 1 is called the value of the game. 2. For both players, the set of maxmin strategies coincides with the set of minmax strategies. 3. Any maxmin strategy profile (or, equivalently, minmax strategy profile) is a Nash equilibrium. Furthermore, these are all the Nash equilibria. Consequently, all Nash equilibria have the same payoff vector (namely, those in which player 1 gets the value of the game). Minmax and Dominance CPSC 532A Lecture 6, Slide 9

Lecture Overview Recap Maxmin and Minmax Linear Programming Computing Fun Game Domination Minmax and Dominance CPSC 532A Lecture 6, Slide 10

Linear Programming A linear program is defined by: a set of real-valued variables a linear objective function a weighted sum of the variables a set of linear constraints the requirement that a weighted sum of the variables must be greater than or equal to some constant Minmax and Dominance CPSC 532A Lecture 6, Slide 11

Linear Programming maximize subject to w i x i i i x i 0 w c i x i b c c C x i X These problems can be solved in polynomial time using interior point methods. Interestingly, the (worst-case exponential) simplex method is often faster in practice. Minmax and Dominance CPSC 532A Lecture 6, Slide 12

Lecture Overview Recap Maxmin and Minmax Linear Programming Computing Fun Game Domination Minmax and Dominance CPSC 532A Lecture 6, Slide 13

Computing equilibria of zero-sum games minimize U1 subject to u 1 (a 1, a 2 ) s a2 2 U 1 a 1 A 1 a 2 A 2 2 = 1 a 2 A 2 s a2 s a 2 2 0 a 2 A 2 variables: U1 is the expected utility for player 1 s a2 2 is player 2 s probability of playing action a 2 under his mixed strategy each u 1 (a 1, a 2 ) is a constant. Minmax and Dominance CPSC 532A Lecture 6, Slide 14

Computing equilibria of zero-sum games minimize U1 subject to u 1 (a 1, a 2 ) s a2 2 U 1 a 1 A 1 a 2 A 2 2 = 1 a 2 A 2 s a2 s a 2 2 0 a 2 A 2 s 2 is a valid probability distribution. Minmax and Dominance CPSC 532A Lecture 6, Slide 14

Computing equilibria of zero-sum games minimize U1 subject to u 1 (a 1, a 2 ) s a2 2 U 1 a 1 A 1 a 2 A 2 2 = 1 a 2 A 2 s a2 s a 2 2 0 a 2 A 2 U 1 is as small as possible. Minmax and Dominance CPSC 532A Lecture 6, Slide 14

Computing equilibria of zero-sum games minimize U1 subject to u 1 (a 1, a 2 ) s a2 2 U 1 a 1 A 1 a 2 A 2 2 = 1 a 2 A 2 s a2 s a 2 2 0 a 2 A 2 Player 1 s expected utility for playing each of his actions under player 2 s mixed strategy is no more than U1. Because U1 is minimized, this constraint will be tight for some actions: the support of player 1 s mixed strategy. Minmax and Dominance CPSC 532A Lecture 6, Slide 14

Computing equilibria of zero-sum games minimize U1 subject to u 1 (a 1, a 2 ) s a2 2 U 1 a 1 A 1 a 2 A 2 2 = 1 a 2 A 2 s a2 s a 2 2 0 a 2 A 2 This formulation gives us the minmax strategy for player 2. To get the minmax strategy for player 1, we need to solve a second (analogous) LP. Minmax and Dominance CPSC 532A Lecture 6, Slide 14

Computing Maxmin Strategies in General-Sum Games Let s say we want to compute a maxmin strategy for player 1 in an arbitrary 2-player game G. Minmax and Dominance CPSC 532A Lecture 6, Slide 15

Computing Maxmin Strategies in General-Sum Games Let s say we want to compute a maxmin strategy for player 1 in an arbitrary 2-player game G. Create a new game G where player 2 s payoffs are just the negatives of player 1 s payoffs. The maxmin strategy for player 1 in G does not depend on player 2 s payoffs Thus, the maxmin strategy for player 1 in G is the same as the maxmin strategy for player 1 in G By the minmax theorem, equilibrium strategies for player 1 in G are equivalent to a maxmin strategies Thus, to find a maxmin strategy for G, find an equilibrium strategy for G. Minmax and Dominance CPSC 532A Lecture 6, Slide 15

Lecture Overview Recap Maxmin and Minmax Linear Programming Computing Fun Game Domination Minmax and Dominance CPSC 532A Lecture 6, Slide 16

Traveler s Dilemma Two travelers purchase identical African masks while on a tropical vacation. Their luggage is lost on the return trip, and the airline asks them to make independent claims for compensation. In anticipation of excessive claims, the airline representative announces: We know that the bags have identical contents, and we will entertain any claim between $180 and $300, but you will each be reimbursed at an amount that equals the minimum of the two claims submitted. If the two claims differ, we will also pay a reward R to the person making the smaller claim and we will deduct a penalty R from the reimbursement to the person making the larger claim. Minmax and Dominance CPSC 532A Lecture 6, Slide 17

Traveler s Dilemma Action: choose an integer between 180 and 300 If both players pick the same number, they both get that amount as payoff If players pick a different number: the low player gets his number (L) plus some constant R the high player gets L R. Play this game once with a partner; play with as many different partners as you like. R = 5. Minmax and Dominance CPSC 532A Lecture 6, Slide 18

Traveler s Dilemma Action: choose an integer between 180 and 300 If both players pick the same number, they both get that amount as payoff If players pick a different number: the low player gets his number (L) plus some constant R the high player gets L R. Play this game once with a partner; play with as many different partners as you like. R = 5. R = 180. Minmax and Dominance CPSC 532A Lecture 6, Slide 18

Traveler s Dilemma What is the equilibrium? Minmax and Dominance CPSC 532A Lecture 6, Slide 19

Traveler s Dilemma What is the equilibrium? (180, 180) is the only equilibrium, for all R 2. Minmax and Dominance CPSC 532A Lecture 6, Slide 19

Traveler s Dilemma What is the equilibrium? (180, 180) is the only equilibrium, for all R 2. What happens? Minmax and Dominance CPSC 532A Lecture 6, Slide 19

Traveler s Dilemma What is the equilibrium? (180, 180) is the only equilibrium, for all R 2. What happens? with R = 5 most people choose 295 300 with R = 180 most people choose 180 Minmax and Dominance CPSC 532A Lecture 6, Slide 19

Lecture Overview Recap Maxmin and Minmax Linear Programming Computing Fun Game Domination Minmax and Dominance CPSC 532A Lecture 6, Slide 20

Domination Let s i and s i be two strategies for player i, and let S i be is the set of all possible strategy profiles for the other players Definition s i strictly dominates s i if s i S i, u i (s i, s i ) > u i (s i, s i) Definition s i weakly dominates s i if s i S i, u i (s i, s i ) u i (s i, s i) and s i S i, u i (s i, s i ) > u i (s i, s i) Definition s i very weakly dominates s i if s i S i, u i (s i, s i ) u i (s i, s i) Minmax and Dominance CPSC 532A Lecture 6, Slide 21

Equilibria and dominance If one strategy dominates all others, we say it is dominant. A strategy profile consisting of dominant strategies for every player must be a Nash equilibrium. An equilibrium in strictly dominant strategies must be unique. Minmax and Dominance CPSC 532A Lecture 6, Slide 22

Equilibria and dominance If one strategy dominates all others, we say it is dominant. A strategy profile consisting of dominant strategies for every player must be a Nash equilibrium. An equilibrium in strictly dominant strategies must be unique. Consider Prisoner s Dilemma again not only is the only equilibrium the only non-pareto-optimal outcome, but it s also an equilibrium in strictly dominant strategies! Minmax and Dominance CPSC 532A Lecture 6, Slide 22

Dominated strategies No equilibrium can involve a strictly dominated strategy (why?) Minmax and Dominance CPSC 532A Lecture 6, Slide 23

Dominated strategies No equilibrium can involve a strictly dominated strategy (why?) Thus we can remove it, and end up with a strategically equivalent game This might allow us to remove another strategy that wasn t dominated before Running this process to termination is called iterated removal of dominated strategies. Minmax and Dominance CPSC 532A Lecture 6, Slide 23