Game Theory two-person, zero-sum games

Similar documents
Math 611: Game Theory Notes Chetan Prakash 2012

Lecture Notes on Game Theory (QTM)

Math 464: Linear Optimization and Game

Chapter 15: Game Theory: The Mathematics of Competition Lesson Plan

Game Theory Lecturer: Ji Liu Thanks for Jerry Zhu's slides

Game Theory. Problem data representing the situation are constant. They do not vary with respect to time or any other basis.

Section Notes 6. Game Theory. Applied Math 121. Week of March 22, understand the difference between pure and mixed strategies.

Summary Overview of Topics in Econ 30200b: Decision theory: strong and weak domination by randomized strategies, domination theorem, expected utility

Game Theory. Chapter 2 Solution Methods for Matrix Games. Instructor: Chih-Wen Chang. Chih-Wen NCKU. Game Theory, Ch2 1

Optimization of Multipurpose Reservoir Operation Using Game Theory

Chapter 3 Learning in Two-Player Matrix Games

GAME THEORY Day 5. Section 7.4

DECISION MAKING GAME THEORY

37 Game Theory. Bebe b1 b2 b3. a Abe a a A Two-Person Zero-Sum Game

CS510 \ Lecture Ariel Stolerman

ECON 282 Final Practice Problems

Strategic Bargaining. This is page 1 Printer: Opaq

Game Theory and Randomized Algorithms

Japanese. Sail North. Search Search Search Search

CSCI 699: Topics in Learning and Game Theory Fall 2017 Lecture 3: Intro to Game Theory. Instructor: Shaddin Dughmi

Game Theory: The Basics. Theory of Games and Economics Behavior John Von Neumann and Oskar Morgenstern (1943)

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Game Theory

Lecture 6: Basics of Game Theory

ECON 2100 Principles of Microeconomics (Summer 2016) Game Theory and Oligopoly

Game Theory and Algorithms Lecture 3: Weak Dominance and Truthfulness

Game Theory. Department of Electronics EL-766 Spring Hasan Mahmood

Student Name. Student ID

Games. Episode 6 Part III: Dynamics. Baochun Li Professor Department of Electrical and Computer Engineering University of Toronto

Chapter 13. Game Theory

Dominant and Dominated Strategies

Game Theory. Wolfgang Frimmel. Dominance

Optimum Gain Analysis Using the Principle of Game Theory

final examination on May 31 Topics from the latter part of the course (covered in homework assignments 4-7) include:

U strictly dominates D for player A, and L strictly dominates R for player B. This leaves (U, L) as a Strict Dominant Strategy Equilibrium.

(a) Left Right (b) Left Right. Up Up 5-4. Row Down 0-5 Row Down 1 2. (c) B1 B2 (d) B1 B2 A1 4, 2-5, 6 A1 3, 2 0, 1

Computing Nash Equilibrium; Maxmin

CS 1571 Introduction to AI Lecture 12. Adversarial search. CS 1571 Intro to AI. Announcements

THEORY: NASH EQUILIBRIUM

Game Theory Refresher. Muriel Niederle. February 3, A set of players (here for simplicity only 2 players, all generalized to N players).

Introduction to Game Theory a Discovery Approach. Jennifer Firkins Nordstrom

Domination Rationalizability Correlated Equilibrium Computing CE Computational problems in domination. Game Theory Week 3. Kevin Leyton-Brown

Session Outline. Application of Game Theory in Economics. Prof. Trupti Mishra, School of Management, IIT Bombay

Overview GAME THEORY. Basic notions

1. Simultaneous games All players move at same time. Represent with a game table. We ll stick to 2 players, generally A and B or Row and Col.

ECON 312: Games and Strategy 1. Industrial Organization Games and Strategy

1\2 L m R M 2, 2 1, 1 0, 0 B 1, 0 0, 0 1, 1

Note: A player has, at most, one strictly dominant strategy. When a player has a dominant strategy, that strategy is a compelling choice.

Mixed Strategies; Maxmin

Rational decisions in non-probabilistic setting

Alternation in the repeated Battle of the Sexes

Math 152: Applicable Mathematics and Computing

CHAPTER LEARNING OUTCOMES. By the end of this section, students will be able to:

Dynamic Games: Backward Induction and Subgame Perfection

Dominant and Dominated Strategies

Computing optimal strategy for finite two-player games. Simon Taylor

Fictitious Play applied on a simplified poker game

Multi-player, non-zero-sum games

Lecture #3: Networks. Kyumars Sheykh Esmaili

Introduction to Game Theory

DYNAMIC GAMES. Lecture 6

Introduction to (Networked) Game Theory. Networked Life NETS 112 Fall 2014 Prof. Michael Kearns

ECON 301: Game Theory 1. Intermediate Microeconomics II, ECON 301. Game Theory: An Introduction & Some Applications

Contents. MA 327/ECO 327 Introduction to Game Theory Fall 2017 Notes. 1 Wednesday, August Friday, August Monday, August 28 6

Introduction to Game Theory I

Lect 15:Game Theory: the math of competition

GAME THEORY MODULE 4. After completing this supplement, students will be able to: 1. Understand the principles of zero-sum, two-person games.

Introduction to (Networked) Game Theory. Networked Life NETS 112 Fall 2016 Prof. Michael Kearns

Dice Games and Stochastic Dynamic Programming

ECO 463. SimultaneousGames

Advanced Microeconomics: Game Theory

Microeconomics of Banking: Lecture 4

How to divide things fairly

1 Deterministic Solutions

Reading Robert Gibbons, A Primer in Game Theory, Harvester Wheatsheaf 1992.

Game theory. Logic and Decision Making Unit 2

Game Theory. Vincent Kubala

Chapter 2 Basics of Game Theory

Exercises for Introduction to Game Theory SOLUTIONS

arxiv: v1 [cs.gt] 16 Jun 2015

2. Basics of Noncooperative Games

Lecture 10: September 2

Game Theory. Vincent Kubala

1. Introduction to Game Theory

NORMAL FORM (SIMULTANEOUS MOVE) GAMES

What is... Game Theory? By Megan Fava

UPenn NETS 412: Algorithmic Game Theory Game Theory Practice. Clyde Silent Confess Silent 1, 1 10, 0 Confess 0, 10 5, 5

Introduction: What is Game Theory?

The tenure game. The tenure game. Winning strategies for the tenure game. Winning condition for the tenure game

Exploitability and Game Theory Optimal Play in Poker

Variations on the Two Envelopes Problem

Distributed Optimization and Games

Part I. First Notions

Suppose Y is a random variable with probability distribution function f(y). The mathematical expectation, or expected value, E(Y) is defined as:

8.F The Possibility of Mistakes: Trembling Hand Perfection

ARTIFICIAL INTELLIGENCE (CS 370D)

Engineering Decisions

Chapter 30: Game Theory

INSTRUCTIONS: all the calculations on the separate piece of paper which you do not hand in. GOOD LUCK!

Finite games: finite number of players, finite number of possible actions, finite number of moves. Canusegametreetodepicttheextensiveform.

Make better decisions. Learn the rules of the game before you play.

Transcription:

GAME THEORY

Game Theory Mathematical theory that deals with the general features of competitive situations. Examples: parlor games, military battles, political campaigns, advertising and marketing campaigns, etc. Final outcome depends primarily upon combination of strategies selected by adversaries. Emphasis on decision-making processes of adversaries. We will focus on the simplest case: two-person, zero-sum games 522

The odds and evens game Each player simultaneously shows 1 or 2 fingers. One player takes evens, the other player takes odds. Player 1 wins the bet (say $1) if total of fingers is even and loses (say $1) if it is odd; vice-versa for Player 2. Each player has 2 strategies: show 1 or 2 fingers. Payoff table: Player 1 (even) Strategy Player 2 (odd) 1 2 1 1-1 2-1 1 523

Two-person, zero-sum game Characterized by: Strategies of player 1; Strategies of player 2; Payoff table. Strategy: predetermined rule that specifies completely how one intends to respond to each possible circumstance at each stage of the game. Payoff table: shows gain (positive or negative) for one player that would result from each combination of strategies for the two players. 524

Game theory Primary objective is the development of rational criteria for selecting a strategy. Two key assumptions are made: 1. Both players are rational; 2. Both players choose their strategies solely to promote their own welfare (no compassion for opponent). Contrasts with decision analysis, where it is assumed that the decision maker is playing a game with passive opponent nature which chooses its strategies in a random fashion. 525

Prototype example Two politicians are running against each other for Senate. Campaign plans must be made for final two days. Both politicians can do campaign in two key cities, spending either one full day in each city, or two full days in one city. Campaign manager in each city assesses the impact of possible combinations for politician and his opponent. Politician shall use information to choose his best strategy on how to use the two days. 526

Formulation Identify the two players, the strategies of each player and the payoff table. Each player has three strategies: 1. Spend one day in each city. 2. Spend two days in Bigtown. 3. Spend two days in Megalopolis. Appropriate entries for payoff table for one politician are the total net votes won from the opponent resulting from the two days of campaigning. 527

Variation 1 of example Total Net Votes Won by Politician 1 (in Units of 1,000 Votes) Politician 1 Strategy Politician 2 1 2 3 1 1 2 4 2 1 0 5 3 0 1-1 Which strategy should each player select? Apply concept of dominated strategies to rule out succession of inferior strategies until only one choice remains. 528

Dominated strategy A strategy is dominated by a second strategy if the second strategy is always at least as good (and sometimes better) regardless of what the opponent does. A dominated strategy can be eliminated immediately from further consideration. Payoff table includes no dominated strategies for player 2. For player 1, strategy 3 is dominated by strategy 1. Resulting reduced table: 1 2 3 1 1 2 4 2 1 0 5 529

Variation 1 of example (cont.) Strategy 3 for player 2 is now dominated by strategies 1 and 2 of player 1. Reduced table: Strategy 2 of player 1 is now dominated by strategy 1. Reduced table: 1 2 1 1 2 2 1 0 1 2 1 1 2 Strategy 2 for player 2 dominated by strategy 1. Both players should select their strategy 1. 530

Value of the game In the example, politician 1 receives a payoff 1000 votes from politician 2. Payoff to player 1 when both players play optimally is the value of the game. Game with value of zero is a fair game. Concept of dominated strategy is useful for: Reducing size of payoff table to be considered. Identifying optimal solution of the game in special cases as the variation 1 of the example. 531

Variation 2 of example Given the payoff table, which strategy should each player select? Strategy Politician 1 Total Net Votes Won by Politician 1 (in Units of 1,000 Votes) Politician 2 1 2 3 1-3 -2 6 2 2 0 2 3 5-2 -4 Maximum 5 0 6 Minimax value Minimum -3 0 Maxmin value -4 Saddle point (equilibrium solution) Both politicians break even: fair game! 532

Minimax criterion Each player should play in such a way as to minimize his maximum losses whenever the resulting choice of strategy cannot be exploited by the opponent to then improve his position. Select a strategy that would be best even if the selection were being announced to the opponent before the opponent chooses a strategy. Player 1 should select the strategy whose minimum payoff is largest, whereas player 2 should choose the one whose maximum payoff to player 1 is the smallest. 533

Variation 3 of example Given the payoff table, which strategy should each player select? Strategy Politician 1 Total Net Votes Won by Politician 1 (in Units of 1,000 Votes) Politician 2 1 2 3 1 0-2 2 2 5 4-3 3 2 3-4 Maximum 5 4 2 Cycle! Unstable solution Minimum -2 Maxmin value -3-4 Minimax value 534

Variation 3 of example (cont.) Originally suggested solution is an unstable solution (no saddle point). Whenever one player s strategy is predictable, the opponent can take advantage of this information to improve his position. An essential feature of a rational plan for playing a game such as this one is that neither player should be able to deduce which strategy the other will use. It is necessary to choose among alternative acceptable strategies on some kind of random basis. 535

Games with mixed strategies Whenever a game does not possess a saddle point, game theory advises each player to assign a probability distribution over her set of strategies. Let: x i = probability that player 1 will use strategy i (i = 1,2,, m) y j = probability that player 2 will use strategy j (j = 1,2,, n) Probabilities need to be nonnegative and add to 1. These plans (x 1, x 2,..., x m ) and (y 1, y 2,..., y n ) are usually referred to as mixed strategies, and the original strategies are called pure strategies. 536

When the game is actually played It is necessary for each player to use one of her pure strategies. Pure strategy would be chosen by using some random device to obtain a random observation from the probability distribution specified by the mixed strategy. This observation would indicate which particular pure strategy to use. 537

Expected payoff Suppose politicians 1 and 2 select the mixed strategies (x 1, x 2, x 3 ) = (½,½,0) and (y 1, y 2, y 3 ) = (0,½,½). Each player could then flip a coin to determine which of his two acceptable pure strategies he will actually use. Useful measure of performance is expected payoff: Expected payoff for player 1 m n i 1 j 1 pxy ij i j p ij is payoff if player 1 uses pure strategy i and player 2 uses pure strategy j. 538

Expected payoff (cont.) 4 possible payoffs (-2,2,4,-3), each with probability ¼ Expected payoff is ¼(-2+2+4-3)=¼ This measure of performance does not disclose anything about the risks involved in playing the game. It indicates what the average payoff will tend to be if the game is played many times. Game theory extends the concept of the minimax criterion to games that lack a saddle point and thus need mixed strategies. 539

Minimax criterion for mixed strategies A given player should select the mixed strategy that maximizes the minimum expected payoff to the player. Optimal mixed strategy for player 1 is the one that provides the guarantee (minimum expected payoff) that is best (maximal). Value of best guarantee is the maximin value Optimal strategy for player 2 provides the best (minimal) guarantee (maximum expected loss) Value of best guarantee is the minimax value 540

Stable and unstable solutions Using only pure strategies, games not having a saddle point turned out to be unstable because. Players would want to change their strategies to improve their positions. For games with mixed strategies, it is necessary that for optimal solution to be stable. This condition always holds for such games according to the minimax theorem of game theory. 541

Minimax theorem Minimax theorem: If mixed strategies are allowed, the pair of mixed strategies that is optimal according to the minimax criterion provides a stable solution with (the value of the game), so that neither player can do better by unilaterally changing her or his strategy. Mixed strategies becomes quite intuitive if the game is played repeatedly, it requires some interpretation when the game is to be played just once. How to find the optimal mixed strategy for each player? 542

Graphical solution procedure Consider any game with mixed strategies such that, after dominated strategies are eliminated, one of the players has only two pure strategies Mixed strategies are (x 1, x 2 ) and x 2 = 1 x 1, so it is necessary to solve only for the optimal value of x 1. Plot of expected payoff as a function of x 1 for each of her opponent s pure strategies can be done. Plot can be used to identify: point that maximizes the minimum expected payoff; opponent s minimax mixed strategy. 543

Back to variation 3 of example Politician 1 Politician 2 Probability y 1 y 2 y 3 Probability Pure strategy 1 2 3 x 1 1 0-2 2 1 -x 1 2 5 4-3 For each of the pure strategies available to player 2, the expected payoff for player 1 is (y 1, y 2,y 3 ) Expected payoff (1, 0, 0) (0, 1, 0) (0, 0, 1) 0x 1 + 5(1 - x 1 ) = 5-5x 1-2x 1 + 4(1 - x 1 ) = 4-6x 1 2x 1-3(1 - x 1 ) = -3 + 5x 1 544

Optimal solution for politician 1 max{min{ 3 5 x,4 6 x}} 0 x 1 1 1 1 x * 1 7 11 x * 2 4 11 2 11 Minimum expected payoff 545

Optimal solution for politician 2 Expected payoff resulting from optimal strategy for all values of x 1 satisfies: When player 1 is playing optimally, Also So y (5 5 x ) y (4 6 x ) y (3 5 x ) * * * 1 1 2 1 3 1 x1 7 11 20 * * * y1 2 y 11 11 2 2 2 y 11 3 11 * * * y1 y2 y3 1 * * * y1 0, y 5 6 2 and y 11 3 11 and 2 11 546

Solving by linear programming Any game with mixed strategies can be transformed to a linear programming problem applying the minimax theorem and using the definitions of maximin value and minimax value. Define xm 1 yn 1 547

LP problem for player 1 Maximize subject to... and m 1 p x p x... p x x 0 11 1 21 2 m1 m m 1 p x p x... p x x 0 12 1 22 2 m2 m m 1 p x p x... p x x 0 1n 1 2n 2 mn m m 1 x x... x 1 1 2 m x 0 for i=1,2,..., m i x 548

LP problem for player 2 Minimize subject to... and n 1 p y p y... p y y 0 11 1 12 2 1n n n 1 p y p y... p y y 0 21 1 22 2 2n n n 1 p y p y... p y y 0 m1 1 m2 2 mn n n 1 y y... y 1 1 2 y 0 for j=1,2,..., n j y n 549

Duality Player 2 LP problem and player 1 LP problem are dual to each other. Optimal mixed strategies for both players can be found by solving only one of the LP problems. Duality provides simple proof of the minimax theorem. 550

Still a loose end What to do about x m+1 and y n+1 being unrestricted in sign in the LP formulations? If 0, add nonnegativity constraints. If < 0, either: 1. Replace variable without a nonnegativity constraint by the difference of two nonnegative variables; 2. Reverse players 1 and 2 so that payoff table would be rewritten as the payoff to the original player 2; 3. Add a sufficiently large fixed constant to all entries in payoff table that new value of game will be positive. 551

Example Consider again variation 3 after dominated strategy 3 for player 1 is eliminated. Maximize subject to and 1 3 3 5x x 0 2x 4x x 0 1 2 3 2x 3x x 0 x x 1 2 3 1 2 x x 1 0, x 0 1 2 552

Example Applying the simplex method to this LP problem (after adding the constraint x 3 0) yields: * * * x1 7, x 4 2 2, x 11 11 3 11 Dual problem yields (adding the constraint y 4 0) * * * * y1 0, y 5 6 2 2, y3, y 11 11 4 11 553

Extensions Two-person, constant-sum game: sum of payoffs to two players is fixed constant (positive or negative) regardless of combination of strategies selected. N-person game, e.g., competition among business firms, international diplomacy, etc. Nonzero-sum game: e.g., advertising strategies of competing companies can affect not only how they will split the market but also the total size of the market for their competing products. Size of mutual gain (or loss) for the players depends on combination of strategies chosen. 554

Extensions Nonzero-sum games classified in terms of the degree to which the players are permitted to cooperate. Noncooperative game: there is no preplay communication between players. Cooperative game: where preplay discussions and binding agreements are permitted. Infinite games: players have infinite number of pure strategies available to them. Strategy to be selected can be represented by a continuous decision variable. 555

Conclusions General problem of how to make decisions in a competitive environment is a very common and important one. Fundamental contribution of game theory is a basic conceptual framework for formulating and analyzing such problems in simple situations. Research is continuing with some success to extend the theory to more complex situations. 556