Behavioral Strategies in Zero-Sum Games in Extensive Form

Similar documents
Game Theory Refresher. Muriel Niederle. February 3, A set of players (here for simplicity only 2 players, all generalized to N players).

Game theory attempts to mathematically. capture behavior in strategic situations, or. games, in which an individual s success in

Introduction to Industrial Organization Professor: Caixia Shen Fall 2014 Lecture Note 6 Games and Strategy (ch.4)-continue

Introduction to Game Theory

Introduction to Algorithms / Algorithms I Lecturer: Michael Dinitz Topic: Algorithms and Game Theory Date: 12/4/14

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Game Theory

Appendix A A Primer in Game Theory

Game Theory: The Basics. Theory of Games and Economics Behavior John Von Neumann and Oskar Morgenstern (1943)

/633 Introduction to Algorithms Lecturer: Michael Dinitz Topic: Algorithmic Game Theory Date: 12/6/18

Summary Overview of Topics in Econ 30200b: Decision theory: strong and weak domination by randomized strategies, domination theorem, expected utility

CHAPTER LEARNING OUTCOMES. By the end of this section, students will be able to:

37 Game Theory. Bebe b1 b2 b3. a Abe a a A Two-Person Zero-Sum Game

Asynchronous Best-Reply Dynamics

Sequential Games When there is a sufficient lag between strategy choices our previous assumption of simultaneous moves may not be realistic. In these

Belief-based rational decisions. Sergei Artemov

Simple Decision Heuristics in Perfec Games. The original publication is availabl. Press

The extensive form representation of a game

Dynamic games: Backward induction and subgame perfection

3 Game Theory II: Sequential-Move and Repeated Games

CS510 \ Lecture Ariel Stolerman

Advanced Microeconomics: Game Theory

Applied Game Theory And Strategic Behavior Chapter 1 and Chapter 2. Author: Siim Adamson TTÜ 2010

Some introductory notes on game theory

Game Theory and Randomized Algorithms

8.F The Possibility of Mistakes: Trembling Hand Perfection

Microeconomics II Lecture 2: Backward induction and subgame perfection Karl Wärneryd Stockholm School of Economics November 2016

Dynamic Games: Backward Induction and Subgame Perfection

International Economics B 2. Basics in noncooperative game theory

Game Theory and Economics Prof. Dr. Debarshi Das Humanities and Social Sciences Indian Institute of Technology, Guwahati

Opponent Models and Knowledge Symmetry in Game-Tree Search

Topic 1: defining games and strategies. SF2972: Game theory. Not allowed: Extensive form game: formal definition

Computing Nash Equilibrium; Maxmin

ECON 312: Games and Strategy 1. Industrial Organization Games and Strategy

2. Extensive Form Games

1\2 L m R M 2, 2 1, 1 0, 0 B 1, 0 0, 0 1, 1

2. The Extensive Form of a Game

final examination on May 31 Topics from the latter part of the course (covered in homework assignments 4-7) include:

Domination Rationalizability Correlated Equilibrium Computing CE Computational problems in domination. Game Theory Week 3. Kevin Leyton-Brown

February 11, 2015 :1 +0 (1 ) = :2 + 1 (1 ) =3 1. is preferred to R iff

(a) Left Right (b) Left Right. Up Up 5-4. Row Down 0-5 Row Down 1 2. (c) B1 B2 (d) B1 B2 A1 4, 2-5, 6 A1 3, 2 0, 1

Game theory Computational Models of Cognition

Game Theory for Strategic Advantage Alessandro Bonatti MIT Sloan

Repeated Games. Economics Microeconomic Theory II: Strategic Behavior. Shih En Lu. Simon Fraser University (with thanks to Anke Kessler)

1 Simultaneous move games of complete information 1

Contents. MA 327/ECO 327 Introduction to Game Theory Fall 2017 Notes. 1 Wednesday, August Friday, August Monday, August 28 6

DECISION MAKING GAME THEORY

Mixed Strategies; Maxmin

Econ 302: Microeconomics II - Strategic Behavior. Problem Set #5 June13, 2016

Strategic Bargaining. This is page 1 Printer: Opaq

Refinements of Sequential Equilibrium

ECON 2100 Principles of Microeconomics (Summer 2016) Game Theory and Oligopoly

Economics of Strategy (ECON 4550) Maymester 2015 Foundations of Game Theory

Applied Game Theory And Strategic Behavior Chapter 1 and Chapter 2 review

Chapter 13. Game Theory

Lecture Notes on Game Theory (QTM)

Sequential games. Moty Katzman. November 14, 2017

1. Simultaneous games All players move at same time. Represent with a game table. We ll stick to 2 players, generally A and B or Row and Col.

Extensive-Form Correlated Equilibrium: Definition and Computational Complexity

GOLDEN AND SILVER RATIOS IN BARGAINING

Section Notes 6. Game Theory. Applied Math 121. Week of March 22, understand the difference between pure and mixed strategies.

CSCI 699: Topics in Learning and Game Theory Fall 2017 Lecture 3: Intro to Game Theory. Instructor: Shaddin Dughmi

Dominant and Dominated Strategies

ECON 282 Final Practice Problems

Economics II: Micro Winter 2009 Exercise session 4 Aslanyan: VŠE

EconS 424- Strategy and Game Theory Reputation and Incomplete information in a public good project How to nd Semi-separating equilibria?

U strictly dominates D for player A, and L strictly dominates R for player B. This leaves (U, L) as a Strict Dominant Strategy Equilibrium.

CS188 Spring 2014 Section 3: Games

Zolt-Gilburne Imagination Seminar. Knowledge and Games. Sergei Artemov

Game Theory. Wolfgang Frimmel. Subgame Perfect Nash Equilibrium

2. Basics of Noncooperative Games

Game theory lecture 5. October 5, 2013

Topics in Applied Mathematics

Pedigree Reconstruction using Identity by Descent

Extensive Games with Perfect Information. Start by restricting attention to games without simultaneous moves and without nature (no randomness).

Math 611: Game Theory Notes Chetan Prakash 2012

Exercises for Introduction to Game Theory SOLUTIONS

Best Response to Tight and Loose Opponents in the Borel and von Neumann Poker Models

Game Theory ( nd term) Dr. S. Farshad Fatemi. Graduate School of Management and Economics Sharif University of Technology.

ECON 301: Game Theory 1. Intermediate Microeconomics II, ECON 301. Game Theory: An Introduction & Some Applications

Lecture 7: Dominance Concepts

Chapter 15: Game Theory: The Mathematics of Competition Lesson Plan

Statistical Analysis of Nuel Tournaments Department of Statistics University of California, Berkeley

1. The chance of getting a flush in a 5-card poker hand is about 2 in 1000.

Game Theory: introduction and applications to computer networks

Extensive Form Games. Mihai Manea MIT

Note: A player has, at most, one strictly dominant strategy. When a player has a dominant strategy, that strategy is a compelling choice.

Stochastic Game Models for Homeland Security

Fictitious Play applied on a simplified poker game

Instability of Scoring Heuristic In games with value exchange, the heuristics are very bumpy Make smoothing assumptions search for "quiesence"

Chapter 2 Basics of Game Theory

Game Theory. Lecture Notes By Y. Narahari. Department of Computer Science and Automation Indian Institute of Science Bangalore, India August 2012

Dynamic Programming in Real Life: A Two-Person Dice Game

Game Theory Lecturer: Ji Liu Thanks for Jerry Zhu's slides

Student Name. Student ID

Introduction to Game Theory

Economics 201A - Section 5

Optimal Rhode Island Hold em Poker

Computational Aspects of Game Theory Bertinoro Spring School Lecture 2: Examples

Lecture 6: Basics of Game Theory

Game Theory. Department of Electronics EL-766 Spring Hasan Mahmood

Transcription:

Behavioral Strategies in Zero-Sum Games in Extensive Form Ponssard, J.-P. IIASA Working Paper WP-74-007 974

Ponssard, J.-P. (974) Behavioral Strategies in Zero-Sum Games in Extensive Form. IIASA Working Paper. WP-74-007 Copyright 974 by the author(s). http://pure.iiasa.ac.at/64/ Working Papers on work of the International Institute for Applied Systems Analysis receive only limited review. Views or opinions expressed herein do not necessarily represent those of the Institute, its National Member Organizations, or other organizations supporting the work. All rights reserved. Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage. All copies must bear this notice and the full citation on the first page. For other purposes, to republish, to post on servers or to redistribute to lists, permission must be sought by contacting repository@iiasa.ac.at

BEHAVIORAL STRATEGIES IN ZERO SUM GAMES IN EXTENSIVE FORM J. P. Ponssard January 974 WP-74-7

Behavioral Strategies in Zero Sum Games in Extensive Form. Introduction The purpose of these notes is to present some formal properties of behavioral strategies in relation to some comments made by Aumann and Maschler [A-M]. In their main example, they show that the behavioral strategy generated by the optimal mixed strategy may be dominated in terms of security level by some other strategy. Wilson has proposed a constructive scheme to generate a decision tree for each player from a game tree as long as it is with perfect recall [W]. Using this scheme, one may precisely define the expected payoff conditional on being at a given informal set and on the other players' strategies. The security level conditional on being at a given informal set may similarly be defined using the player's decision tree. Note that since all moves are "sequentially played" in a decision tree, randomization is irrelevant for the security level. Then the mathematical counterpart of Aumann and Maschler's argument for zero sum games in extensive form may be stated as follows: at the information sets of a game tree, the optimal behavioral strategies may only satisfy one part of the double minimax inequality, namely the equilibrium part. Furthermore, the only class of games for which the double minimax inequality seems to remain valid is the class with perfect information since, in this case, the player's decision trees are identical. These notes are organized as follows: the second section merely consists of an example to illustrate our restatement of Aumann and Maschler's argument. Some more perplexing remarks about behavioral strategies after a non optimal move will also be presented.

2 2. An Example Consider the game tree depicted in Figure. It may be interpreted as a one stage poker game with a high or low card and two possible raise moves or a drop move for player and a drop or call move for player 2. Assume that player is the maximiser. o D 2 <--- c 3 /2 L -- - D R2 -- o -2 Move: Chance Player I Player II Figure The Game Tree There are two information sets for player II each one including two nodes. We want to define an expected payoff conditional on each information set and a given strategy of player. The optimal behavioral strategy for player I (which, for instance, may be obtained from the normal form) is to play R2 if the chance move is H and to play R2 with probability 2/3 and D with probability /3 if the chance move is L. Once player I's move is played, player II's conditional expectations may be defined by constructing his decision tree. According to Wilson's procedure, this decision tree

セ 3 is described in Figure 2. Note that the probabilities on Hand L are now conditional on HI or H2. Conditional on HI, these probabilities are not defined and we shall come back to this point later on. Conditional on H2, these probabilities are easily seen to be 3/5 and 2/5 respectively. D o d c H 2 - chance d c H ---rrs- 3 セM 2 5.-. -2 Player II chance Figure 2 Player II's Decision We are now in a position to define player II's expected payoff conditional on H2 being played and on player I's strategy. The expected payoff is if he plays d and l. 3 - セ 2 = if he plays c. His optimal behavioral stragegy 5 5 at this information set is to play d with probability 2/3 and c with probability /3. As such,.it maximizes his expected payoff conditional on R2 being played and on player I's optimal strategy. What is the security level associated with his optimal strategy given that H2 was played? If player I's strategy was to play H2 only if the outcome of the chance move is H, then player II's expectation, given his own optimal strategy and conditional on H2 being played and on player I's new strategy, would now be [ 2 c:; 3.3-0.2J + 3 =3 which is worse than (remember that player II is the minimiser).

4 Let (s, t) denote behavioral strategies for player I and II respectively and (s*, t*) denote the optimal strategies. Let E be a player's information set and V(s*, t* E) be his conditional payoff at this information set. Assume-that this information set belongs to player I. Then it is easy to show that: for all s, V( s, t* E) < V( s*, t*/ E), whereas the following inequality may not be true as shown in the preceding example: (ii) for all t, V( s*, t* E) < V( s*, tie) Thus, in terms of conditional payoffs, only the equilibrium part of the double minimax inequality remains satisfied. Note that as soon as the players are called simultaneously to make a move in a game tree, it is likely to result in different security levels for each of them since, according to our definition of conditional payoffs at an information set, it cannot be increased by means of randomization. Thus one would expect that the only class of games for which (i) and (ii) could be satisfied at each information set are games with perfect information. And indeed, one can easily show that for such games, (i) and (ii) remain true since then both players have the same decision tree. This would suggest that the minimax principle is essentially appropriate for the "static" normal form (and Von Neumann and Morgenstern do claim repeatedly that they are building a static theory [N-MJ). SUbsequently, "optimal" behavioral strategies in zero sum extensive games may ordinarily only qualify as "equilibrium" strategies in a "non-zero sum" setif'e. But would the equilibrium principle, with its well kno\-tn pi.tfalls, be actually suitable for ttdynamic" theory of extensive games? The following remarks are presented so as to point out some difficulties in the セケョ ュゥ aspect of the equilibrium principle. Consider again the poker example but now assume that player II is at the other information set; that is, assume that Rl was played. Player II's equilibrium behavioral strategy is degenerated and may be any convex combination of the following two strategies: the first one consists of playing d with probability セ and c wi th probabili エケセ L and the second one of playing d with probability t and c

5 with probability セ N As we said earlier, player II's expectation conditional on HI being played and on player I's equilibrium strategy is not mathematically defined since it is conditioned on any event with zero probability. Thus, inequality (i) is not defined. Nevertheless, let us try to interpret player II's equilibrium strategy by working backward. Clearly it does not guarantee him his conditional security level which is. If it were to maximize his expected payoff conditional on HI being played, then this would imply that the probabilities on Hand L would be 2/3 and /3 respectively. This, in turn, would imply that player I would have chosen HI with a probability, say, k if the chance move ish and with a probabili ty k/2 is the chance move is L. This seems to be a very definite statement to make since the only thing that player II knows about player I's strategy is that he made a mistake. How he made it is certainly a matter of opinion and not a mathematical fact (or would there be anything like an "optimal" mistake?). Thus, it seems unjustified to say that player II's equilibrium strategy maximizes his expected payoff conditional on HI being played. The only rationale for player II's equilibrium strategy appears to be that it makes move HI unattractive to player I and thus should enforce him to play his equilibrium strategy. As such, it may be interpreted as a good threat to deter player I from a deviating behavior. However, once player I did deviate, this threat has no theoretical justification any longer. (To carry out this threat may be worthwhile in a repeated situation but this is a one shot game. ) Thus, the equilibrium principle, just as the minimax principle, appears to be time dependent, though in a much weaker sense; that is, outside the "equilibrium paths" of the game tree.

6 References [A-M] Aumann, R.J. and M. Maschler, "Some Thoughts on the Minimax Principle", Management Science, Vol. 8, No.5. [N-M] Von Neumann, J. and A. Morgenstern, Theory of Games and Economic Behavior, Princeton University Press, Princeton, N.J., 944, Third Edition, 953. [W] Wilson, R., "Computing Equilibria of Two Person Games from the Extensive Form", W-P No. 76, May 970, Graduate School of. Business, Stanford University.