Game Theory: Normal Form Games

Similar documents
Game Theory Intro. Lecture 3. Game Theory Intro Lecture 3, Slide 1

Self-interested agents What is Game Theory? Example Matrix Games. Game Theory Intro. Lecture 3. Game Theory Intro Lecture 3, Slide 1

Game Theory Week 1. Game Theory Course: Jackson, Leyton-Brown & Shoham. Game Theory Course: Jackson, Leyton-Brown & Shoham Game Theory Week 1

Analyzing Games: Mixed Strategies

ESSENTIALS OF GAME THEORY

Mixed Strategies; Maxmin

Computing Nash Equilibrium; Maxmin

CS510 \ Lecture Ariel Stolerman

Domination Rationalizability Correlated Equilibrium Computing CE Computational problems in domination. Game Theory Week 3. Kevin Leyton-Brown

Chapter 3 Learning in Two-Player Matrix Games

Noncooperative Games COMP4418 Knowledge Representation and Reasoning

Dominant Strategies (From Last Time)

Microeconomics of Banking: Lecture 4

CMU-Q Lecture 20:

Reading Robert Gibbons, A Primer in Game Theory, Harvester Wheatsheaf 1992.

Dominance and Best Response. player 2

CSCI 699: Topics in Learning and Game Theory Fall 2017 Lecture 3: Intro to Game Theory. Instructor: Shaddin Dughmi

Game Theory. Lecture Notes By Y. Narahari. Department of Computer Science and Automation Indian Institute of Science Bangalore, India August 2012

Advanced Microeconomics (Economics 104) Spring 2011 Strategic games I

Dominant and Dominated Strategies

Minmax and Dominance

Game Theory: The Basics. Theory of Games and Economics Behavior John Von Neumann and Oskar Morgenstern (1943)

Repeated Games. ISCI 330 Lecture 16. March 13, Repeated Games ISCI 330 Lecture 16, Slide 1

U strictly dominates D for player A, and L strictly dominates R for player B. This leaves (U, L) as a Strict Dominant Strategy Equilibrium.

Game Theory. Wolfgang Frimmel. Dominance

Section Notes 6. Game Theory. Applied Math 121. Week of March 22, understand the difference between pure and mixed strategies.

LECTURE 26: GAME THEORY 1

Multi-player, non-zero-sum games

Lecture 6: Basics of Game Theory

Game Tree Search. CSC384: Introduction to Artificial Intelligence. Generalizing Search Problem. General Games. What makes something a game?

CMU Lecture 22: Game Theory I. Teachers: Gianni A. Di Caro

Game Theory ( nd term) Dr. S. Farshad Fatemi. Graduate School of Management and Economics Sharif University of Technology.

Math 464: Linear Optimization and Game

Multiagent Systems: Intro to Game Theory. CS 486/686: Introduction to Artificial Intelligence

(a) Left Right (b) Left Right. Up Up 5-4. Row Down 0-5 Row Down 1 2. (c) B1 B2 (d) B1 B2 A1 4, 2-5, 6 A1 3, 2 0, 1

Game theory attempts to mathematically. capture behavior in strategic situations, or. games, in which an individual s success in

Math 152: Applicable Mathematics and Computing

1. Introduction to Game Theory

1. Simultaneous games All players move at same time. Represent with a game table. We ll stick to 2 players, generally A and B or Row and Col.

ECON 282 Final Practice Problems

1\2 L m R M 2, 2 1, 1 0, 0 B 1, 0 0, 0 1, 1

Introduction to (Networked) Game Theory. Networked Life NETS 112 Fall 2016 Prof. Michael Kearns

Repeated Games. Economics Microeconomic Theory II: Strategic Behavior. Shih En Lu. Simon Fraser University (with thanks to Anke Kessler)

NORMAL FORM (SIMULTANEOUS MOVE) GAMES

Game Theory and Randomized Algorithms

Game Theory. Department of Electronics EL-766 Spring Hasan Mahmood

Basic Solution Concepts and Computational Issues

Problem 1 (15 points: Graded by Shahin) Recall the network structure of our in-class trading experiment shown in Figure 1

Finite games: finite number of players, finite number of possible actions, finite number of moves. Canusegametreetodepicttheextensiveform.

First Prev Next Last Go Back Full Screen Close Quit. Game Theory. Giorgio Fagiolo

ECON 312: Games and Strategy 1. Industrial Organization Games and Strategy

Topic 1: defining games and strategies. SF2972: Game theory. Not allowed: Extensive form game: formal definition

Adversarial Search and Game Theory. CS 510 Lecture 5 October 26, 2017

CS188: Artificial Intelligence, Fall 2011 Written 2: Games and MDP s

ECO 220 Game Theory. Objectives. Agenda. Simultaneous Move Games. Be able to structure a game in normal form Be able to identify a Nash equilibrium

GAME THEORY: STRATEGY AND EQUILIBRIUM

Lecture 23. Offense vs. Defense & Dynamic Games

Dominant and Dominated Strategies

Game theory. Logic and Decision Making Unit 2

Lecture Notes on Game Theory (QTM)

14.12 Game Theory Lecture Notes Lectures 10-11

UPenn NETS 412: Algorithmic Game Theory Game Theory Practice. Clyde Silent Confess Silent 1, 1 10, 0 Confess 0, 10 5, 5

Game Tree Search. Generalizing Search Problems. Two-person Zero-Sum Games. Generalizing Search Problems. CSC384: Intro to Artificial Intelligence

EC3224 Autumn Lecture #02 Nash Equilibrium

Game Theory: introduction and applications to computer networks

CSC384: Introduction to Artificial Intelligence. Game Tree Search

ECON 2100 Principles of Microeconomics (Summer 2016) Game Theory and Oligopoly

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Game Theory

Introduction to (Networked) Game Theory. Networked Life NETS 112 Fall 2014 Prof. Michael Kearns

Extensive Form Games: Backward Induction and Imperfect Information Games

FIRST PART: (Nash) Equilibria

3 Game Theory II: Sequential-Move and Repeated Games

Lecture 10: September 2

Computational Methods for Non-Cooperative Game Theory

Machine Learning in Iterated Prisoner s Dilemma using Evolutionary Algorithms

Multiple Agents. Why can t we all just get along? (Rodney King)

Lecture #3: Networks. Kyumars Sheykh Esmaili

ECO 5341 Strategic Behavior Lecture Notes 3

A Game Playing System for Use in Computer Science Education

Game Theory. Vincent Kubala

17.5 DECISIONS WITH MULTIPLE AGENTS: GAME THEORY

CS 1571 Introduction to AI Lecture 12. Adversarial search. CS 1571 Intro to AI. Announcements

Reinforcement Learning in Games Autonomous Learning Systems Seminar

INTRODUCTION TO GAME THEORY

CSC304 Lecture 2. Game Theory (Basic Concepts) CSC304 - Nisarg Shah 1

EconS Game Theory - Part 1

Introduction to Spring 2009 Artificial Intelligence Final Exam

Game Theory. Vincent Kubala

Game theory Computational Models of Cognition

Game Theory. Share information. Evaluate information (obtained from friends, acquaintances and coworkers) Develop trust. Accept or reject friendship

SF2972: Game theory. Mark Voorneveld, February 2, 2015

THEORY: NASH EQUILIBRIUM

Introduction to Game Theory

ECON 301: Game Theory 1. Intermediate Microeconomics II, ECON 301. Game Theory: An Introduction & Some Applications

Chapter 2 Basics of Game Theory

Multiagent Systems: Intro to Game Theory. CS 486/686: Introduction to Artificial Intelligence

What is... Game Theory? By Megan Fava

Non-Cooperative Game Theory

Introduction to Game Theory

Backward Induction and Stackelberg Competition

Transcription:

Game Theory: Normal Form Games CPSC 322 Lecture 34 April 3, 2006 Reading: excerpt from Multiagent Systems, chapter 3. Game Theory: Normal Form Games CPSC 322 Lecture 34, Slide 1

Lecture Overview Recap Game Theory Example Matrix Games Game Theory: Normal Form Games CPSC 322 Lecture 34, Slide 2

Rewards and Values Suppose the agent receives the sequence of rewards r 1, r 2, r 3, r 4,.... What value should be assigned? total reward V = i=1 r i average reward V = lim n discounted reward V = r 1 + + r n n i=1 γi 1 r i γ is the discount factor 0 γ 1 Game Theory: Normal Form Games CPSC 322 Lecture 34, Slide 3

Policies A stationary policy is a function: π : S A Given a state s, π(s) specifies what action the agent who is following π will do. An optimal policy is one with maximum expected value we ll focus on the case where value is defined as discounted reward. For an MDP with stationary dynamics and rewards with infinite or indefinite horizon, there is always an optimal stationary policy in this case. Game Theory: Normal Form Games CPSC 322 Lecture 34, Slide 4

Value of a Policy Q π (s, a), where a is an action and s is a state, is the expected value of doing a in state s, then following policy π. V π (s), where s is a state, is the expected value of following policy π in state s. Q π and V π can be defined mutually recursively: V π (s) = Q π (s, π(s)) Q π (s, a) = s P (s a, s) ( r(s, a, s ) + γv π (s ) ) Game Theory: Normal Form Games CPSC 322 Lecture 34, Slide 5

Value of the Optimal Policy Q (s, a), where a is an action and s is a state, is the expected value of doing a in state s, then following the optimal policy. V (s), where s is a state, is the expected value of following the optimal policy in state s. Q and V can be defined mutually recursively: Q (s, a) = s P (s a, s) ( r(s, a, s ) + γv (s ) ) V (s) = max Q (s, a) a π (s) = arg max Q (s, a) a Game Theory: Normal Form Games CPSC 322 Lecture 34, Slide 6

Value Iteration Idea: Given an estimate of the k-step lookahead value function, determine the k + 1 step lookahead value function. Set V 0 arbitrarily. e.g., zeros Compute Q i+1 and V i+1 from V i : Q i+1 (s, a) = s P (s a, s) ( r(s, a, s ) + γv i (s ) ) V i+1 (s) = max Q i+1 (s, a) a If we intersect these equations at Q i+1, we get an update equation for V : V i+1 (s) = max P (s ( a, s) r(s, a, s ) + γv i (s ) ) a s Game Theory: Normal Form Games CPSC 322 Lecture 34, Slide 7

Asynchronous VI: storing Q[s, a] Repeat forever: Select state s, action a; ( ) Q[s, a] P (s s, a) R(s, a, s ) + γ max Q[s, a ] ; a s Game Theory: Normal Form Games CPSC 322 Lecture 34, Slide 8

Lecture Overview Recap Game Theory Example Matrix Games Game Theory: Normal Form Games CPSC 322 Lecture 34, Slide 9

Non-Cooperative Game Theory What is it? Game Theory: Normal Form Games CPSC 322 Lecture 34, Slide 10

Non-Cooperative Game Theory What is it? mathematical study of interaction between rational, self-interested agents Game Theory: Normal Form Games CPSC 322 Lecture 34, Slide 10

Non-Cooperative Game Theory What is it? mathematical study of interaction between rational, self-interested agents Why is it called non-cooperative? Game Theory: Normal Form Games CPSC 322 Lecture 34, Slide 10

Non-Cooperative Game Theory What is it? mathematical study of interaction between rational, self-interested agents Why is it called non-cooperative? while it s most interested in situations where agents interests conflict, it s not restricted to these settings the key is that the individual is the basic modeling unit, and that individuals pursue their own interests cooperative/coalitional game theory has teams as the central unit, rather than agents You can think of a non-cooperative game as a decision diagram where different agents control different decision nodes, and where each agent has his own utility node. Game Theory: Normal Form Games CPSC 322 Lecture 34, Slide 10

TCP Backoff Game Should you send your packets using correctly-implemented TCP (which has a backoff mechanism) or using a defective implementation (which doesn t)? Consider this situation as a two-player game: both use a correct implementation: both get 1 ms delay one correct, one defective: 4 ms delay for correct, 0 ms for defective both defective: both get a 3 ms delay. Game Theory: Normal Form Games CPSC 322 Lecture 34, Slide 11

TCP Backoff Game Consider this situation as a two-player game: both use a correct implementation: both get 1 ms delay one correct, one defective: 4 ms delay for correct, 0 ms for defective both defective: both get a 3 ms delay. Questions: What action should a player of the game take? Would all users behave the same in this scenario? What global patterns of behaviour should the system designer expect? Under what changes to the delay numbers would behavior be the same? What effect would communication have? Repetitions? (finite? infinite?) Does it matter if I believe that my opponent is rational? Game Theory: Normal Form Games CPSC 322 Lecture 34, Slide 11

Defining Games Finite, n-person game: N, A, u : N is a finite set of n players, indexed by i A = A1,..., A n is a set of actions for each player i a A is an action profile u = {u 1,..., u n }, a utility function for each player, where u i : A R Writing a 2-player game as a matrix: row player is player 1, column player is player 2 rows are actions a A1, columns are a A 2 cells are outcomes, written as a tuple of utility values for each player Game Theory: Normal Form Games CPSC 322 Lecture 34, Slide 12

Lecture Overview Recap Game Theory Example Matrix Games Game Theory: Normal Form Games CPSC 322 Lecture 34, Slide 13

d D (for using a Defective one). If both you and your colleague verage packet delay is 1ms (millisecond). If you both adopt D the segames of additional in Matrix overhead at Form the network router. Finally, if one of e other adopts C then the D adopter will experience no delay at all, ill experience a delay of 4ms. ces arehere s shownthe in Figure TCP Backoff 3.1. YourGame options written are theas two a matrix rows, and ( normal form ) tions are andthe ascolumns. a decisionin network. each cell, the first number represents us your delay), and the second number represents your colleague s Recap Game Theory Example Matrix Games C D Action by Player 1 Action by Player 2 C 1, 1 4,0 D 0, 4 3, 3 P1 Utility P2 Utility ure 3.1 The TCP user s (aka the Prisoner s) Dilemma. ns what should you adopt, C or D? Does it depend on what you e will do? Furthermore, from the perspective of the network operaavior can he expect from the two users? Will any two users behave Game Theory: Normal Form Games CPSC 322 Lecture 34, Slide 14

d D (for using a Defective one). If both you and your colleague verage packet delay is 1ms (millisecond). If you both adopt D the segames of additional in Matrix overhead at Form the network router. Finally, if one of e other adopts C then the D adopter will experience no delay at all, ill experience a delay of 4ms. ces arehere s shownthe in Figure TCP Backoff 3.1. YourGame options written are theas two a matrix rows, and ( normal form ) tions are andthe ascolumns. a decisionin network. each cell, the first number represents us your delay), and the second number represents your colleague s Recap Game Theory Example Matrix Games C D Action by Player 1 Action by Player 2 C 1, 1 4,0 D 0, 4 3, 3 P1 Utility P2 Utility ure 3.1 The TCP user s (aka the Prisoner s) Dilemma. Play this game with someone near you, repeating five times. ns what should you adopt, C or D? Does it depend on what you e will do? Furthermore, from the perspective of the network operaavior can he expect from the two users? Will any two users behave Game Theory: Normal Form Games CPSC 322 Lecture 34, Slide 14

More General Form 3 Competition and Coordination: Normal form games Prisoner s dilemma is any game C D C a,a b,c D c,b d,d Figure 3.3 Any c > a > d > b define an instance of Prisoner s Dilemma. with c > a > d > b. To fully understand the role of the payoff numbers we would need to enter into a discussion of utility theory. Here, let us just mention that for most purposes, the analysis of any game is unchanged if the payoff numbers undergo any positive affine Game Theory: Normal Form Games CPSC 322 Lecture 34, Slide 15

Games of Pure Competition Players have exactly opposed interests There must be precisely two players (otherwise they can t have exactly opposed interests) For all action profiles a A, u 1 (a) + u 2 (a) = c for some constant c Special case: zero sum Thus, we only need to store a utility function for one player Game Theory: Normal Form Games CPSC 322 Lecture 34, Slide 16

the abbreviation we must explicit state whether this matrix represents a common-payoff game or a zero-sum one. Matching A classical Pennies example of a zero-sum game is the game of matching pennies. In this game, each of the two players has a penny, and independently chooses to display either heads or tails. The two players then compare their pennies. If they are the same then player 1 pockets both, and otherwise player 2 pockets them. The payoff matrix is shown One in Figure player 3.5. wants to match; the other wants to mismatch. Heads Tails Heads 1 1 Tails 1 1 Figure 3.5 Matching Pennies game. The popular children s game of Rock, Paper, Scissors, also known as Rochambeau, provides a three-strategy generalization of the matching-pennies game. The payoff matrix of this zero-sum game is shown in Figure 3.6. In this game, each of the two players can choose either Rock, Paper, or Scissors. If both players choose the same Game Theory: Normal Form Games CPSC 322 Lecture 34, Slide 17

the abbreviation we must explicit state whether this matrix represents a common-payoff game or a zero-sum one. Matching A classical Pennies example of a zero-sum game is the game of matching pennies. In this game, each of the two players has a penny, and independently chooses to display either heads or tails. The two players then compare their pennies. If they are the same then player 1 pockets both, and otherwise player 2 pockets them. The payoff matrix is shown One in Figure player 3.5. wants to match; the other wants to mismatch. Heads Tails Heads 1 1 Tails 1 1 Figure 3.5 Matching Pennies game. Play this game with someone near you, repeating five times. The popular children s game of Rock, Paper, Scissors, also known as Rochambeau, provides a three-strategy generalization of the matching-pennies game. The payoff matrix of this zero-sum game is shown in Figure 3.6. In this game, each of the two players can choose either Rock, Paper, or Scissors. If both players choose the same Game Theory: Normal Form Games CPSC 322 Lecture 34, Slide 17

Rock-Paper-Scissors 3 Competition and Coordination: Normal form games Generalized matching pennies. Rock Paper Scissors Rock 0 1 1 Paper 1 0 1 Scissors 1 1 0 Figure 3.6 Rock, Paper, Scissors game....believe it or not, there s an annual international competition for this game! VG GL VG 2,1 0,0 Game Theory: Normal Form Games CPSC 322 Lecture 34, Slide 18

Games of Cooperation Players have exactly the same interests. no conflict: all players want the same things a A, i, j, u i (a) = u j (a) we often write such games with a single payoff per cell why are such games noncooperative? Game Theory: Normal Form Games CPSC 322 Lecture 34, Slide 19

Game Theory: Normal Form Games c Shoham and Leyton-Brown, 2006 CPSC 322 Lecture 34, Slide 20 the agents have no conflicting interests; their sole challenge is to coordinate on an Recap Game Theory Example Matrix Games action that is maximally beneficial to all. Coordination Because of theirgame special nature, we often represent common value games with an abbreviated form of the matrix in which we list only one payoff in each of the cells. As an example, imagine two drivers driving towards each other in a country without traffic rules, and who must independently decide whether to drive on the left or on the right. If the players choose the same side (left or right) they have some high utility, and otherwise Which they side have ofathe lowroad utility. should The game you matrix drive on? is shown in Figure 3.4. Left Right Left 1 0 Right 0 1 Figure 3.4 Coordination game. At the other end of the spectrum from pure coordination games lie zero-sum games, which (bearing in mind the comment we made earlier about positive affine transformations) are more properly called constant-sum games. Unlike common-payoff games,

the agents have no conflicting interests; their sole challenge is to coordinate on an Recap Game Theory Example Matrix Games action that is maximally beneficial to all. Coordination Because of theirgame special nature, we often represent common value games with an abbreviated form of the matrix in which we list only one payoff in each of the cells. As an example, imagine two drivers driving towards each other in a country without traffic rules, and who must independently decide whether to drive on the left or on the right. If the players choose the same side (left or right) they have some high utility, and otherwise Which they side have ofathe lowroad utility. should The game you matrix drive on? is shown in Figure 3.4. Left Right Left 1 0 Right 0 1 Figure 3.4 Coordination game. Play this game with someone near you, repeating five times. At the other end of the spectrum from pure coordination games lie zero-sum games, which (bearing in mind the comment we made earlier about positive affine transformations) are more properly called constant-sum games. Unlike common-payoff games, c Shoham and Leyton-Brown, 2006 Game Theory: Normal Form Games CPSC 322 Lecture 34, Slide 20

Rock 0 1 1 Recap Game Theory Example Matrix Games We have so far defined the actions available to each player in a game, but not yet his Game set of Theory: strategies, Normal FormorGames his available choices. Certainly one kind of CPSC strategy 322 Lecture is to 34, select Slide 21 General Games: Battle of the Sexes Paper 1 0 1 Scissors 1 1 0 The most interesting games combine elements of cooperation and competition. Figure 3.6 Rock, Paper, Scissors game. B F B 2,1 0,0 F 0,0 1,2 Figure 3.7 Battle of the Sexes game. Strategies in normal-form games

Rock 0 1 1 Recap Game Theory Example Matrix Games General Games: Battle of the Sexes Paper 1 0 1 Scissors 1 1 0 The most interesting games combine elements of cooperation and competition. Figure 3.6 Rock, Paper, Scissors game. B F B 2,1 0,0 F 0,0 1,2 Figure 3.7 Battle of the Sexes game. Play this game with someone near you, repeating five times. Strategies in normal-form games We have so far defined the actions available to each player in a game, but not yet his set of strategies, or his available choices. Certainly one kind of strategy is to select Game Theory: Normal Form Games CPSC 322 Lecture 34, Slide 21