Optimization Incentives And Coordination Failure In Laboratory Stag Hunt Games

Raymond Battalio, Larry Samuelson, and John Van Huyck

December 1999

[ Download | Introduction | Conclusion | References | Footnotes | John's Web ]

Abstract: This paper reports an experiment comparing three stag hunt games that have the same best-response correspondence and the same expected payoff from the mixed equilibrium, but differ in the incentive to play a best response rather than an inferior response. In each game, risk dominance conflicts with payoff dominance and selects an inefficient pure strategy equilibrium. We find statistically and economically significant evidence that the differences in the incentive to optimize help explain observed behavior.


Introduction

The abstraction assumptions specifying feasible strategies, preferences over consequences, and substantively rational players, which define a strategic form game, construct a powerful framework within which to analyze strategic behavior. These abstraction assumptions in turn can be summarized by the best response correspondence. One need only know the best response correspondence of a strategic form game to identify its Nash equilibria. 

This paper reports an investigation of three stag hunt games using the experimental method. The three games have the same best response correspondence as well as similar payoff magnitudes, but produce different behavior. Games 2R, R, and 0.6R, shown in Figures 1, 2, and 3, were used in the experiment. 

Under the assumption that players maximize the expected value of their earnings, games 2R, R, and 0.6R have identical best response correspondences. Strategy X is a strict best response to any mixture that attaches a probability greater than k* to X, where k* = 0.8, while Y is a strict best response to any mixture attaching a lower probability to X. Each game has two pure-strategy equilibria, where (X,X) is payoff dominant and (Y,Y) is risk dominant,[1] as well as a mixed equilibrium in which X is played with probability k*. To the extent possible, the games also involve payoffs of similar magnitudes. In particular, the expected payoff from the mixed equilibrium is 36 for all three games.

        X         Y         X         Y         X         Y
   X      45,45        0,35    X      45,45       0,40    X      45,45       0,40
   Y       35,0       40,40    Y       40,0      20,20    Y      42,40      12,12

Figure 1: Game 2R                           Figure 2: Game R                          Figure 3: Game 0.6R

The classical approach to games typically either exploits only the information contained in a game’s best response correspondence, or augments this information with risk-dominance and payoff-dominance considerations in order to choose between strict Nash equilibria. In either case, games 2R, R and 0.6R are treated identically.[2] 

Our analysis of games 2R, R, and 0.6R is motivated by the observation that the pecuniary incentive to select a best response to an opponent’s strategy is twice as large in game 2R as it is in game R and six tenths as large in game 0.6R as it is in game R. We call this incentive the optimization premium: the difference between the payoff of the best response to an opponent’s strategy and the inferior response. The optimization premium may be irrelevant to substantively rational agents, but we expect people to more readily learn to play a best response when the optimization premium is large, and expect the differing optimization premia of games 2R, R, and 0.6R to induce systematically different play in laboratory experiments. 

It is well known that increasing the stakes for which subjects play games can affect their behavior, see Smith (1976) or Slonim and Roth (1998). We maintain approximately the same stakes in all of our games by fixing the payoff to the mixed equilibrium at 36 cents. One can think of the optimization premium as describing the steepness, rather than the level, of the payoff function near an equilibrium. A larger optimization premium implies not that equilibrium payoffs are higher, but rather that the penalty to suboptimal play is larger. 

Our experimental results provide evidence that changing the optimization premium between X and Y influences behavior. The sensitivity of individual subjects to the history of opponents’ play is greater in games with a larger optimization premium. Behavior converges more quickly in game 2R than in R, and more quickly in game R than in game 0.6R. The payoff dominant equilibrium is more likely to emerge the smaller is the optimization premium. 

The following section describes the experiment. Section 3 discusses how play may be expected to differ across the three games. Section 4 presents the experimental results. The penultimate section relates our findings to the literature and the final section contains concluding comments.

[ Top | Download | Introduction | Conclusion | References | Footnotes | John's Web ]


Download

Adobe Acrobat (PDF) format:

Surface mail request (comments, suggestions, references, etc.): john.vanhuyck@tamu.edu

[ Top | Download | Introduction | Conclusion | References | Footnotes | John's Web ]


Conclusion

Our results provide evidence that more than the best response correspondence matters when predicting human behavior in laboratory experiments. We have focused on the optimization premium, that is, the expected earnings difference between the two actions, in three stag hunt games that have the same best response correspondence, the same mixed strategy equilibrium, and the same expected payoff at this mixed strategy equilibrium, but have different pecuniary incentives to play a best response. We find statistically and economically significant evidence that the optimization premium helps explain observed behavior. The sensitivity of individual subjects to the history of opponents’ play is greater in games with a larger optimization premium. Behavior converges more quickly the larger the optimization premium. The risk dominant equilibrium is more likely to emerge the larger is the optimization premium.

[ Top | Download | Introduction | Conclusion | References | Footnotes | John's Web ]


References

L. Anderlini, “Communication, computability, and common interest games,” Economic Theory Discussion Paper 159, St. John’s College, Cambridge, 1990. 

Simon P. Anderson, André de Palma, and Jacques-François Thisse, Discrete Choice Theory of Product Differentiation, (Cambridge, MA: The MIT Press, 1992). 

Ken Binmore, John Gale, and Larry Samuelson, “Learning to be Imperfect: The Ultimatum Game,” Games and Economic Behavior 8, 1995, 56-90. 

Ken Binmore and Larry Samuelson, “Muddling through: Noisy equilibrium selection” Journal of Economic Theory 74(2) June 1997, 235-265. 

Robert Bloomfield, “Learning a mixed strategy equilibrium in the laboratory,” Journal of Economic Behavior and Organization, 25, 1994, 411-36.

Tilman Borgers and Rajiv Savin, “Learning through Reinforcement and the Replicator Dynamics,” Journal of Economic Theory 77, 1997, 1-14. 

Antonio Cabrales, “Stochastic Replicator Dynamics” Mimeo, Universitat Pompeu Fabra, Barcelona, 1993. 

Colin Camerer and Teck-Hua Ho, “Experience-weighted Attraction Learning in Games: Estimates from Weak-Link Games,” laser-script November 1996. 

Hans Carlsson and Eric van Damme, “Global games and equilibrium selection,” Econometrica, 61: 989-1018, 1993. 

Yin-Wong Cheung and Daniel Friedman, “Individual Learning in Normal Form Games: Some Laboratory Results,” laser-script December 1995. 

Kenneth Clark, Stephen Kay and Martin Sefton, “When are Nash Equilibria Self-Enforcing? An Experimental Analysis,” laser-script, May 1996. 

Russell Cooper, Douglas V. DeJong, Robert Forsythe, and Thomas W. Ross, “Communication in coordination games,” Quarterly Journal of Economics, 107:739-773, 1992. 

Erev, Ido and Alvin E. Roth, “On the need for low rationality, cognitive game theory: Reinforcement learning in experimental games with unique, mixed strategy equilibria”, laser-script August 1995.

Daniel Friedman, “Equilibrium in evolutionary games: Some experimental results,” Economic Journal, 106:1-25, 1996. 

Drew Fudenberg and David K. Levine, Theory of Learning in Games, laser-script December 1996. 

John C. Harsanyi, “A new theory of equilibrium selection for games with complete information,” Games and Economic Behavior, 8:91-122, 1995. 

John C. Harsanyi and Reinhard Selten, A General Theory of Equilibrium Selection in Games, MIT Press, Cambridge, Massachusetts, 1988.

John Hillas, “On the definition of the strategic stability of equilibria,” Econometrica, 58:1365-1390, 1990. 

Michihiro Kandori, George J. Mailath, and Rafael Rob, “Learning, mutation, and long run equilibria in games,” Econometrica, 61:29-56, 1993. 

Elon Kohlberg and Jean-Francois Mertens, “On the strategic stability of equilibria,” Econometrica, 54:1003-1038, 1986. 

R. Duncan Luce, Individual Choice Behavior: A Theoretical Analysis (New York,NY: John Wiley & Sons, 1959). 

G.S. Maddala, Limited-Dependent and Qualitative Variables in Econometrics, (New York, NY: Cambridge University Press, 1983) 

David P. Myatt and Chris Wallace, “Adaptive Dynamics and Payoff Heterogeneity,” Mimeo, Nuffield College, Oxford, 1998. 

Richard D. McKelvey and Thomas R. Palfrey, “Quantal Response Equilibria for Normal Form Games,” Games and Economic Behavior, 10(1) July 1995, 6-38. 

Dilip Mookherjee and Barry Sopher, “Learning Behavior in an Experimental Matching Pennies Game,” Games and Economic Behavior 7(1), July 1994, 62-91.

Frederick Rankin, John Van Huyck, and Raymond Battalio, “Strategic Similarity And Emergent Conventions: Evidence from Scrambled Payoff Perturbed Stag Hunt Games”, forthcoming Games and Economic Behavior. 

Arthur J. Robson and Fernando Vega-Redondo, “Efficient equilibrium selection in evolutionary games with random matching,” Journal of Economic Theory 70 (1996), 65-92. 

Alvin E. Roth and Ido Erev, “Learning in Extensive Form Games: Experimental Data and Simple Dynamic Models in the Intermediate Term,” Games and Economic Behavior 8 (1995), 164-212.

Karl Schlag, “Why Imitate, and if so, How?,” SFB Discussion Paper B-243, University of Bonn, 1994.

Martin Sefton, “Modelling Behavior in Coordination Game Experiments,” Laser-script, October 10, 1996. 

R. Slonim and A. E. Roth, “Learning in High Stakes Ultimatum Games: An Experiment in the Slovak Republic,” Econometrica 66 (1998), 569- 596. 

Vernon Smith, “Experimental Economics: Induced Value Theory,” American Economic Review 66 (1976), 274-279. 

Dale O. Stahl and and Paul W. Wilson, “On players’ models of other players: Theory and Experimental Evidence,” Games and Economic Behavior 10(1), 1995, 218-254. 

Paul Straub, “Risk Dominance and Coordination Failure in Static Games,” The Quarterly Review of Economics and Finance, 35(4) Winter 1995, 339-363. 

Eric van Damme, “Refinements of Nash equilibrium,” In Jean Jacques Laffont, editor, Advances in Economic Theory: Sixth World Congress. Cambridge University Press, Cambridge, 1992. 

John Van Huyck, Raymond Battalio, and Frederick Rankin, “On the Origin of Convention: Evidence from Coordination Games,” Economic Journal 107(442) May 1997, 576-597. 

John Van Huyck, Joe Cook, and Raymond Battalio, “Adaptive Behavior and Coordination Failure,” forthcoming in Journal of Economic Behavior and Organization.Jürgen Weibull. 

Evolutionary Game Theory. MIT Press, Cambridge, 1995. Peyton Young, “The evolution of conventions,” Econometrica, 61: 57-84, 1993.

[ Top | Download | Introduction | Conclusion | References | Footnotes | John's Web ]


Footnotes

[1]  See Harsanyi and Selten (1988) on payoff dominance and risk dominance.

[2]  Hillas (1990) introduces a reformulation of Kohlberg and Mertens’ (1986) strategic stability that makes the exclusive reliance on the best response correspondence particularly obvious. Among theories that make an equilibrium selection in the stag hunt game, Carlsson and van Damme (1993) and Harsanyi (1995) choose the risk dominant equilibrium, while Anderlini (1990) and Harsanyi and Selten (1988) choose the payoff dominant equilibrium.

[ Top | Download | Introduction | Conclusion | References | Footnotes | John's Web ]