The evolution of cooperation - 2

“In this book I will examine interactions between just two players at a time. A single player may be interacting with many others, but the player is assumed to be interacting with them one at a time.3 The player is also assumed to recognize another player and to remember how the two of them have interacted so far. This ability to recognize and remember allows the history of the particular interaction to be taken into account by a player’s strategy. A variety of ways to resolve the Prisoner’s Dilemma have been developed. Each involves allowing some additional activity that alters the strategic interaction in such a way as to fundamentally change the nature of the problem. The original problem remains, however, because there are many situations in which these remedies are not available. Therefore, the problem will be considered in its fundamental form, without these alterations.

There is no mechanism available to the players to make enforceable threats or commitments (Schelling 1960). Since the players cannot commit themselves to a particular strategy, each must take into account all possible strategies that might be used by the other player. Moreover the players have all possible strategies available to themselves

There is no way to be sure what the other player will do on a given move. This eliminates the possibility of rne-tagame analysis (Howard 1971) which allows such options as “make the same choice as the other is about to make.” It also eliminates the possibility of reliable reputations such as might be based on watching the other player interact with third parties. Thus the only information available to the players about each other is the history of their interaction so far.

There is no way to eliminate the other player or run away from the interaction. Therefore each player retains the ability to cooperate or defect on each move.

There is no way to change the other player’s payoffs. The payoffs already include whatever consideration each player has for the interests of the other (Taylor 1976, pp. 69-73).”

这里提了一下研究的iterative prisoner dillema的定义。一个玩家同时和另一个进行游戏，但每一轮可能和不同玩家进行。玩家间能相互确认身份并了解之前的相互间之前玩过的轮次记录，但不能看到对方和其他玩家的历史记录。玩家之间不能相互强迫做出选择，每一次选择都是独立的选择，相互没办法完全预判选择。每个选择必须要做出合作或者背叛。每轮的结果是固定的。

“The analysis of the data from these tournaments reveals four properties which tend to make a decision rule successful: avoidance of unnecessary conflict by cooperating as long as the other player does, provocability in the face of an uncalled for defection by the other, forgiveness after responding to a provocation, and clarity of behavior so that the other player can adapt to your pattern of action.”

在很多玩家参与的游戏中，优秀的策略满足一下几个特性：

不先背叛
面对一次背叛，立即会在下次面对同一个玩家时进行包袱
报复后会选择原谅
自己的策略很容易能让对方理解并一次选择对方的策略（而使得他自己的长期利益能够最大化）