Trivers, which suggestions from William Hamilton started a game called the prisoner's dilemma.. With reciprocal altruism there is a delay in time between the altruism and its reciprocation, but the players in the prisoner's dilemma gave themselves up at the same time (Wright 1994).
Reaching pareto-optimality in prisoner’s dilemma using conditional joint action learning. task. Second, for games such as the Prisoner’s Dilemma, the only NESSG outcome is Pareto-Dominated and playing that repeatedly will result in a Pareto-Dominated outcome for the iterated version of the game as well1. Hence, instead of trying to converge to an NESSG, we set our learning goal to.