Note Watkins CJCHTechnical PDayan
Overcoming Omniscience in Axelrod’s Model
Utility Based Q-learning to Maintain Cooperation in Prisoner’s Dilemma Games