Q-learning
Overcoming Omniscience in Axelrod’s Model
Utility Based Q-learning to Maintain Cooperation in Prisoner’s Dilemma Games