The iterated prisoner’s dilemma is a famous model of cooperation and conflict in game theory. Its origin can be traced back to the Cold War, and countless strategies for playing it have been proposed so far, either designed by hand or automatically generated by computers. In the 2000s, scholars started focusing on adaptive players, that is, able to classify their opponent’s behavior and adopt an effective counter-strategy. The player presented in this paper, pushes such idea even further: it builds a model of the current adversary from scratch, without relying on any pre-defined archetypes, and tweaks it as the game develops using an evolutionary algorithm; at the same time, it exploits the model to lead the game into the most favorable continuation. Models are compact non-deterministic finite state machines; they are extremely efficient in predicting opponents’ replies, without being completely correct by necessity. Experimental results show that such player is able to win several one-to- one games against strong opponents taken from the literature, and that it consistently prevails in round-robin tournaments of different sizes.
Exploiting Evolutionary Modeling to Prevail in Iterated Prisoner’s Dilemma Tournaments / Marco, Gaudesi; Piccolo, Elio; Squillero, Giovanni; Alberto, Tonda. - In: IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES. - ISSN 1943-068X. - STAMPA. - 8:3(2016), pp. 288-300. [10.1109/TCIAIG.2015.2439061]
Exploiting Evolutionary Modeling to Prevail in Iterated Prisoner’s Dilemma Tournaments
PICCOLO, Elio;SQUILLERO, Giovanni;
2016
Abstract
The iterated prisoner’s dilemma is a famous model of cooperation and conflict in game theory. Its origin can be traced back to the Cold War, and countless strategies for playing it have been proposed so far, either designed by hand or automatically generated by computers. In the 2000s, scholars started focusing on adaptive players, that is, able to classify their opponent’s behavior and adopt an effective counter-strategy. The player presented in this paper, pushes such idea even further: it builds a model of the current adversary from scratch, without relying on any pre-defined archetypes, and tweaks it as the game develops using an evolutionary algorithm; at the same time, it exploits the model to lead the game into the most favorable continuation. Models are compact non-deterministic finite state machines; they are extremely efficient in predicting opponents’ replies, without being completely correct by necessity. Experimental results show that such player is able to win several one-to- one games against strong opponents taken from the literature, and that it consistently prevails in round-robin tournaments of different sizes.File | Dimensione | Formato | |
---|---|---|---|
Tages14.pdf
accesso aperto
Tipologia:
2. Post-print / Author's Accepted Manuscript
Licenza:
Pubblico - Tutti i diritti riservati
Dimensione
502.41 kB
Formato
Adobe PDF
|
502.41 kB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.
https://hdl.handle.net/11583/2622370
Attenzione
Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo