The iterated prisoner's dilemma is a widely known model in game theory, fundamental to many theories of cooperation and trust among self-interested beings. There are many works in literature about developing efficient strategies for this problem, both inside and outside the machine learning community. This paper shift the focus from finding a “good strategy” in absolute terms, to dynamically adapting and optimizing the strategy against the current opponent. Turan evolves competitive non-deterministic models of the current opponent, and exploit them to predict its moves and maximize the payoff as the game develops. Experimental results show that the proposed approach is able to obtain good performances against different kind of opponent, whether their strategies can or cannot be implemented as finite state machines.
TURAN: Evolving non-deterministic players for the iterated prisoner's dilemma / Gaudesi, Marco; Piccolo, Elio; Squillero, Giovanni; Tonda, ALBERTO PAOLO. - STAMPA. - (2014), pp. 21-27. (Intervento presentato al convegno 2014 IEEE Congress on Evolutionary Computation (CEC) tenutosi a Beijing, China nel 06 - 11 July, 2014) [10.1109/CEC.2014.6900564].
TURAN: Evolving non-deterministic players for the iterated prisoner's dilemma
GAUDESI, MARCO;PICCOLO, Elio;SQUILLERO, Giovanni;
2014
Abstract
The iterated prisoner's dilemma is a widely known model in game theory, fundamental to many theories of cooperation and trust among self-interested beings. There are many works in literature about developing efficient strategies for this problem, both inside and outside the machine learning community. This paper shift the focus from finding a “good strategy” in absolute terms, to dynamically adapting and optimizing the strategy against the current opponent. Turan evolves competitive non-deterministic models of the current opponent, and exploit them to predict its moves and maximize the payoff as the game develops. Experimental results show that the proposed approach is able to obtain good performances against different kind of opponent, whether their strategies can or cannot be implemented as finite state machines.Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.
https://hdl.handle.net/11583/2565754
Attenzione
Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo