Reinforcement Learning for Stochastic Networks, Toulouse

Name: Reinforcement Learning for Stochastic Networks, Toulouse
Start: 2024-06-17T09:00:00+02:00
End: 2024-06-21T18:00:00+02:00
Location: ENSEEIHT

Jun 17 – 21, 2024

ENSEEIHT

Europe/Paris timezone

Reinforcement learning in a prisoner's dilemma

Jun 20, 2024, 1:30 PM

30m

A001 (ENSEEIHT)

A001

ENSEEIHT

Parallel session: Algorithmic collusion: Foundations for understanding the emergence of anticompetitive behaviour

Artur Dolgopolov (Bielefeld University)

I characterize the outcomes of a class of model-free reinforcement learning algorithms, such as stateless Q-learning, in a prisoner's dilemma. The behavior is studied in the limit as players stop experimenting after sufficiently exploring their options. A closed form relationship between the learning rate and game payoffs reveals whether the players will learn to cooperate or defect. The findings have implications for algorithmic collusion and also apply to asymmetric learners with different experimentation rules.

Artur Dolgopolov (Bielefeld University)

There are no materials yet.

Reinforcement Learning for Stochastic Networks, Toulouse

Reinforcement learning in a prisoner's dilemma

A001

ENSEEIHT

Speaker

Description

Author

Presentation materials

Choose timezone

Reinforcement Learning for Stochastic Networks, Toulouse

Speaker

Description

Author

Presentation materials