Reinforcement Learning for Stochastic Networks, Toulouse

Name: Reinforcement Learning for Stochastic Networks, Toulouse
Start: 2024-06-17T09:00:00+02:00
End: 2024-06-21T18:00:00+02:00
Location: ENSEEIHT

Jun 17 – 21, 2024

ENSEEIHT

Europe/Paris timezone

Session

Keynote: Sean Meyn (University of Florida)

Jun 21, 2024, 2:00 PM

Amphi B00 (ENSEEIHT)

Amphi B00

ENSEEIHT

There are no materials yet.

79. The Projected Bellman Equation in Reinforcement Learning

6/21/24, 2:00 PM

Abstract: A topic of discussion throughout the 2020 Simons program on reinforcement learning: is the Q-learning algorithm convergent outside of the tabular setting? It is now known that stability can be assured using a matrix gain algorithm, but this requires assumptions, which begs the next question: does a solution to the projected Bellman equation exist? This is the most minimal...
Go to contribution page

Building timetable...

Choose timezone

Reinforcement Learning for Stochastic Networks, Toulouse

Presentation materials