Reinforcement Learning for Stochastic Networks, Toulouse

Name: Reinforcement Learning for Stochastic Networks, Toulouse
Start: 2024-06-17T09:00:00+02:00
End: 2024-06-21T18:00:00+02:00
Location: ENSEEIHT

Europe/Paris timezone

Session

Keynote: Éric Moulines (École Polytechnique)

Jun 18, 2024, 9:30 AM

Amphi B00 (ENSEEIHT)

88. Finite Sample analysis of linear stochastic approximation and TD learning

Abstract: In this talk, we consider the problem of obtaining sharp bounds for linear stochastic approximation. We then apply these results to temporal difference (TD) methods with linear function approximation for policy evaluation in discounted Markov decision processes. We show that a simple algorithm with a universal and instance-independent step size together with Polyak-Ruppert tail...
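
For readers unfamiliar with the setting, the following is a minimal sketch of TD(0) with linear function approximation, a constant step size, and averaging over the second half of the iterates. It assumes the truncated phrase in the abstract refers to averaging the tail of the iterates. The 3-state chain, the rewards, the one-hot features, the discount factor, and the step size 0.05 are all hypothetical illustration choices and are not taken from the talk. In particular, 0.05 is not the universal step size the abstract mentions.

```python
import random

# Hypothetical 3-state Markov reward process (illustration only, not from the talk).
P = [[0.5, 0.5, 0.0],
     [0.25, 0.5, 0.25],
     [0.0, 0.5, 0.5]]
R = [0.0, 0.0, 1.0]   # reward collected in each state
GAMMA = 0.8           # discount factor (assumed)

# One-hot features, so theta[s] is directly the value estimate of state s.
# Any other feature map with the same number of rows could be substituted.
PHI = [[1.0, 0.0, 0.0],
       [0.0, 1.0, 0.0],
       [0.0, 0.0, 1.0]]


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def td0_tail_averaged(n_steps, step_size, seed=0):
    """Run TD(0) along one trajectory and average the last half of the iterates."""
    rng = random.Random(seed)
    d = len(PHI[0])
    theta = [0.0] * d
    tail_sum = [0.0] * d
    tail_start = n_steps // 2  # assumed tail window: second half of the run
    s = 0
    for k in range(n_steps):
        # Sample the next state from row s of the transition matrix.
        s_next = rng.choices(range(len(P)), weights=P[s])[0]
        # Temporal difference error under the current linear estimate.
        td_error = R[s] + GAMMA * dot(PHI[s_next], theta) - dot(PHI[s], theta)
        # Linear stochastic approximation update with a constant step size.
        theta = [t + step_size * td_error * f for t, f in zip(theta, PHI[s])]
        if k >= tail_start:
            tail_sum = [a + t for a, t in zip(tail_sum, theta)]
        s = s_next
    n_tail = n_steps - tail_start
    return [a / n_tail for a in tail_sum]


def exact_values(n_sweeps=500):
    """Reference values from repeated Bellman backups on the known model."""
    v = [0.0] * len(P)
    for _ in range(n_sweeps):
        v = [R[s] + GAMMA * dot(P[s], v) for s in range(len(P))]
    return v


theta_bar = td0_tail_averaged(n_steps=100_000, step_size=0.05)
```

With one-hot features, `theta_bar` can be compared entry by entry with `exact_values()`. With a lower-dimensional feature map, the comparison would instead be against the TD fixed point, which this sketch does not compute.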