Speaker
Nicolas Gast
(Inria, Univ. Grenoble Alpes)
Description
Stochastic approximation algorithms are quite popular in reinforcement learning, notably because they are powerful tools to study the convergence of algorithms based on stochastic gradient descent (such as Q-learning or policy gradient). In this talk, I will focus on constant step-size stochastic approximation and present tools to compute its asymptotic bias, which is non-zero (for both martingale and Markovian noise), contrary to the case of decreasing step-sizes. The analysis is based on a fine comparison of the generators of the stochastic system and of its deterministic counterpart, and is similar in spirit to Stein's method.
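The bias phenomenon discussed in the abstract is easy to observe numerically. Below is a minimal illustrative sketch (not the authors' method): a one-dimensional constant step-size iteration x ← x + α(h(x) + ξ) with martingale (i.i.d. Gaussian) noise and the asymmetric drift h(x) = 1 − e^x, whose deterministic fixed point is x* = 0. Because h is nonlinear, the stationary mean of the iterates sits at a distance O(α) from x*; the function name and the choice of h are purely for this demonstration.

```python
import random

def sa_mean(alpha, n_iters=200_000, seed=0):
    """Time-average of constant step-size stochastic approximation
    x <- x + alpha * (h(x) + noise), with h(x) = 1 - exp(x).

    The deterministic ODE x' = h(x) converges to x* = 0, but the
    stochastic iterates have a stationary mean below 0: at stationarity
    E[exp(X)] = 1, so by Jensen's inequality E[X] < 0, with a bias of
    order alpha (roughly -alpha/4 here, for unit-variance noise).
    """
    rng = random.Random(seed)
    x = 0.0
    total = 0.0
    burn = n_iters // 10  # discard transient before averaging
    for t in range(n_iters):
        noise = rng.gauss(0.0, 1.0)  # martingale-difference noise
        x += alpha * ((1.0 - pow(2.718281828459045, x)) + noise)
        if t >= burn:
            total += x
    return total / (n_iters - burn)

# The empirical bias is negative and shrinks proportionally to alpha,
# consistent with the constant step-size behavior described above.
for alpha in (0.1, 0.05):
    print(alpha, sa_mean(alpha))
```

Halving the step-size roughly halves the observed bias, whereas with a decreasing step-size schedule the same iteration would converge to x* = 0.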
Primary author
Nicolas Gast
(Inria, Univ. Grenoble Alpes)
Co-author
Sebastian Allmeier
(Inria, Univ. Grenoble Alpes)