Reinforcement Learning for Stochastic Networks, Toulouse

Name: Reinforcement Learning for Stochastic Networks, Toulouse
Start: 2024-06-17T09:00:00+02:00
End: 2024-06-21T18:00:00+02:00
Location: ENSEEIHT

Jun 17 – 21, 2024

ENSEEIHT

Europe/Paris timezone

Lipschitz Lifelong Reinforcement Learning: transferring value functions across MDPs

Jun 21, 2024, 9:30 AM

Amphi B00 (ENSEEIHT)

Amphi B00

ENSEEIHT

Keynote: Emmanuel Rachelson (ISAE-SUPAERO)

Abstract: How close are the optimal value functions of two Markov decision processes that share the same state and action spaces but have different dynamics and rewards? In this talk, we will consider the problem of knowledge transfer when an agent is facing a series of reinforcement learning (RL) tasks. We will introduce a novel metric between Markov decision processes (MDPs) and establish that close MDPs have close optimal value functions. These theoretical results lead us to a value-transfer method for Lifelong RL, which we use to build a PAC-MDP algorithm with improved convergence rate. Beyond value transfer, this talk will open up on challenges and opportunities deriving from such an analysis.

There are no materials yet.

Reinforcement Learning for Stochastic Networks, Toulouse

Lipschitz Lifelong Reinforcement Learning: transferring value functions across MDPs

Amphi B00

ENSEEIHT

Description

Presentation materials

Choose timezone

Reinforcement Learning for Stochastic Networks, Toulouse

Description

Presentation materials