Reinforcement Learning for Stochastic Networks, Toulouse

Name: Reinforcement Learning for Stochastic Networks, Toulouse
Start: 2024-06-17T09:00:00+02:00
End: 2024-06-21T18:00:00+02:00
Location: ENSEEIHT

Jun 17 – 21, 2024

ENSEEIHT

Europe/Paris timezone

Session

Parallel session: Reinforcement learning for combinatorial problems

Jun 18, 2024, 1:30 PM

A002 (ENSEEIHT)

A002

ENSEEIHT

There are no materials yet.

22. The Traveling Salesman Problem: Novel Approaches Grounded in Evolutionary Reinforcement Learning

Prof. SAFA BHAR LAYEB (LR-OASIS National Engineering School of Tunis, University of Tunis El Manar, Tunis, Tunisia)

6/18/24, 1:30 PM

Deep Reinforcement Learning (DRL) has showcased remarkable achievements across various domains, such as image recognition and automation. Nevertheless, its potential in the realm of logistics and transportation, particularly in tackling routing challenges, remains mostly untapped. On the contrary, Evolutionary Algorithms (EA) have enjoyed widespread adoption for solving combinatorial...
Go to contribution page
55. Scalable Policies for the Dynamic Traveling Multi-Maintainer Problem with Alerts

Peter Verleijsdonk (Eindhoven University of Technology)

6/18/24, 2:00 PM

Downtime of industrial assets such as wind turbines and medical imaging devices is costly. To avoid such downtime costs, companies seek to initiate maintenance just before failure, which is challenging because: (i) Asset failures are notoriously difficult to predict, even in the presence of real-time monitoring devices that signal degradation; and (ii) Limited resources are available to serve...
Go to contribution page
72. Fleming-Viot particle systems to accelerate optimal policy learning in the presence of costly rare events

Daniel Mastropietro (INP Toulouse, CNRS-IRIT)

6/18/24, 2:30 PM

In this talk we present Fleming-Viot particle systems to increase the efficiency in discovering rare events that have an impact in the learning speed of optimal policies. The approach is used to learn the critic of Actor-Critic policy gradient methods that learn optimal parameters of parameterized policies, giving rise to what we call the FVAC method. We have successfully applied FVAC to two...
Go to contribution page

Building timetable...

Reinforcement Learning for Stochastic Networks, Toulouse

Session

Parallel session: Reinforcement learning for combinatorial problems

A002

ENSEEIHT

Description

Presentation materials

Choose timezone

Reinforcement Learning for Stochastic Networks, Toulouse

Description

Presentation materials