28 juillet 2025 à 1 août 2025
Fuseau horaire Europe/Paris

Reinforcement Learning Methods for Risk-Sensitive Sequential Decision Making

1 août 2025, 09:15
1h
Caquot

Caquot

Plenary talk Stochastic Programming Plenary session

Orateur

Erick Delage (HEC Montréal)

Description

This talk surveys recent developments in reinforcement learning (RL) methods for risk-aware model-free decision-making in Markov decision processes (MDPs). In the discounted setting, we adapt two popular risk neutral RL methods to account for risk aversion. The first approach minimizes a dynamic utility-based shortfall risk measure, while the other optimizes a specific quantile of the total discounted cost. We then present an RL framework for average-cost MDPs that incorporates dynamic risk measures. Together, these contributions represent a significant step toward scalable, risk-aware, model-free, sequential decision-making methods. The presentation will highlight the theoretical motivations, convergence guarantees, and empirical performance of these algorithms, offering insights into their applicability in finance and beyond.

Author

Erick Delage (HEC Montréal)

Co-auteurs

Esther Derman (Université de Montréal) M. Mohammad Ghavamzadeh (Amazon) M. Jia Lin Hau (University of New Hampshire) Jonathan Li (Telfer School of Management, University of Ottawa) Saeed Marzban (HEC Montréal) M. Marek Petrik (University of New Hampshire) Weikai Wang (HEC Montréal)

Documents de présentation