Jun 17 – 21, 2024
ENSEEIHT
Europe/Paris timezone

Session

Parallel session: Some applications of reinforcement learning to networks

Jun 19, 2024, 1:30 PM
A001 (ENSEEIHT)

A001

ENSEEIHT

Description

Organizer and chair: Vivek Borkar

Presentation materials

There are no materials yet.

  1. Dr Konstantin Avrachenkov (INRIA Sophia Antipolis)
    6/19/24, 1:30 PM

    Restless Multi-Armed Bandits (RMABs) are extensively used in scheduling,
    resource allocation, marketing and clinical trials, just to name a few
    application areas. RMABs are Markov Decision Processes with two actions
    (active and passive modes) for each arm and with a constraint on the
    number of active arms per time slot. Since in general RMABs are
    PSPACE-complete, several heuristics such...

    Go to contribution page
  2. Prakirt Jhunjhunwala (Columbia Business School)
    6/19/24, 2:00 PM

    In an unobservable queue, where customers lack the complete wait time information, a throughput-maximizing server aims to exploit the information asymmetry by strategically signaling coarse congestion information to incentivize customers' arrival into the system. The customers make a calculated decision about joining the queue by creating a belief of their utility given the congestion signal...

    Go to contribution page
  3. Shalabh Bhatnagar (Indian Institute of Science)
    6/19/24, 2:30 PM

    The smart grid is comprised of
    different microgrids and is supported by different technologies. The microgrid has to make a lot of decisions, and this has to be automated for efficiency. We have Markov Decision Processes as a frame-
    work for sequential decision-making under uncertainty and
    reinforcement learning techniques to learn the optimal decisions. In this work, we formulate this...

    Go to contribution page
Building timetable...