Reinforcement Learning for Stochastic Networks, Toulouse

Name: Reinforcement Learning for Stochastic Networks, Toulouse
Start: 2024-06-17T09:00:00+02:00
End: 2024-06-21T18:00:00+02:00
Location: ENSEEIHT

Jun 17 – 21, 2024

Europe/Paris timezone

Session

Parallel session: Multi-agent systems

Jun 18, 2024, 4:00 PM

A002 (ENSEEIHT)

57. Backlogged Bandits for Network Utility Maximization

Juaren Steiger (Queen's University)

6/18/24, 4:00 PM

We consider network utility maximization for job admission, routing, and scheduling in a queueing network with unknown job utilities as a type of multi-armed bandit problem. This "Backlogged Bandit" problem is a bandit learning problem with delayed feedback due to the end-to-end delay of a job waiting in the queue of each node in its path through the network. While recent work has explored...
24. Interpersonal trust: An asymptotic analysis of a stochastic coordination game with multi-agent learning

Benedikt Meylahn (Korteweg-de Vries Institute for Mathematics, University of Amsterdam)

6/18/24, 4:30 PM

We study the interpersonal trust of a population of agents, asking whether chance may decide if a population ends up in a high-trust or low-trust state. We model this by a discrete-time, random-matching stochastic coordination game. Agents are endowed with an exponential smoothing learning rule about the behaviour of their neighbours. We find that, with probability one in the long run the...
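
As a rough illustration of what an exponential smoothing learning rule looks like in general, the sketch below blends an agent's current estimate with each new observation. It is not the authors' model: the names `smooth_estimate` and `run_updates`, the smoothing weight `alpha`, and the coding of observed behaviour as 1.0 (trusting) or 0.0 (not trusting) are all assumptions made for this example.

```python
from typing import Iterable


def smooth_estimate(estimate: float, observation: float, alpha: float) -> float:
    """One exponential-smoothing step (hypothetical helper).

    The new estimate is a convex combination of the old estimate and
    the latest observation, weighted by alpha in [0, 1].
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    return (1.0 - alpha) * estimate + alpha * observation


def run_updates(initial: float, observations: Iterable[float], alpha: float) -> float:
    """Apply the smoothing step to a sequence of observations in order."""
    estimate = initial
    for obs in observations:
        estimate = smooth_estimate(estimate, obs, alpha)
    return estimate


# Assumed coding: 1.0 = neighbour behaved in a trusting way, 0.0 = did not.
final_estimate = run_updates(initial=0.0, observations=[1.0, 1.0], alpha=0.5)
```

With a larger `alpha` the estimate tracks recent behaviour more closely; with a smaller `alpha` it retains more of the past.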
23. Multiagent Q-learning with 'satisficing' criteria

Prof. Vivek Borkar (Indian Institute of Technology Bombay)

6/18/24, 5:00 PM

We consider multiagent Q-learning with each agent having her own reward function, but all agents influencing the transition mechanism. By relaxing the exact optimality to a requirement of 'satisficing', modelled as driving the average costs to prescribed acceptable regions, we propose a scheme that provably achieves this.