Description
We consider network utility maximization for job admission, routing, and scheduling in a queueing network with unknown job utilities as a type of multi-armed bandit problem. This "Backlogged Bandits" problem is a bandit learning problem with delayed feedback, arising from the end-to-end delay a job incurs waiting in the queue of each node along its path through the network. While recent work has explored techniques for learning under delayed feedback in this setting, such as the parallel-instance technique, we find that the celebrated drift-plus-penalty technique, classically used to optimize queueing networks, already adequately controls the feedback delay in some problem instances. Motivated by this observation, we focus on developing theoretical techniques to analyze this style of algorithm in the Backlogged Bandits framework. In this talk, we present our recent work on the special case of routing in a bipartite (single-hop) queueing network, and discuss the challenges of, and our progress toward, the general multi-hop case.
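For context, the drift-plus-penalty technique mentioned above can be sketched in its standard form (the notation here is illustrative, not taken from the talk): with a quadratic Lyapunov function over queue backlogs, each time slot the controller greedily trades off queue stability against utility.

```latex
% Quadratic Lyapunov function over queue backlogs Q_n(t):
L(Q(t)) = \frac{1}{2} \sum_{n} Q_n(t)^2
% One-slot conditional Lyapunov drift:
\Delta(t) = \mathbb{E}\left[\, L(Q(t+1)) - L(Q(t)) \,\middle|\, Q(t) \,\right]
% Drift-plus-penalty: in each slot, choose admission, routing, and
% scheduling actions to minimize
\Delta(t) - V\, \mathbb{E}\left[\, U(t) \,\middle|\, Q(t) \,\right]
% where U(t) is the utility earned in slot t, and the parameter V > 0
% trades off utility optimality against average queue backlog --
% and hence against the feedback delay experienced by waiting jobs.
```

The role of $V$ is what makes this technique relevant here: smaller backlogs mean jobs traverse the network faster, so utility feedback arrives sooner.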