Reinforcement Learning for Stochastic Networks, Toulouse

Name: Reinforcement Learning for Stochastic Networks, Toulouse
Start: 2024-06-17T09:00:00+02:00
End: 2024-06-21T18:00:00+02:00
Location: ENSEEIHT

Jun 17 – 21, 2024

Europe/Paris timezone

Multiagent Q-learning with 'satisficing' criteria

Jun 18, 2024, 5:00 PM

30m

A002 (ENSEEIHT)

Parallel session: Multi-agent systems

Speaker: Prof. Vivek Borkar (Indian Institute of Technology Bombay)

We consider multiagent Q-learning in which each agent has her
own reward function, while all agents jointly influence the transition
mechanism. Relaxing exact optimality to a requirement of
'satisficing', modelled as driving the average costs into prescribed
acceptable regions, we propose a scheme that provably achieves this.
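
One possible way to write the satisficing requirement down is sketched below. The notation is hypothetical and is not taken from the talk: X_n is the state, A_n the joint action of K agents, c_i the one-step cost of agent i, and C_i her prescribed acceptable region. The mode of convergence is left unspecified, since the abstract does not state it.

```latex
% Hypothetical notation, not taken from the talk:
%   X_n : state at time n
%   A_n = (A_n^1, ..., A_n^K) : joint action of the K agents
%   c_i : one-step cost of agent i
%   C_i : prescribed acceptable region for agent i's average cost
\[
  \operatorname{dist}\!\Big( \frac{1}{N} \sum_{n=0}^{N-1} c_i(X_n, A_n),\; C_i \Big)
  \longrightarrow 0
  \quad \text{as } N \to \infty, \qquad i = 1, \dots, K.
\]
```

Under this reading, no agent needs to minimise her average cost; each only needs it to end up inside her own region C_i.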

Authors: Mr Keshav Patel Keval (Indian Institute of Technology Bombay); Prof. Vivek Borkar (Indian Institute of Technology Bombay)
