Jun 17 – 21, 2024
ENSEEIHT
Europe/Paris timezone

Multiagent Q-learning with `satisficing' criteria

Jun 18, 2024, 5:00 PM
30m
A002 (ENSEEIHT)

A002

ENSEEIHT

Speaker

Prof. Vivek Borkar (Indian Institute of Technology Bombay)

Description

We consider multiagent Q-learning with each agent having her
own reward function, but all agents influencing the transition
mechanism. By relaxing the exact optimality to a requirement of
`satisficing', modelled as driving the average costs to prescribed
acceptable regions, we propose a scheme that provably achieves this.

Authors

Mr Keshav Patel Keval (Indian Institute of Technology Bombay) Prof. Vivek Borkar (Indian Institute of Technology Bombay)

Presentation materials

There are no materials yet.