The Dynamics of Q-learning in Population Games

The Dynamics of Q-learning in Population Games: a Physics-Inspired Continuity Equation Model, Shuyue Hu★, Chin-Wing Leung, Ho-fung Leung, and Harold Soh★, International Conference on Autonomous Agents and Multiagent Systems (AAMAS)
Links: Paper | Github

Although learning has found wide application in multi-agent systems, its effects on the temporal evolution of a system are far from understood.

This paper focuses on the dynamics of Q-learning in large-scale multi-agent systems modeled as population games. We revisit the replicator equation model for Q-learning dynamics and observe that this model is inappropriate for our concerned setting. Motivated by this, we develop a new formal model, which bears a formal connection with the continuity equation in physics. We show that our model always accurately describes the Q-learning dynamics in population games across different initial settings of MASs and game configurations. We also show that our model can be applied to different exploration mechanisms, describe the mean dynamics, and be extended to Q-learning in 2-player and n-player games. Last but not least, we show that our model can provide insights into algorithm parameters and facilitate parameter tuning.

Resources

You can find our paper here.

Citation

Please consider citing our paper if you build upon our results and ideas.

Shuyue Hu★, Chin-Wing Leung, Ho-fung Leung, and Harold Soh★, “The Dynamics of Q-learning in Population Games: a Physics-Inspired Continuity Equation Model”, International Conference on Autonomous Agents and Multiagent Systems (AAMAS)

@inproceedings{hu2022,
author    = {Shuyue Hu and Chin-Wing Leung and Ho-fung Leung and Harold Soh},
title     = {The Dynamics of Q-learning in Population Games: a Physics-Inspired Continuity Equation Model},
year      = {2022},
booktitle = {International Conference on Autonomous Agents and Multiagent Systems (AAMAS)}}

Contact

If you have questions or comments, please contact Shuyue Hu.

Acknowledgements

This research is supported by the National Research Foundation Singapore under its AI Singapore Programme (Award Number: AISG-RP-2019-011).

The Dynamics of Q-learning in Population Games

Resources

Citation

Contact

Acknowledgements

CLeAR

Arena 5.0 – A Photorealistic ROS2 Simulation Framework for Social Navigation with NVIDIA Isaac Gym

GeNIE for In-the-Wild Navigation

Arena 4.0 - A Comprehensive ROS2 Development and Benchmarking Platform for Human-centric Navigation Using Generative-Model-based Environment Generation

CLeAR

Recent posts

We're Hiring!

Arena 5.0 – A Photorealistic ROS2 Simulation Framework for Social Navigation with NVIDIA Isaac Gym

Menu

The Dynamics of Q-learning in Population Games

Resources

Citation

Contact

Acknowledgements

CLeAR

You may also like...

Arena 5.0 – A Photorealistic ROS2 Simulation Framework for Social Navigation with NVIDIA Isaac Gym

GeNIE for In-the-Wild Navigation

Arena 4.0 - A Comprehensive ROS2 Development and Benchmarking Platform for Human-centric Navigation Using Generative-Model-based Environment Generation

We're Hiring!

Arena 5.0 – A Photorealistic ROS2 Simulation Framework for Social Navigation with NVIDIA Isaac Gym