TY - GEN
T1 - Fully decentralized multi-agent reinforcement learning with networked agents
AU - Zhang, Kaiqing
AU - Yang, Zhuoran
AU - Liu, Han
AU - Zhang, Tong
AU - Başar, Tamer
N1 - Publisher Copyright:
© Copyright 2018 by the author(s). All rights reserved.
PY - 2018
Y1 - 2018
N2 - We consider the fully decentralized multi-agent reinforcement learning (MARL) problem, where the agents are connected via a time-varying and possibly sparse communication network. Specifically, we assume that the reward functions of the agents might correspond to different tasks, and are only known to the corresponding agent. Moreover, each agent makes individual decisions based on both the information observed locally and the messages received from its neighbors over the network. To maximize the globally averaged return over the network, we propose two fully decentralized actor-critic algorithms, which are applicable to large-scale MARL problems in an online fashion. Convergence guarantees are provided when the value functions are approximated within the class of linear functions. Our work appears to be the first theoretical study of fully decentralized MARL algorithms for networked agents that use function approximation.
AB - We consider the fully decentralized multi-agent reinforcement learning (MARL) problem, where the agents are connected via a time-varying and possibly sparse communication network. Specifically, we assume that the reward functions of the agents might correspond to different tasks, and are only known to the corresponding agent. Moreover, each agent makes individual decisions based on both the information observed locally and the messages received from its neighbors over the network. To maximize the globally averaged return over the network, we propose two fully decentralized actor-critic algorithms, which are applicable to large-scale MARL problems in an online fashion. Convergence guarantees are provided when the value functions are approximated within the class of linear functions. Our work appears to be the first theoretical study of fully decentralized MARL algorithms for networked agents that use function approximation.
UR - http://www.scopus.com/inward/record.url?scp=85057269259&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85057269259&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85057269259
T3 - 35th International Conference on Machine Learning, ICML 2018
SP - 9340
EP - 9371
BT - 35th International Conference on Machine Learning, ICML 2018
A2 - Krause, Andreas
A2 - Dy, Jennifer
PB - International Machine Learning Society (IMLS)
T2 - 35th International Conference on Machine Learning, ICML 2018
Y2 - 10 July 2018 through 15 July 2018
ER -