Robust multi-agent multi-armed bandits

Daniel Vial, Sanjay Shakkottai, R. Srikant

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Recent works have shown that agents facing independent instances of a stochastic K-armed bandit can collaborate to decrease regret. However, these works assume that each agent always recommends their individual best-arm estimates to other agents, which is unrealistic in envisioned applications (machine faults in distributed computing or spam in social recommendation systems). Hence, we generalize the setting to include n honest and m malicious agents who recommend best-arm estimates and arbitrary arms, respectively. We first show that even with a single malicious agent, existing collaboration-based algorithms fail to improve regret guarantees over a single-agent baseline. We propose a scheme where honest agents learn who is malicious and dynamically reduce communication with (i.e., "block") them. We show that collaboration indeed decreases regret for this algorithm, assuming m is small compared to K but without assumptions on malicious agents' behavior, thus ensuring that our algorithm is robust against any malicious recommendation strategy.

Original languageEnglish (US)
Title of host publicationMobiHoc 2021 - Proceedings of the 2021 22nd International Symposium on Theory, Algorithmic Foundations, and Protocol Design for Mobile Networks and Mobile Computing
PublisherAssociation for Computing Machinery
Pages161-170
Number of pages10
ISBN (Electronic)9781450385589
DOIs
StatePublished - Jul 26 2021
Event22nd International Symposium on Theory, Algorithmic Foundations, and Protocol Design for Mobile Networks and Mobile Computing, MobiHoc 2021 - Shanghai, China
Duration: Jul 26 2021Jul 29 2021

Publication series

NameProceedings of the International Symposium on Mobile Ad Hoc Networking and Computing (MobiHoc)

Conference

Conference22nd International Symposium on Theory, Algorithmic Foundations, and Protocol Design for Mobile Networks and Mobile Computing, MobiHoc 2021
Country/TerritoryChina
CityShanghai
Period7/26/217/29/21

Keywords

  • Adversarial agents
  • Multi-agent systems
  • Multi-armed bandits

ASJC Scopus subject areas

  • Hardware and Architecture
  • Computer Networks and Communications
  • Software

Fingerprint

Dive into the research topics of 'Robust multi-agent multi-armed bandits'. Together they form a unique fingerprint.

Cite this