Multiagent, Multitarget Path Planning in Markov Decision Processes

Farhad Nawaz, Melkior Ornik

Research output: Contribution to journalArticlepeer-review

Abstract

Missions for autonomous systems often require agents to visit multiple targets in complex operating conditions. This work considers the problem of visiting a set of targets in minimum time by a team of noncommunicating agents in a Markov decision process (MDP). The single-agent problem is at least NP-complete by reducing it to a Hamiltonian path problem. We first discuss an optimal algorithm based on Bellman's optimality equation that is exponential in the number of target states. Then, we tradeoff optimality for time complexity by presenting a suboptimal algorithm that is polynomial at each time step. We prove that the proposed algorithm generates optimal policies for certain classes of MDPs. Extending our procedure to the multiagent case, we propose a target partitioning algorithm that approximately minimizes the expected time to visit the targets. We prove that our algorithm generates optimal partitions for clustered target scenarios. We present the performance of our algorithms on random MDPs and gridworld environments inspired by ocean dynamics. We show that our algorithms are much faster than the optimal procedure and more optimal than the currently available heuristic.

Original languageEnglish (US)
Pages (from-to)7560-7574
Number of pages15
JournalIEEE Transactions on Automatic Control
Volume68
Issue number12
DOIs
StatePublished - Dec 1 2023

Keywords

  • Agents and Autonomous systems
  • Clustering algorithms
  • Costs
  • Graph partitioning
  • Heuristic algorithms
  • Markov processes
  • Partitioning algorithms
  • Planning
  • Stochastic systems
  • Task analysis
  • Agents and autonomous systems
  • stochastic systems
  • graph partitioning

ASJC Scopus subject areas

  • Control and Systems Engineering
  • Computer Science Applications
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Multiagent, Multitarget Path Planning in Markov Decision Processes'. Together they form a unique fingerprint.

Cite this