Replicating centralized MAPF solvers with MAPF-GPT
Determine how effectively MAPF-GPT, a decentralized imitation-learning policy for multi-agent pathfinding, can replicate the behavior of centralized MAPF solvers other than LaCAM, specifically the optimal Conflict-Based Search (CBS) algorithm.
References
It is also unclear how effectively MAPF-GPT can replicate the behavior of the other existing centralized approaches (such as CBS that is an optimal MAPF solver). This dependence on the type of behavioral expert policy requires further research.
— MAPF-GPT: Imitation Learning for Multi-Agent Pathfinding at Scale
(2409.00134 - Andreychuk et al., 2024) in Appendix: Limitations