Analysis of Statistical Forward Planning Methods in Pommerman

  • Diego Perez-Liebana, Queen Mary University of London
  • Raluca D. Gaina, Queen Mary University of London
  • Olve Drageset, Maastricht University
  • Ercüment İlhan, Queen Mary University of London
  • Martin Balla, Queen Mary University of London
  • Simon M. Lucas, Queen Mary University of London

Abstract

Pommerman is a complex multi-player, partially observable game in which agents try to be the last one standing in order to win. This game poses very interesting challenges to AI, such as collaboration, learning and planning. In this paper, we compare two Statistical Forward Planning algorithms, Monte Carlo Tree Search (MCTS) and Rolling Horizon Evolutionary Algorithm (RHEA), in Pommerman. We provide insights into how the agents actually play the game, inspecting their behaviours to explain their performance. Results show that MCTS outperforms RHEA in several game settings, but leave room for multiple avenues of future work: tuning these methods, improving opponent modelling, identifying trap moves and introducing assumptions for partial observability settings.

Published
2019-10-08