Transferring Multi-Agent Reinforcement Learning Policies for Autonomous Driving using Sim-to-Real
E. Candela*,
L. Parada*,
L. Marques*, T. Georgescu, Y. Demiris, and P. Angeloudis.
In 35th IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2022.
Autonomous Driving requires high levels of coordination and collaboration between agents. Achieving effective coordination in multi-agent systems is a difficult task
that remains largely unresolved. Multi-Agent Reinforcement
Learning has arisen as a powerful method to accomplish
this task because it considers the interaction between agents
and also allows for decentralized training—which makes it
highly scalable. However, transferring policies from simulation
to the real world is a big challenge, even for single-agent
applications. Multi-agent systems add additional complexities to
the Sim-to-Real gap due to agent collaboration and environment
synchronization. In this paper, we propose a method to transfer
multi-agent autonomous driving policies to the real world. For
this, we create a multi-agent environment that imitates the
dynamics of the Duckietown multi-robot testbed, and train
multi-agent policies using the MAPPO algorithm with different
levels of domain randomization. We then transfer the trained
policies to the Duckietown testbed and show that when using
our method, domain randomization can reduce the reality gap
by 90%. Moreover, we show that different levels of parameter
randomization have a substantial impact on the Sim-to-Real gap.
Finally, our approach achieves significantly better results than
a rule-based benchmark.