Reinforcement learning based path planning of the mobile agents with constrained locomotion for the material handling applications

Published in 2020 IEEE 4th Conference on Information & Communication Technology (CICT), Chennai, India, 2020

This paper presents the intelligent path planning model of the mobile base agent of SwarmItFIX robot with novel Swing and Dock (SaD) locomotion for material handling/transfer applications. In this work, the Markov Decision Process (MDP) path planning problem of SaD agent is solved using two Reinforcement Learning (RL) based dynamic programming methods viz Policy Iteration (PI), and Value Iteration (VI). Being tested with 16 different test cases, both the algorithms return the optimal sequence of steps with reduced makespan for the mobile agents to reach the goal positions positively. The results of both the methods were compared with each other in the section V, and found to be convincing. Hence the proposed control scheme is being implemented in the SwarmItFIX setup available at the University of Genova, Italy.

Recommended citation: Veeramani, Satheeshkumar, and Sreekumar Muthuswamy. "Reinforcement learning based path planning of the mobile agents with constrained locomotion for the material handling applications." 2020 IEEE 4th Conference on Information & Communication Technology (CICT). IEEE, 2020.
Download Paper