Application of Deep Reinforcement Learning for Tracking Control of 3WD Omnidirectional Mobile Robot

Authors

  • Atif Mehmood UET, Taxila
  • Inam ul Hasan Shaikh
  • Ahsan Ali

DOI:

https://doi.org/10.5755/j01.itc.50.3.25979

Keywords:

3WD-Omnidirectional mobile robot, Deep Reinforcement Learning (DRL), Deep Deterministic Policy Gradient (DDPG), Reinforcement Learning Toolbox (RL toolbox)

Abstract

Deep reinforcement learning, the fastest growing technique, to solve real-world complex problems by creating
a simple mathematical framework. It includes an agent, action, environment, and a reward. An agent will interact
with the environment, takes an optimal action aiming to maximize the total reward. This paper proposes
the compelling technique of deep deterministic policy gradient for solving the complex continuous action
space of 3-wheeled omnidirectional mobile robots. Three-wheeled Omnidirectional mobile robots tracking is
a difficult task because of the orientation of the wheels which makes it rotate around its own axis rather to
follow the trajectory. A deep deterministic policy gradient (DDPG) algorithm has been designed to train in environments
with continuous action space to follow the trajectory by training the neural networks defined for
the policy and value function to maximize the reward function defined for the tracking of the trajectory. DDPG
agent environment is created in the Reinforcement learning toolbox in MATLAB 2019 while for Actor and critic
network design deep neural network designer is used. Results are shown to illustrate the effectiveness of the
technique with a convergence of error approximately to zero.

Author Biographies

Atif Mehmood, UET, Taxila

Atif Mehmood received the B.Sc. degree in electrical engineering from the University of Lahore, Lahore, Pakistan. He is currently pursuing the M.Sc. degree with the University of Engineering and Technology, Taxila, Pakistan. His research interests include machine learning, robotics, myo electric prosthetic and control design.

Inam ul Hasan Shaikh

INAM UL HASAN SHAIKH received the Ph.D. degree in electrical engineering from The University of Manchester, U.K. He is currently an Assistant Professor with the Department of Electrical Engineering, University of Engineering and Technology, Taxila, Pakistan. His research interests include iterative learning control, H1, and LPV control.

Ahsan Ali

AHSAN ALI received the Ph.D. degree in control systems from TU Hamburg-Harburg, Germany. He is currently an Assistant Professor with the Department of Electrical Engineering, University of Engineering and Technology, Taxila, Pakistan. His research interests include machine learning, model identification, LPV modeling, and control design.

Downloads

Published

2021-09-24

Issue

Section

Articles