TY - GEN
T1 - Continuous Control of a Soft Continuum Arm using Deep Reinforcement Learning
AU - Satheeshbabu, Sreeshankar
AU - Uppalapati, Naveen K.
AU - Fu, Tianshi
AU - Krishnan, Girish
N1 - Publisher Copyright:
© 2020 IEEE.
PY - 2020/5
Y1 - 2020/5
N2 - Soft Continuum Arms (SCAs) are challenging to control due to their highly nonlinear characteristics and sensitivity to external loading. Recent efforts to address the control problem using machine learning techniques are limited to simple SCA architectures. In this paper, we train a model-free reinforcement learning control policy based on Deep Deterministic Policy Gradient (DDPG) for end effector path tracking on a BR2 SCA. Unlike simple SCA architectures, the BR2 SCA has the functionality to bend and rotate spatially thus leading to enhanced workspace and ability to perform complex tasks. The control policy is first validated in simulations and then implemented on a prototype BR2 with state feedback. An average tracking error less than 3 cm (< diameter of the SCA) is reported using the proposed control policy. The efficacy of the control policy is validated for different loading conditions both in simulations and on the SCA prototype.
AB - Soft Continuum Arms (SCAs) are challenging to control due to their highly nonlinear characteristics and sensitivity to external loading. Recent efforts to address the control problem using machine learning techniques are limited to simple SCA architectures. In this paper, we train a model-free reinforcement learning control policy based on Deep Deterministic Policy Gradient (DDPG) for end effector path tracking on a BR2 SCA. Unlike simple SCA architectures, the BR2 SCA has the functionality to bend and rotate spatially thus leading to enhanced workspace and ability to perform complex tasks. The control policy is first validated in simulations and then implemented on a prototype BR2 with state feedback. An average tracking error less than 3 cm (< diameter of the SCA) is reported using the proposed control policy. The efficacy of the control policy is validated for different loading conditions both in simulations and on the SCA prototype.
UR - http://www.scopus.com/inward/record.url?scp=85088130402&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85088130402&partnerID=8YFLogxK
U2 - 10.1109/RoboSoft48309.2020.9116003
DO - 10.1109/RoboSoft48309.2020.9116003
M3 - Conference contribution
AN - SCOPUS:85088130402
T3 - 2020 3rd IEEE International Conference on Soft Robotics, RoboSoft 2020
SP - 497
EP - 503
BT - 2020 3rd IEEE International Conference on Soft Robotics, RoboSoft 2020
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 3rd IEEE International Conference on Soft Robotics, RoboSoft 2020
Y2 - 15 May 2020 through 15 July 2020
ER -