TY - GEN
T1 - Automated Deep Reinforcement Learning Environment for Hardware of a Modular Legged Robot
AU - Ha, Sehoon
AU - Kim, Joohyung
AU - Yamane, Katsu
PY - 2018/8/20
Y1 - 2018/8/20
N2 - In this paper, we present an automated learning environment for developing control policies directly on the hardware of a modular legged robot. This environment facilitates the reinforcement learning process by computing the rewards using a vision-based tracking system and relocating the robot to the initial position using a resetting mechanism. We employ two state-of-the-art deep reinforcement learning (DRL) algorithms, Trust Region Policy Optimization (TRPO) and Deep Deterministic Policy Gradient (DDPG), to train neural network policies for simple rowing and crawling motions. Using the developed environment, we demonstrate both learning algorithms can effectively learn policies for simple locomotion skills on highly stochastic hardware and environments. We further expedite learning by transferring policies learned on a single legged configuration to multi-legged ones.
AB - In this paper, we present an automated learning environment for developing control policies directly on the hardware of a modular legged robot. This environment facilitates the reinforcement learning process by computing the rewards using a vision-based tracking system and relocating the robot to the initial position using a resetting mechanism. We employ two state-of-the-art deep reinforcement learning (DRL) algorithms, Trust Region Policy Optimization (TRPO) and Deep Deterministic Policy Gradient (DDPG), to train neural network policies for simple rowing and crawling motions. Using the developed environment, we demonstrate both learning algorithms can effectively learn policies for simple locomotion skills on highly stochastic hardware and environments. We further expedite learning by transferring policies learned on a single legged configuration to multi-legged ones.
UR - http://www.scopus.com/inward/record.url?scp=85053545215&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85053545215&partnerID=8YFLogxK
U2 - 10.1109/URAI.2018.8442201
DO - 10.1109/URAI.2018.8442201
M3 - Conference contribution
AN - SCOPUS:85053545215
SN - 9781538663349
T3 - 2018 15th International Conference on Ubiquitous Robots, UR 2018
SP - 348
EP - 354
BT - 2018 15th International Conference on Ubiquitous Robots, UR 2018
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 15th International Conference on Ubiquitous Robots, UR 2018
Y2 - 27 June 2018 through 30 June 2018
ER -