TY - JOUR
T1 - Tumbling Robot Control Using Reinforcement Learning
T2 - An Adaptive Control Policy That Transfers Well to the Real World
AU - Schwartzwald, Andrew
AU - Tlachac, Matthew
AU - Guzman, Luis
AU - Bacharis, Athanasios
AU - Papanikolopoulos, Nikolaos
N1 - Publisher Copyright:
© 1994-2011 IEEE.
PY - 2023/6/1
Y1 - 2023/6/1
N2 - Tumbling robots are simple platforms that are able to traverse large obstacles relative to their size, at the cost of being difficult to control. Existing control methods apply only a subset of possible robot motions and make the assumption of flat terrain. Reinforcement learning (RL) allows for the development of sophisticated control schemes that can adapt to diverse environments. By utilizing domain randomization while training in simulation, a robust control policy can be learned that transfers well to the real world. In this article, we implement autonomous set point navigation on a tumbling robot prototype and evaluate it on flat, uneven, and valley-hill terrain. Our results demonstrate that RL-based control policies can generalize well to challenging environments that were not encountered during training. The flexibility of our system demonstrates the viability of nontraditional robots for navigational tasks.
AB - Tumbling robots are simple platforms that are able to traverse large obstacles relative to their size, at the cost of being difficult to control. Existing control methods apply only a subset of possible robot motions and make the assumption of flat terrain. Reinforcement learning (RL) allows for the development of sophisticated control schemes that can adapt to diverse environments. By utilizing domain randomization while training in simulation, a robust control policy can be learned that transfers well to the real world. In this article, we implement autonomous set point navigation on a tumbling robot prototype and evaluate it on flat, uneven, and valley-hill terrain. Our results demonstrate that RL-based control policies can generalize well to challenging environments that were not encountered during training. The flexibility of our system demonstrates the viability of nontraditional robots for navigational tasks.
UR - http://www.scopus.com/inward/record.url?scp=85135759074&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85135759074&partnerID=8YFLogxK
U2 - 10.1109/MRA.2022.3188215
DO - 10.1109/MRA.2022.3188215
M3 - Article
AN - SCOPUS:85135759074
SN - 1070-9932
VL - 30
SP - 86
EP - 95
JO - IEEE Robotics and Automation Magazine
JF - IEEE Robotics and Automation Magazine
IS - 2
ER -