Open access
Datum
2021Typ
- Conference Paper
ETH Bibliographie
yes
Altmetrics
Abstract
We present a learning algorithm for training a single policy that imitates multiple gaits of a walking robot. To achieve this, we use and extend MPC-Net, which is an Imitation Learning approach guided by Model Predictive Control (MPC). The strategy of MPC-Net differs from many other approaches since its objective is to minimize the control Hamiltonian, which derives from the principle of optimality. To represent the policies, we employ a mixture-of-experts network (MEN) and observe that the performance of a policy improves if each expert of a MEN specializes in controlling exactly one mode of a hybrid system, such as a walking robot. We introduce new loss functions for single- and multi-gait policies to achieve this kind of expert selection behavior. Moreover, we benchmark our algorithm against Behavioral Cloning and the original MPC implementation on various rough terrain scenarios. We validate our approach on hardware and show that a single learned policy can replace its teacher to control multiple gaits. Mehr anzeigen
Persistenter Link
https://doi.org/10.3929/ethz-b-000476607Publikationsstatus
publishedExterne Links
Buchtitel
2021 IEEE International Conference on Robotics and Automation (ICRA)Seiten / Artikelnummer
Verlag
IEEEKonferenz
Thema
Imitation Learning; Legged Robots; Optimization and Optimal ControlOrganisationseinheit
09570 - Hutter, Marco / Hutter, Marco
Förderung
166232 - Data-driven control approaches for advanced legged locomotion (SNF)
188596 - Perceptive Dynamic Locomotion on Rough Terrain (SNF)
852044 - Learning Mobility for Real Legged Robots (EC)
Anmerkungen
RSLETH Bibliographie
yes
Altmetrics