Tuning Legged Locomotion Controllers via Safe Bayesian Optimization

7th Annual Conference on Robot Learning (CoRL 2023)

Abstract

This paper presents a data-driven strategy to streamline the deployment of model-based controllers in legged robotic hardware platforms. Our approach leverages a model-free safe learning algorithm to automate the tuning of control gains, addressing the mismatch between the simplified model used in the control formulation and the real system. This method substantially mitigates the risk of hazardous interactions with the robot by sample-efficiently optimizing parameters within a probably safe region. Additionally, we extend the applicability of our approach to incorporate the different gait parameters as contexts, leading to a safe, sample-efficient exploration algorithm capable of tuning a motion controller for diverse gait patterns. We validate our method through simulation and hardware experiments, where we demonstrate that the algorithm obtains superior performance on tuning a model-based motion controller for multiple gaits safely.

Paper: [PMLR] Open access: [ArXiv] Code: [GitHub]

Full Supplementary Video

Bibtex

@InProceedings{pmlr-v229-widmer23a,
  title = {Tuning Legged Locomotion Controllers via Safe Bayesian Optimization},
  author = {Widmer, Daniel and Kang, Dongho and Sukhija, Bhavya and H\"{u}botter, Jonas and Krause, Andreas and Coros, Stelian},
  booktitle = {Proceedings of The 7th Conference on Robot Learning},
  pages = {2444--2464},
  year = {2023},
  editor = {Tan, Jie and Toussaint, Marc and Darvish, Kourosh},
  volume = {229},
  series = {Proceedings of Machine Learning Research},
  month = {06--09 Nov},
  publisher = {PMLR}
}

Acknowledgment

We would like to thank Lenart Treven and Flavio De Vincenti for their feedback on this work.

This project has received funding from the Swiss National Science Foundation under NCCR Automation, grant agreement 51NF40 180545, the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme, grant agreement No. 866480, and the Microsoft Swiss Joint Research Center.