Reinforcement learning on Upkies

A project log for Upkie wheeled biped robots

Wheeled biped robots with fully open source hardware and software.

Upkie wheeled bipeds • 12/09/2023 at 15:53•0 Comments

The latest release of Upkie's software brings a functional reinforcement learning pipeline with sim-to-real transfer. The pipeline is based on Stable Baselines3, with standard sim-to-real tricks that could very well work on other wheeled biped robots. The pipeline trains on the Gymnasium environments in upkie.envs (pip-installable from PyPI) and is implemented in the PPO balancer. Here is a policy trained in Bullet and running on a real Upkie:

There is also a usage video showing how to run the pipeline:

Hoping this helps newcomers get started with reinforcement learning on real robots!

Previous log

Blender project

11/12/2023 at 10:36 • 0 comments

Next log

Model predictive control on Upkies

12/13/2024 at 09:15 • 0 comments

Reinforcement learning on Upkies

Blender project

Model predictive control on Upkies

Discussions

Become a Hackaday.io Member