CaT: Constraints as Terminations for Legged Locomotion Reinforcement Learning

Chane-Sane, Elliot; Leziart, Pierre-Alexandre; Flayols, Thomas; Stasse, Olivier; Souères, Philippe; Mansard, Nicolas

arXiv:2403.18765 (cs)

[Submitted on 27 Mar 2024]

Abstract: Deep Reinforcement Learning (RL) has demonstrated impressive results in solving complex robotic tasks such as quadruped locomotion. Yet, current solvers fail to produce efficient policies respecting hard constraints. In this work, we advocate for integrating constraints into robot learning and present Constraints as Terminations (CaT), a novel constrained RL algorithm. Departing from classical constrained RL formulations, we reformulate constraints through stochastic terminations during policy learning: any violation of a constraint triggers a probability of terminating potential future rewards the RL agent could attain. We propose an algorithmic approach to this formulation, by minimally modifying widely used off-the-shelf RL algorithms in robot learning (such as Proximal Policy Optimization). Our approach leads to excellent constraint adherence without introducing undue complexity and computational overhead, thus mitigating barriers to broader adoption. Through empirical evaluation on the real quadruped robot Solo crossing challenging obstacles, we demonstrate that CaT provides a compelling solution for incorporating constraints into RL frameworks. Videos and code are available at this https URL.
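
The stochastic-termination idea in the abstract can be illustrated with a short Python sketch. It is not the authors' implementation: the function names, the clipped linear mapping from violation to termination probability, the `p_max` cap, and the choice to discount only future rewards by the survival probability are all assumptions made for illustration.

```python
from typing import List, Sequence


def termination_probability(
    violations: Sequence[float],
    max_violations: Sequence[float],
    p_max: float = 1.0,
) -> float:
    """Map constraint values to one termination probability in [0, p_max].

    Assumed convention (hypothetical): a constraint is satisfied when its
    value is <= 0, and each entry of max_violations is a positive scale.
    """
    delta = 0.0
    for c, c_max in zip(violations, max_violations):
        # Only positive values count as violations; normalise and clip to [0, 1].
        ratio = min(max(c, 0.0) / c_max, 1.0)
        # Keep the largest probability across all constraints.
        delta = max(delta, p_max * ratio)
    return delta


def discounted_returns(
    rewards: Sequence[float],
    deltas: Sequence[float],
    gamma: float = 0.99,
) -> List[float]:
    """Expected returns when step t cuts off future rewards with probability deltas[t]."""
    returns = [0.0] * len(rewards)
    future = 0.0
    for t in reversed(range(len(rewards))):
        # Future rewards survive step t with probability (1 - deltas[t]).
        future = rewards[t] + gamma * (1.0 - deltas[t]) * future
        returns[t] = future
    return returns


if __name__ == "__main__":
    # The second constraint is violated by half its scale; the first is satisfied.
    delta = termination_probability([-1.0, 0.5], [1.0, 1.0])
    print(delta)
    print(discounted_returns([1.0, 1.0, 1.0], [0.0, delta, 0.0], gamma=0.5))
```

In this sketch a larger violation shrinks the return the agent can expect afterwards, so an unmodified policy-gradient learner such as PPO is pushed away from violating states without a separate penalty term or Lagrange multiplier.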

Comments:	Project webpage: this https URL
Subjects:	Robotics (cs.RO); Machine Learning (cs.LG)
Cite as:	arXiv:2403.18765 [cs.RO]
	(or arXiv:2403.18765v1 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2403.18765

Submission history

From: Elliot Chane-Sane
[v1] Wed, 27 Mar 2024 17:03:31 UTC (11,633 KB)
