Cleanba: A Reproducible and Efficient Distributed Reinforcement Learning Platform

Huang, Shengyi; Weng, Jiayi; Charakorn, Rujikorn; Lin, Min; Xu, Zhongwen; Ontañón, Santiago

arXiv:2310.00036 (cs)

[Submitted on 29 Sep 2023]

Abstract: Distributed Deep Reinforcement Learning (DRL) aims to leverage more computational resources to train autonomous agents in less training time. Despite recent progress in the field, reproducibility issues have not been sufficiently explored. This paper first shows that the typical actor-learner framework can have reproducibility issues even if hyperparameters are controlled. We then introduce Cleanba, a new open-source platform for distributed DRL that proposes a highly reproducible architecture. Cleanba implements highly optimized distributed variants of PPO and IMPALA. Our Atari experiments show that these variants can obtain equivalent or higher scores than strong IMPALA baselines in moolib and torchbeast and the PPO baseline in CleanRL. Moreover, Cleanba variants present 1) shorter training time and 2) more reproducible learning curves in different hardware settings. Cleanba's source code is publicly available.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2310.00036 [cs.LG]
	(or arXiv:2310.00036v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2310.00036

Submission history

From: Shengyi Huang
[v1] Fri, 29 Sep 2023 17:20:07 UTC (19,540 KB)
