iBet uBet web content aggregator. Adding the entire web to your favor.
iBet uBet web content aggregator. Adding the entire web to your favor.



Link to original content: http://github.com/thu-ml/tianshou/issues/1165
MPO Implementation · Issue #1165 · thu-ml/tianshou · GitHub
Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

MPO Implementation #1165

Open
ziqiao30 opened this issue Jul 4, 2024 · 1 comment
Open

MPO Implementation #1165

ziqiao30 opened this issue Jul 4, 2024 · 1 comment
Labels
new algorithm Adding a new RL algorithm
Milestone

Comments

@ziqiao30
Copy link

ziqiao30 commented Jul 4, 2024

Hi there,

Will you consider implementing MPO in the near future? If I want to add a PBT for hyperparameter tuning, what would you suggest me to do?

Best regards,
ziqiao

@MischaPanch
Copy link
Collaborator

Hi. Yes, that would be on our list, it's a fairly standard algorithm (though it doesn't seem to be of much practical use from what I've seen). We were generally considering to only add new algorithms after the 2.0 release, where some core algorithm abstractions would be refactored, but for MPO we might make an exception. I'll discuss it with the other contributors and will come back to you soon.

@MischaPanch MischaPanch added the new algorithm Adding a new RL algorithm label Jul 6, 2024
@MischaPanch MischaPanch added this to the Release 2.0.0 milestone Sep 8, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
new algorithm Adding a new RL algorithm
Projects
None yet
Development

No branches or pull requests

2 participants