iBet uBet web content aggregator. Adding the entire web to your favor.

Link to original content: https://dblp.dagstuhl.de/rec/journals/corr/abs-2105-14363.rdf
Niladri S. Chatterji, Aldo Pacchiano, Peter L. Bartlett, Michael I. Jordan: On the Theory of Reinforcement Learning with Once-per-Episode Feedback. CoRR abs/2105.14363 (2021)
dblp record: journals/corr/abs-2105-14363
arXiv identifier: 2105.14363
Provenance information for RDF data of dblp record 'journals/corr/abs-2105-14363': 2021-06-02T11:46:42+0200