Peter L. Bartlett, Jonathan Baxter, Lex Weaver: Experiments with Infinite-Horizon, Policy-Gradient Estimation. CoRR abs/1106.0666 (2011)
dblp record 'journals/corr/abs-1106-0666' (provenance information for RDF data: 2018-08-13T16:47:19+0200)