Provider: Schloss Dagstuhl - Leibniz Center for Informatics
Database: dblp computer science bibliography
Content:text/plain; charset="utf-8"
TY - Informal or Other Publication
ID - DBLP:journals/corr/abs-1906-06062
AU - Lorberbom, Guy
AU - Maddison, Chris J.
AU - Heess, Nicolas
AU - Hazan, Tamir
AU - Tarlow, Daniel
TI - Direct Policy Gradients: Direct Optimization of Policies in Discrete Action Spaces.
JO - CoRR
VL - abs/1906.06062
PY - 2019//
UR - http://arxiv.org/abs/1906.06062
ER -