iBet uBet web content aggregator. Adding the entire web to your favor.
iBet uBet web content aggregator. Adding the entire web to your favor.



Link to original content: https://dblp.org/rec/journals/corr/abs-1906-06062.ris
Provider: Schloss Dagstuhl - Leibniz Center for Informatics Database: dblp computer science bibliography Content:text/plain; charset="utf-8" TY - Informal or Other Publication ID - DBLP:journals/corr/abs-1906-06062 AU - Lorberbom, Guy AU - Maddison, Chris J. AU - Heess, Nicolas AU - Hazan, Tamir AU - Tarlow, Daniel TI - Direct Policy Gradients: Direct Optimization of Policies in Discrete Action Spaces. JO - CoRR VL - abs/1906.06062 PY - 2019// UR - http://arxiv.org/abs/1906.06062 ER -