Provider: Schloss Dagstuhl - Leibniz Center for Informatics
Database: dblp computer science bibliography
Content:text/plain; charset="utf-8"
TY - CPAPER
ID - DBLP:conf/cdc/ZhangKZB19
AU - Zhang, Kaiqing
AU - Koppel, Alec
AU - Zhu, Hao
AU - Basar, Tamer
TI - Convergence and Iteration Complexity of Policy Gradient Method for Infinite-horizon Reinforcement Learning.
BT - 58th IEEE Conference on Decision and Control, CDC 2019, Nice, France, December 11-13, 2019
SP - 7415
EP - 7422
PY - 2019//
DO - 10.1109/CDC40024.2019.9030265
UR - https://doi.org/10.1109/CDC40024.2019.9030265
ER -