Boyi Liu et al.: Neural Trust Region/Proximal Policy Optimization Attains Globally Optimal Policy. (2019)conf/nips/LiuCYW19Neural Trust Region/Proximal Policy Optimization Attains Globally Optimal Policy.4Boyi Liu1Qi Cai2Zhuoran Yang3Zhaoran Wang 0001410564-10575NeurIPSNeurIPS20192019provenance information for RDF data of dblp record 'conf/nips/LiuCYW19'2023-12-27T11:36:00+0100