Global optimality guarantees for policy gradient methods
Ziniu Li (CUHK-SZ)
Speaker: Ziniu Li
Title: Global convergence of Policy Gradient Methods in Markov Decision Process
Time: Jun 11 2pm-5pm
Reference:
Bhandari, Jalaj, and Daniel Russo. “Global optimality guarantees for policy gradient methods.” arXiv preprint arXiv:1906.01786 (2019).