Slides

Speaker: Ziniu Li

Title: Global convergence of Policy Gradient Methods in Markov Decision Process

Time: Jun 11 2pm-5pm

Reference:

Bhandari, Jalaj, and Daniel Russo. “Global optimality guarantees for policy gradient methods.” arXiv preprint arXiv:1906.01786 (2019).