Policy gradient methods
Yingru Li (Tencent Joint Lab, CUHK-Shenzhen)
State of the art and applications