Multi-step learning and Value-based approximation methods
Mark Gluzman (iDDA, CUHK-Shenzhen)
See Slides and recorded Video for the lecture on youtube.
$TD(\lambda)$
$LSPE(\lambda)$
$LSTD(\lambda)$