(1) The Gambler's problem and beyond and (2) Off-Policy Evaluation - A Distributionally Robust Approach
Prof. Baoxiang Wang and Jie Wang (CUHK-SZ)
(1) The Gambler’s problem and beyond Slides
(2) Off-Policy Evaluation - A Distributionally Robust Approach Slides