Speaker: Yingru Li

Reference:

Zhang, Tong. “Feel-Good Thompson Sampling for Contextual Bandits and Reinforcement Learning.” arXiv preprint arXiv:2110.00871 (2021).