On the decoupling coefficient analysis
Yingru Li (CUHK-Shenzhen)
Speaker: Yingru Li
Reference:
Zhang, Tong. “Feel-Good Thompson Sampling for Contextual Bandits and Reinforcement Learning.” arXiv preprint arXiv:2110.00871 (2021).