RL Seminar

Martingales and the method of mixture with an application to stochastic linear bandit

Hao Liang (CUHK-SZ)