Asymptotically efficient adaptive allocation rules for the multiarmed bandit problem with switching cost

Asymptotically efficient adaptive allocation rules for the multiarmed bandit problem with switching cost | doi.page

Citations Over TimeTop 10% of 1988 papers