Bandit problems: Sequential allocation of experiments. (1985)

by DA Berry, B Fristedt