Multi-armed Bandit
In probability theory, the multi-armed bandit problem (sometimes called the K–[1] or N-armed bandit problem[2]) is a problem in which a fixed limited set of resources must be allocated between competing (alternative) choices in a way…