py/maBandits: matlab and python packages for multi-armed bandits. This package contains a python and a matlab implementation of the most widely used algorithms for multi-armed bandit problems. The purpose of this package is to provide simple environments for comparison and numerical evaluation of policies. Part of the code proposed here was used to produce the Figures included in our bandit papers (referenced below). The python code is provided with some C extensions that make it faster, but configuration-dependent. Some (basic) compilation work is required to use it. However, a plain python version is also included so that these extensions are by no way necessary to run the experiments.
References in zbMATH (referenced in 1 article )
Showing result 1 of 1.
- Cappé, Olivier; Garivier, Aurélien; Maillard, Odalric-Ambrym; Munos, Rémi; Stoltz, Gilles: Kullback-Leibler upper confidence bounds for optimal sequential allocation (2013)