pymdptoolbox: Markov Decision Process (MDP) Toolbox. The MDP toolbox provides classes and functions for the resolution of descrete-time Markov Decision Processes. The list of algorithms that have been implemented includes backwards induction, linear programming, policy iteration, q-learning and value iteration along with several variations.

Keywords for this software

Anything in here will be replaced on browsers that support the canvas element