RLPy: a value-function-based reinforcement learning framework for education and research. RLPy is an object-oriented reinforcement learning software package with a focus on value-function-based methods using linear function approximation and discrete actions. The framework was designed for both educational and research purposes. It provides a rich library of fine-grained, easily exchangeable components for learning agents (e.g., policies or representations of value functions), facilitating recently increased specialization in reinforcement learning. RLPy is written in Python to allow fast prototyping, but is also suitable for large-scale experiments through its built-in support for optimized numerical libraries and parallelization. Code profiling, domain visualizations, and data analysis are integrated in a self-contained package available under the Modified BSD License at url {http://github.com/rlpy/rlpy}. All of these properties allow users to compare various reinforcement learning algorithms with little effort.

Keywords for this software

Anything in here will be replaced on browsers that support the canvas element