Created by: crypto-jeronimo
What is this Python project?
The MDP toolbox provides classes and functions for the resolution of descrete-time Markov Decision Processes. The list of algorithms that have been implemented includes backwards induction, linear programming, policy iteration, q-learning and value iteration along with several variations.
What's the difference between this Python project and similar ones?
- Eight MDP algorithms implemented.
- Fast array manipulation using NumPy.
- Full sparse matrix support using SciPy's sparse package.
- Optional linear programming support using cvxopt.
--
Anyone who agrees with this pull request could vote for it by adding a