Reinforcement-learning-OpenAI-Gym
Implementation of dynamic-programming (DP) based policy iteration and value iteration, and of tabular Q-learning, on the Taxi-v3 environment of the OpenAI Gym toolkit. Also includes Q-learning with value function approximation for the mountain car and cart-pole environments of Gym.
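
As a rough illustration of the DP part, here is a minimal value iteration sketch. It is not the code from this repository. It assumes a transition table in the form `P[state][action] = [(prob, next_state, reward, done), ...]`, which is the layout Gym's Taxi environment exposes as `env.unwrapped.P`. The three-state `TOY_P` table and the names `action_value` and `value_iteration` are hypothetical and exist only to keep the example self-contained and free of a Gym dependency.

```python
# Hypothetical toy MDP: states 0 and 1 are non-terminal, state 2 is terminal.
# Action 0 moves left, action 1 moves right. Layout mirrors Gym's `P` table:
# P[state][action] = [(probability, next_state, reward, done), ...]
TOY_P = {
    0: {0: [(1.0, 0, -1.0, False)], 1: [(1.0, 1, -1.0, False)]},
    1: {0: [(1.0, 0, -1.0, False)], 1: [(1.0, 2, 10.0, True)]},
    2: {0: [(1.0, 2, 0.0, True)], 1: [(1.0, 2, 0.0, True)]},
}


def action_value(P, V, state, action, gamma):
    """Expected one-step return of taking `action` in `state` under values V."""
    total = 0.0
    for prob, next_state, reward, done in P[state][action]:
        # No bootstrapping from the next state once the episode has ended.
        future = 0.0 if done else V[next_state]
        total += prob * (reward + gamma * future)
    return total


def value_iteration(P, gamma=0.9, theta=1e-8):
    """Sweep Bellman optimality backups in place until the largest change < theta."""
    V = {s: 0.0 for s in P}
    while True:
        delta = 0.0
        for s in P:
            best = max(action_value(P, V, s, a, gamma) for a in P[s])
            delta = max(delta, abs(best - V[s]))
            V[s] = best
        if delta < theta:
            break
    # Greedy policy with respect to the converged values.
    policy = {s: max(P[s], key=lambda a: action_value(P, V, s, a, gamma)) for s in P}
    return V, policy


if __name__ == "__main__":
    values, greedy_policy = value_iteration(TOY_P)
    print(values)
    print(greedy_policy)
```

With a Gym installation, the same function could in principle be called on the Taxi transition table in place of `TOY_P`. Q-learning differs in that it estimates action values from sampled transitions instead of reading the model `P` directly.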